Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Enable the use of the packet api to evaluate tensor broadcasts. This speed ↵ | 2016-05-17 | |
| | | | | | | | | | | | | | | | things up quite a bit: Before" M_broadcasting/10 500000 3690 27.10 MFlops/s BM_broadcasting/80 500000 4014 1594.24 MFlops/s BM_broadcasting/640 100000 14770 27731.35 MFlops/s BM_broadcasting/4K 5000 632711 39512.48 MFlops/s After: BM_broadcasting/10 500000 4287 23.33 MFlops/s BM_broadcasting/80 500000 4455 1436.41 MFlops/s BM_broadcasting/640 200000 10195 40173.01 MFlops/s BM_broadcasting/4K 5000 423746 58997.57 MFlops/s | ||
* | Fixed syntax error | 2016-05-16 | |
| | |||
* | Fixed compilation errors triggered by old versions of gcc | 2016-05-12 | |
| | |||
* | Eigen cost model part 1. This implements a basic recursive framework to ↵ | 2016-04-14 | |
| | | | | estimate the cost of evaluating tensor expressions. | ||
* | Decoupled the packet type definition from the definition of the tensor ops. ↵ | 2016-03-08 | |
| | | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit. | ||
* | Record whether the underlying tensor storage can be accessed directly during ↵ | 2016-01-19 | |
| | | | | the evaluation of an expression. | ||
* | Fixed a typo. | 2016-01-06 | |
| | |||
* | Optimized the performance of broadcasting of scalars. | 2016-01-06 | |
| | |||
* | Simplified more of the IndexList code. | 2015-11-12 | |
| | |||
* | Added static assertions to avoid misuses of padding, broadcasting and ↵ | 2015-11-06 | |
| | | | | concatenation ops. | ||
* | Many files were missing in previous changeset. | 2015-07-29 | |
| | |||
* | Fixed several compilation warnings reported by clang | 2015-02-25 | |
| | |||
* | Silcenced a few compilation warnings | 2015-02-10 | |
| | |||
* | Improved support for RowMajor tensors | 2015-01-14 | |
| | | | | Misc fixes and API cleanups. | ||
* | Fixed compilation errors with clang. | 2014-11-13 | |
| | | | | H: Enter commit message. Lines beginning with 'HG:' are removed. | ||
* | Optimized broadcasting | 2014-11-12 | |
| | |||
* | Silenced a few compilation warnings | 2014-10-16 | |
| | | | | Generalized a TensorMap constructor | ||
* | Misc improvements and cleanups | 2014-10-13 | |
| | |||
* | Added support for broadcasting | 2014-08-20 | |