Commit message | Author | Age | |
---|---|---|---|
* | Allow vectorized padding on GPU. This helps speed things up a little | 2016-05-17 | |
| | | | | Before: BM_padding/10 5000000 460 217.03 MFlops/s; BM_padding/80 5000000 460 13899.40 MFlops/s; BM_padding/640 5000000 461 888421.17 MFlops/s; BM_padding/4K 5000000 460 54316322.55 MFlops/s. After: BM_padding/10 5000000 454 220.20 MFlops/s; BM_padding/80 5000000 455 14039.86 MFlops/s; BM_padding/640 5000000 452 904968.83 MFlops/s; BM_padding/4K 5000000 411 60750049.21 MFlops/s. | ||
* | Fixed compilation errors triggered by old versions of gcc | 2016-05-12 | |
* | Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions. | 2016-04-14 | |
* | Decoupled the packet type definition from the definition of the tensor ops. All the vectorization is now defined in the tensor evaluators. This will make it possible to reliably support devices with different packet types in the same compilation unit. | 2016-03-08 | |
* | Added the ability to pad a tensor using a non-zero value | 2016-03-07 | |
* | Updated the padding code to work with half floats | 2016-02-23 | |
* | Record whether the underlying tensor storage can be accessed directly during the evaluation of an expression. | 2016-01-19 | |
* | Added static assertions to avoid misuses of padding, broadcasting and concatenation ops. | 2015-11-06 | |
* | Many files were missing in previous changeset. | 2015-07-29 | |
* | Fix shadow warnings in Tensor module | 2015-06-16 | |
* | Silenced the last batch of compilation warnings triggered by gcc 4.8 | 2015-02-10 | |
* | Improved support for RowMajor tensors | 2015-01-14 | |
| | | | | Misc fixes and API cleanups. | ||
* | Use the proper index type in the padding code | 2014-10-30 | |
* | Silenced a few compilation warnings | 2014-10-16 | |
| | | | | Generalized a TensorMap constructor | ||
* | Misc improvements and cleanups | 2014-10-13 | |
* | Misc fixes. | 2014-09-05 | |
* | Optimized the tensor padding code. | 2014-08-26 | |
* | Added support for padding, striding, and shuffling | 2014-08-14 | |