Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Fix CUDA compilation | 2016-07-21 | |
| | |||
* | Added tests for full contractions using thread pools and gpu devices. | 2016-05-05 | |
| | | | | Fixed a couple of issues in the corresponding code. | ||
* | Fixed compilation error | 2016-05-01 | |
| | |||
* | Fixed contractions of 16 bit floats | 2016-03-22 | |
| | |||
* | Decoupled the packet type definition from the definition of the tensor ops. ↵ | 2016-03-08 | |
| | | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit. | ||
* | Updated the contraction code to make it compatible with half floats. | 2016-02-19 | |
| | |||
* | Reverted a previous change that tripped nvcc when compiling in debug mode. | 2016-01-11 | |
| | |||
* | Silenced a few compilation warnings. | 2016-01-11 | |
| | |||
* | Deleted unused variable. | 2015-11-23 | |
| | |||
* | Fixed a compilation warning | 2015-11-12 | |
| | |||
* | Moved some utilities to TensorMeta.h to make it easier to reuse them accross ↵ | 2015-06-29 | |
| | | | | | | several tensor operations. Created the TensorDimensionList class to encode the list of all the dimensions of a tensor of rank n. This could be done using TensorIndexList, however TensorIndexList require cxx11 which isn't yet supported as widely as we'd like. | ||
* | Added support for RowMajor inputs to the contraction code. | 2015-01-14 | |
| | |||
* | Improved the performance of the contraction code on CUDA | 2015-01-14 | |
| | |||
* | Improved support for CUDA devices. | 2014-10-03 | |
Improved contractions on GPU |