Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Marked a few tensor operations as read only | 2016-05-05 | |
| | |||
* | Deleted trailing commas | 2016-04-29 | |
| | |||
* | Added missing definition of PacketSize in the gpu evaluator of convolution | 2016-04-14 | |
| | |||
* | Eigen cost model part 1. This implements a basic recursive framework to ↵ | 2016-04-14 | |
| | | | | estimate the cost of evaluating tensor expressions. | ||
* | Decoupled the packet type definition from the definition of the tensor ops. ↵ | 2016-03-08 | |
| | | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit. | ||
* | Fixed a number of compilation warnings generated by the cuda tests | 2016-01-31 | |
| | |||
* | Record whether the underlying tensor storage can be accessed directly during ↵ | 2016-01-19 | |
| | | | | the evaluation of an expression. | ||
* | Use numext::mini/numext::maxi instead of std::min/std::max in the tensor code | 2015-08-28 | |
| | |||
* | Many files were missing in previous changeset. | 2015-07-29 | |
| | |||
* | Added support for multi gpu configuration to the GpuDevice class | 2015-07-15 | |
| | |||
* | Misc improvements and optimizations | 2015-07-01 | |
| | |||
* | Enabled the vectorized evaluation of several tensor expressions that was ↵ | 2015-07-01 | |
| | | | | previously disabled by mistake | ||
* | Fixed a compilation error triggered by nvcc 7 | 2015-05-28 | |
| | |||
* | Added support for convolution of tensors laid out in RowMajor mode | 2015-03-31 | |
| | |||
* | Silenced the last batch of compilation warnings triggered by gcc 4.8 | 2015-02-10 | |
| | |||
* | Improved support for RowMajor tensors | 2015-01-14 | |
| | | | | Misc fixes and API cleanups. | ||
* | Silenced a few compilation warnings | 2014-10-16 | |
| | | | | Generalized a TensorMap constructor | ||
* | Misc improvements and cleanups | 2014-10-13 | |
| | |||
* | Improved the performance of the tensor convolution code by a factor of about 4. | 2014-09-03 | |
| | |||
* | Improved the speed of convolutions when running on cuda devices | 2014-08-19 | |
| | |||
* | Fixed misc typos. | 2014-08-13 | |
| | |||
* | Added missing apis. | 2014-08-13 | |
| | |||
* | Updated the convolution and contraction evaluators to follow the new ↵ | 2014-08-13 | |
| | | | | EvalSubExprsIfNeeded apu. | ||
* | Silenced a compilation warning | 2014-06-13 | |
| | |||
* | Reworked the expression evaluation mechanism in order to make it possible to ↵ | 2014-06-13 | |
| | | | | | | | | efficiently compute convolutions and contractions in the future: * The scheduling of computation is moved out the the assignment code and into a new TensorExecutor class * The assignment itself is now a regular node on the expression tree * The expression evaluators start by recursively evaluating all their subexpressions if needed | ||
* | Fixed a few compilation errors. | 2014-06-10 | |
| | |||
* | TensorEval are now typed on the device: this will make it possible to use ↵ | 2014-06-10 | |
| | | | | | | partial template specialization to optimize the strategy of each evaluator for each device type. Started work on partial evaluations. | ||
* | Improved support for rvalues in tensor expressions. | 2014-06-09 | |
| | |||
* | Added support for convolution and reshaping of tensors. | 2014-06-06 | |