| | Commit message | Author | Age |
---|---|---|---|
* | Fix rule-of-3 for the Tensor module. | Antonio Sanchez | 2020-11-18 |
| | Adds copy constructors to Tensor ops, inherits assignment operators from `TensorBase`. Addresses #1863. | | |
* | Add block evaluation V2 to TensorAsyncExecutor. | Rasmus Munk Larsen | 2019-10-22 |
| | Add async evaluation to a number of ops. | | |
* | Allow move-only done callback in TensorAsyncDevice | Eugene Zhulenev | 2019-09-03 |
* | Asynchronous expression evaluation with TensorAsyncDevice | Eugene Zhulenev | 2019-08-30 |
* | Deleted redundant code | Benoit Steiner | 2015-12-03 |
* | Refined the `#ifdef __CUDACC__` guard to ensure that trying to compile GPU code with a non-CUDA compiler results in a linking error instead of bogus code. | Benoit Steiner | 2015-10-23 |
* | Avoid making an unnecessary copy of the tensor expression when evaluating it on a GPU device | Benoit Steiner | 2015-03-25 |
* | Cleaned up the TensorDevice code a little bit. | Benoit Steiner | 2015-03-25 |
* | Added the -= operator to the device classes | Benoit Steiner | 2015-03-19 |
* | Misc improvements and cleanups | Benoit Steiner | 2014-10-13 |
* | Reworked the expression evaluation mechanism in order to make it possible to efficiently compute convolutions and contractions in the future | Benoit Steiner | 2014-06-13 |
| | The scheduling of computation is moved out of the assignment code and into a new TensorExecutor class. The assignment itself is now a regular node on the expression tree. The expression evaluators start by recursively evaluating all their subexpressions if needed. | | |
* | TensorEval are now typed on the device: this will make it possible to use partial template specialization to optimize the strategy of each evaluator for each device type. | Benoit Steiner | 2014-06-10 |
| | Started work on partial evaluations. | | |
* | Added support for tensor contractions | Benoit Steiner | 2014-06-04 |
| | Updated the expression evaluation mechanism to also compute the size of the tensor result. Misc fixes and improvements. | | |
* | Vectorized the evaluation of tensor expressions (using SSE, AVX, NEON, ...) | Benoit Steiner | 2014-05-16 |
| | Added the ability to parallelize the evaluation of a tensor expression over multiple CPU cores. Added the ability to offload the evaluation of a tensor expression to a GPU. | | |