path: root/unsupported/Eigen/CXX11/src/Tensor/TensorDevice.h
Commit message (author, date)
* Fix rule-of-3 for the Tensor module. (Antonio Sanchez, 2020-11-18)
  Adds copy constructors to Tensor ops, inherits assignment operators from `TensorBase`. Addresses #1863.
* Add block evaluation V2 to TensorAsyncExecutor. (Rasmus Munk Larsen, 2019-10-22)
  Add async evaluation to a number of ops.
* Allow move-only done callback in TensorAsyncDevice (Eugene Zhulenev, 2019-09-03)
* Asynchronous expression evaluation with TensorAsyncDevice (Eugene Zhulenev, 2019-08-30)
* Deleted redundant code (Benoit Steiner, 2015-12-03)
* Refined the #ifdef __CUDACC__ guard so that trying to compile GPU code with a non-CUDA compiler results in a linking error instead of bogus code. (Benoit Steiner, 2015-10-23)
* Avoid making an unnecessary copy of the tensor expression when evaluating it on a GPU device (Benoit Steiner, 2015-03-25)
* Cleaned up the TensorDevice code a little bit. (Benoit Steiner, 2015-03-25)
* Added the -= operator to the device classes (Benoit Steiner, 2015-03-19)
* Misc improvements and cleanups (Benoit Steiner, 2014-10-13)
* Reworked the expression evaluation mechanism in order to make it possible to efficiently compute convolutions and contractions in the future: (Benoit Steiner, 2014-06-13)
  * The scheduling of computation is moved out of the assignment code and into a new TensorExecutor class
  * The assignment itself is now a regular node on the expression tree
  * The expression evaluators start by recursively evaluating all their subexpressions if needed
* TensorEval are now typed on the device: this will make it possible to use partial template specialization to optimize the strategy of each evaluator for each device type. (Benoit Steiner, 2014-06-10)
  Started work on partial evaluations.
* Added support for tensor contractions (Benoit Steiner, 2014-06-04)
  Updated expression evaluation mechanism to also compute the size of the tensor result.
  Misc fixes and improvements.
* Vectorized the evaluation of tensor expressions (using SSE, AVX, NEON, ...) (Benoit Steiner, 2014-05-16)
  Added the ability to parallelize the evaluation of a tensor expression over multiple CPU cores.
  Added the ability to offload the evaluation of a tensor expression to a GPU.