aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorContractionBlocking.h
Commit message (Collapse)AuthorAge
* syncing this fork with upstreamGravatar Deven Desai2018-06-13
|\
* | Adding support for using Eigen in HIP kernels.Gravatar Deven Desai2018-06-06
| | | | | | | | | | | | | | | | | | This commit enables the use of Eigen on HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compile (irrespective of whether or not the underlying compiler is CUDACC/NVCC, for e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to using the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor) Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP specific unit tests.
| * Fix typos found using codespellGravatar Gael Guennebaud2018-06-07
|/
* Leverage libxsmm kernels within signle threaded contractionsGravatar Benoit Steiner2016-12-21
|
* Use computeProductBlockingSizes to compute blocking for both ShardByCol and ↵Gravatar Rasmus Munk Larsen2016-04-27
| | | | ShardByRow cases.
* Marked several methods EIGEN_DEVICE_FUNCGravatar Benoit Steiner2016-01-28
|
* Created a mechanism to enable contraction mappers to determine the best ↵Gravatar Benoit Steiner2016-01-22
blocking strategy.