aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorContractionMapper.h
Commit message (Collapse)AuthorAge
* [SYCL] Rebasing the SYCL support branch on top of the Einge upstream master ↵Gravatar Mehdi Goli2019-11-28
| | | | | | | | | | | | | | | | | | | | | | branch. * Unifying all loadLocalTile from lhs and rhs to an extract_block function. * Adding get_tensor operation which was missing in TensorContractionMapper. * Adding the -D method missing from cmake for Disable_Skinny Contraction operation. * Wrapping all the indices in TensorScanSycl into Scan parameter struct. * Fixing typo in Device SYCL * Unifying load to private register for tall/skinny no shared * Unifying load to vector tile for tensor-vector/vector-tensor operation * Removing all the LHS/RHS class for extracting data from global * Removing Outputfunction from TensorContractionSkinnyNoshared. * Combining the local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining General Tensor-Vector and VectorTensor contraction into one kernel. * Making double buffering optional for Tensor contraction when local memory is version is used. * Modifying benchmark to accept custom Reduction Sizes * Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host * Adding Test for SYCL * Modifying SYCL CMake
* [SYCL] This PR adds the minimum modifications to the Eigen unsupported ↵Gravatar Mehdi Goli2019-06-28
| | | | | | | | | | module required to run it on devices supporting SYCL. * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes.
* Add support for custom packed Lhs/Rhs blocks in tensor contractionsGravatar Eugene Zhulenev2019-04-01
|
* Workaround lack of support for arbitrary packet-type in Tensor by manually ↵Gravatar Gael Guennebaud2019-01-30
| | | | loading half/quarter packets in tensor contraction mapper.
* Hide some annoying unused variable warnings in g++8.1Gravatar Christoph Hertzberg2019-01-29
|
* Fiw shadowing of last and allGravatar Gael Guennebaud2018-09-21
|
* Fix gebp kernel for real+complex in case only reals are vectorized (e.g., ↵Gravatar Gael Guennebaud2018-09-20
| | | | | | AVX512). This commit also removes "half-packet" from data-mappers: it was not used and conceptually broken anyways.
* Merged eigen/eigen into defaultGravatar Benoit Steiner2016-12-14
|\
* | Adding tensor contraction operation backend for Sycl; adding test for ↵Gravatar Mehdi Goli2016-12-14
| | | | | | | | contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h
| * Fixed build error with MSVCGravatar Benoit Steiner2016-12-09
|/
* Add missing generic load methods.Gravatar Gael Guennebaud2016-12-03
|
* Improved the contraction mapper to properly support tensor productsGravatar Benoit Steiner2016-07-11
|
* Deleted useless trailing commasGravatar Benoit Steiner2016-04-29
|
* Deleted unused default values for template parametersGravatar Benoit Steiner2016-04-29
|
* Fixed a typo.Gravatar Benoit Steiner2016-03-10
|
* Fixed the tensor chipping code.Gravatar Benoit Steiner2016-03-08
|
* Decoupled the packet type definition from the definition of the tensor ops. ↵Gravatar Benoit Steiner2016-03-08
| | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit.
* Don't try to use direct offsets when computing a tensor product, since the ↵Gravatar Benoit Steiner2016-02-02
| | | | required stride isn't available.
* Leverage the new blocking code in the tensor contraction code.Gravatar Benoit Steiner2016-01-22
|
* Reduce the register pressure exerted by the tensor mappers whenever ↵Gravatar Benoit Steiner2016-01-20
| | | | possible. This improves the performance of the contraction of a matrix with a vector by about 35%.
* Moved the contraction mapping code to its own file to make the code more ↵Gravatar Benoit Steiner2016-01-19
manageable.