aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h
Commit message (Collapse)AuthorAge
* Resolve C4346 when building eigen on windowsGravatar jangsoopark2020-04-08
|
* Convert StridedLinearBufferCopy::Kind to enum classGravatar Eugene Zhulenev2020-01-13
|
* Fix for HIP breakage - 191220Gravatar Deven Desai2019-12-20
| | | | | | | | | | | | | | | | | | | | | | | | | | | The breakage was introduced by the following commit : https://gitlab.com/libeigen/eigen/commit/ae07801dd8d295657f28b006e1e4999edf835052 After the commit, HIPCC errors out on some tests with the following error ``` Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_device_1.dir/cxx11_tensor_device_1_generated_cxx11_tensor_device.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_device.cu:17: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:100: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:129:12: error: no matching constructor for initialization of 'Eigen::internal::TensorBlockResourceRequirements' return {merge(lhs.shape_type, rhs.shape_type), // shape_type ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit copy constructor) not viable: requires 1 argument, but 3 were provided struct TensorBlockResourceRequirements { ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit move constructor) not viable: requires 1 argument, but 3 were provided /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit copy constructor) not viable: requires 5 arguments, but 3 were provided /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit default constructor) not viable: requires 0 arguments, but 3 were provided ... ... ``` The fix is to explicitly decalre the (implicitly called) constructor as a device func
* Tensor block evaluation cost modelGravatar Eugene Zhulenev2019-12-18
|
* Reduce block evaluation overhead for small tensor expressionsGravatar Eugene Zhulenev2019-12-17
|
* Remove V2 suffix from TensorBlockGravatar Eugene Zhulenev2019-12-10
|
* Remove TensorBlock.h and old TensorBlock/BlockMapperGravatar Eugene Zhulenev2019-12-10
|
* Do not use std::vector in getResourceRequirementsGravatar Eugene Zhulenev2019-12-09
|
* Remove legacy block evaluation supportGravatar Eugene Zhulenev2019-11-12
|
* Tensor block evaluation V2 support for unary/binary/broadcstingGravatar Eugene Zhulenev2019-09-24
|
* Asynchronous expression evaluation with TensorAsyncDeviceGravatar Eugene Zhulenev2019-08-30
|
* [Eigen] Vectorize evaluation of coefficient-wise functions over tensor ↵Gravatar Rasmus Munk Larsen2019-08-07
| | | | | | | | | | | | blocks if the strides are known to be 1. Provides up to 20-25% speedup of the TF cross entropy op with AVX. A few benchmark numbers: name old time/op new time/op delta BM_Xent_16_10000_cpu 448µs ± 3% 389µs ± 2% -13.21% (p=0.008 n=5+5) BM_Xent_32_10000_cpu 575µs ± 6% 454µs ± 3% -21.00% (p=0.008 n=5+5) BM_Xent_64_10000_cpu 933µs ± 4% 712µs ± 1% -23.71% (p=0.008 n=5+5)
* Clean up unnecessary namespace specifiers in TensorBlock.h.Gravatar Rasmus Munk Larsen2019-08-07
|
* [SYCL] This PR adds the minimum modifications to the Eigen unsupported ↵Gravatar Mehdi Goli2019-06-28
| | | | | | | | | | module required to run it on devices supporting SYCL. * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes.
* Add missing EIGEN_DEPRECATED annotations to deprecated functions and fix few ↵Gravatar Eugene Zhulenev2019-04-23
| | | | other doxygen warnings
* Check if it's allowed to squueze inner dimensions in TensorBlockIOGravatar Eugene Zhulenev2018-10-15
|
* Add tests for evalShardedByInnerDim contraction + fix bugsGravatar Eugene Zhulenev2018-09-28
|
* Optimize TensorBlockCopyOpGravatar Eugene Zhulenev2018-09-27
|
* Initialize BlockIteratorState in a C++03 compatible way.Gravatar Rasmus Munk Larsen2018-09-20
|
* Silence more compiler warnings.Gravatar Rasmus Munk Larsen2018-09-19
|
* Merge with upstream eigen/defaultGravatar Eugene Zhulenev2018-09-11
|\
* | Fix compilation of tiled evaluation code with c++03Gravatar Eugene Zhulenev2018-09-11
| |
* | Merge with upstream eigen/defaultGravatar Eugene Zhulenev2018-08-27
|\ \
* \ \ Merge with eigen/defaultGravatar Eugene Zhulenev2018-08-10
|\ \ \
* | | | Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵Gravatar Eugene Zhulenev2018-08-10
| | | | | | | | | | | | | | | | evaluators
* | | | Merged with upstream eigenGravatar Eugene Zhulenev2018-08-08
|\ \ \ \
| | | | * adding EIGEN_DEVICE_FUNC attribute to fix some GPU unit tests that are ↵Gravatar Deven Desai2018-08-27
| | | |/ | | | | | | | | | | | | broken in HIP mode
| | | * Fix some trivial C++11 vs C++03 compatibility warningsGravatar Christoph Hertzberg2018-08-25
| | | |
| | | * Fix several integer conversion and sign-compare warningsGravatar Christoph Hertzberg2018-08-24
| | | |
| | | * Cleaned up the code and make it compile with more compilersGravatar Benoit Steiner2018-08-14
| | | |
| | | * Fixed compilation errors with gcc 4.7 and 4.8Gravatar Benoit Steiner2018-08-14
| | | |
| | | * Don't use the auto keyword since it's not always supported properly.Gravatar Benoit Steiner2018-08-13
| | | |
| | | * Fixed syntax of nested templates chevrons to make it compatible with c++97 mode.Gravatar Benoit Steiner2018-08-13
| | |/
| | * Made the code compile with gcc 5.4.Gravatar Benoit Steiner2018-08-10
| |/
| * Fixing compiler warning in TensorBlock.h as it was creating a lot of noise ↵Gravatar Mehdi Goli2018-08-08
| | | | | | | | at compilation.
| * Silenced a couple of compilation warnings.Gravatar Benoit Steiner2018-08-06
| |
* | Replace all using declarations with typedefs in Tensor opsGravatar Eugene Zhulenev2018-08-01
| |
* | Add block evaluation support to TensorOpsGravatar Eugene Zhulenev2018-07-31
|/
* Rename Index to StorageIndex + use Eigen::Array and Eigen::Map when possibleGravatar Eugene Zhulenev2018-07-27
|
* Add tiled evaluation support to TensorExecutorGravatar Eugene Zhulenev2018-07-25
|
* TensorBlockIOGravatar Eugene Zhulenev2018-07-23
|
* Initial support of TensorBlockGravatar Eugene Zhulenev2018-07-20