Commit message | Author | Age | |
---|---|---|---|
* | Resolve C4346 when building eigen on windows | jangsoopark | 2020-04-08 |
| | |||
* | Convert StridedLinearBufferCopy::Kind to enum class | Eugene Zhulenev | 2020-01-13 |
| | |||
* | Fix for HIP breakage - 191220 | Deven Desai | 2019-12-20 |
| | The breakage was introduced by the following commit: https://gitlab.com/libeigen/eigen/commit/ae07801dd8d295657f28b006e1e4999edf835052 After the commit, HIPCC errors out on some tests with the following error: ``` Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_device_1.dir/cxx11_tensor_device_1_generated_cxx11_tensor_device.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_device.cu:17: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:100: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:129:12: error: no matching constructor for initialization of 'Eigen::internal::TensorBlockResourceRequirements' return {merge(lhs.shape_type, rhs.shape_type), // shape_type ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit copy constructor) not viable: requires 1 argument, but 3 were provided struct TensorBlockResourceRequirements { ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit move constructor) not viable: requires 1 argument, but 3 were provided /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit copy constructor) not viable: requires 5 arguments, but 3 were provided /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit default constructor) not viable: requires 0 arguments, but 3 were provided ... ... ``` The fix is to explicitly declare the (implicitly called) constructor as a device func | ||
* | Tensor block evaluation cost model | Eugene Zhulenev | 2019-12-18 |
| | |||
* | Reduce block evaluation overhead for small tensor expressions | Eugene Zhulenev | 2019-12-17 |
| | |||
* | Remove V2 suffix from TensorBlock | Eugene Zhulenev | 2019-12-10 |
| | |||
* | Remove TensorBlock.h and old TensorBlock/BlockMapper | Eugene Zhulenev | 2019-12-10 |
| | |||
* | Do not use std::vector in getResourceRequirements | Eugene Zhulenev | 2019-12-09 |
| | |||
* | Remove legacy block evaluation support | Eugene Zhulenev | 2019-11-12 |
| | |||
* | Tensor block evaluation V2 support for unary/binary/broadcasting | Eugene Zhulenev | 2019-09-24 |
| | |||
* | Asynchronous expression evaluation with TensorAsyncDevice | Eugene Zhulenev | 2019-08-30 |
| | |||
* | [Eigen] Vectorize evaluation of coefficient-wise functions over tensor blocks if the strides are known to be 1. | Rasmus Munk Larsen | 2019-08-07 |
| | Provides up to 20-25% speedup of the TF cross entropy op with AVX. A few benchmark numbers (name: old time/op, new time/op, delta): BM_Xent_16_10000_cpu: 448µs ± 3%, 389µs ± 2%, -13.21% (p=0.008 n=5+5); BM_Xent_32_10000_cpu: 575µs ± 6%, 454µs ± 3%, -21.00% (p=0.008 n=5+5); BM_Xent_64_10000_cpu: 933µs ± 4%, 712µs ± 1%, -23.71% (p=0.008 n=5+5) | ||
* | Clean up unnecessary namespace specifiers in TensorBlock.h. | Rasmus Munk Larsen | 2019-08-07 |
| | |||
* | [SYCL] This PR adds the minimum modifications to the Eigen unsupported module required to run it on devices supporting SYCL. | Mehdi Goli | 2019-06-28 |
| | * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes. | ||
* | Add missing EIGEN_DEPRECATED annotations to deprecated functions and fix few other doxygen warnings | Eugene Zhulenev | 2019-04-23 |
| | |||
* | Check if it's allowed to squeeze inner dimensions in TensorBlockIO | Eugene Zhulenev | 2018-10-15 |
| | |||
* | Add tests for evalShardedByInnerDim contraction + fix bugs | Eugene Zhulenev | 2018-09-28 |
| | |||
* | Optimize TensorBlockCopyOp | Eugene Zhulenev | 2018-09-27 |
| | |||
* | Initialize BlockIteratorState in a C++03 compatible way. | Rasmus Munk Larsen | 2018-09-20 |
| | |||
* | Silence more compiler warnings. | Rasmus Munk Larsen | 2018-09-19 |
| | |||
* | Merge with upstream eigen/default | Eugene Zhulenev | 2018-09-11 |
|\ | |||
* | | Fix compilation of tiled evaluation code with c++03 | Eugene Zhulenev | 2018-09-11 |
| | | |||
* | | Merge with upstream eigen/default | Eugene Zhulenev | 2018-08-27 |
|\ \ | |||
* \ \ | Merge with eigen/default | Eugene Zhulenev | 2018-08-10 |
|\ \ \ | |||
* | | | | Add block evaluation to CwiseUnaryOp and add PreferBlockAccess enum to all evaluators | Eugene Zhulenev | 2018-08-10 |
| | | | | |||
* | | | | Merged with upstream eigen | Eugene Zhulenev | 2018-08-08 |
|\ \ \ \ | |||
| | | | * | adding EIGEN_DEVICE_FUNC attribute to fix some GPU unit tests that are broken in HIP mode | Deven Desai | 2018-08-27 |
| | | |/ | |||
| | | * | Fix some trivial C++11 vs C++03 compatibility warnings | Christoph Hertzberg | 2018-08-25 |
| | | | | |||
| | | * | Fix several integer conversion and sign-compare warnings | Christoph Hertzberg | 2018-08-24 |
| | | | | |||
| | | * | Cleaned up the code and make it compile with more compilers | Benoit Steiner | 2018-08-14 |
| | | | | |||
| | | * | Fixed compilation errors with gcc 4.7 and 4.8 | Benoit Steiner | 2018-08-14 |
| | | | | |||
| | | * | Don't use the auto keyword since it's not always supported properly. | Benoit Steiner | 2018-08-13 |
| | | | | |||
| | | * | Fixed syntax of nested templates chevrons to make it compatible with c++97 mode. | Benoit Steiner | 2018-08-13 |
| | |/ | |||
| | * | Made the code compile with gcc 5.4. | Benoit Steiner | 2018-08-10 |
| |/ | |||
| * | Fixing compiler warning in TensorBlock.h as it was creating a lot of noise at compilation. | Mehdi Goli | 2018-08-08 |
| | | |||
| * | Silenced a couple of compilation warnings. | Benoit Steiner | 2018-08-06 |
| | | |||
* | | Replace all using declarations with typedefs in Tensor ops | Eugene Zhulenev | 2018-08-01 |
| | | |||
* | | Add block evaluation support to TensorOps | Eugene Zhulenev | 2018-07-31 |
|/ | |||
* | Rename Index to StorageIndex + use Eigen::Array and Eigen::Map when possible | Eugene Zhulenev | 2018-07-27 |
| | |||
* | Add tiled evaluation support to TensorExecutor | Eugene Zhulenev | 2018-07-25 |
| | |||
* | TensorBlockIO | Eugene Zhulenev | 2018-07-23 |
| | |||
* | Initial support of TensorBlock | Eugene Zhulenev | 2018-07-20 |