aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor
Commit message (Collapse)AuthorAge
* Merge with eigen/defaultGravatar Eugene Zhulenev2018-08-10
|\
* | Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵Gravatar Eugene Zhulenev2018-08-10
| | | | | | | | evaluators
| * Avoided language features that are only available in cxx11 mode.Gravatar Benoit Steiner2018-08-10
| |
| * Made the code compile with gcc 5.4.Gravatar Benoit Steiner2018-08-10
| |
* | Fix bug in a test + compilation errorsGravatar Eugene Zhulenev2018-08-09
| |
* | Merged with upstream eigenGravatar Eugene Zhulenev2018-08-08
|\|
| * Merged in codeplaysoftware/eigen-upstream-pure/Fixing_compiler_warning (pull ↵Gravatar Benoit Steiner2018-08-08
| |\ | | | | | | | | | | | | | | | request PR-462) Fixing compiler warning in TensorBlock.h as it was creating a lot of noise at compilation.
| | * Fixing compiler warning in TensorBlock.h as it was creating a lot of noise ↵Gravatar Mehdi Goli2018-08-08
| | | | | | | | | | | | at compilation.
| * | Fix init order.Gravatar Rasmus Munk Larsen2018-08-07
| |/
| * Silenced a couple of compilation warnings.Gravatar Benoit Steiner2018-08-06
| |
| * Fixed compilation errors.Gravatar Benoit Steiner2018-08-06
| |
| * Forward declare NoOpOutputKernel as struct rather than class to be ↵Gravatar Rasmus Munk Larsen2018-08-06
| | | | | | | | consistent with implementation.
* | Replace all using declarations with typedefs in Tensor opsGravatar Eugene Zhulenev2018-08-01
| |
| * Fix initialization order.Gravatar Rasmus Munk Larsen2018-08-03
| |
| * Fixing the compilation error.Gravatar Mehdi Goli2018-08-03
| |
| * Creating separate SYCL required PR for uncontroversial files.Gravatar Mehdi Goli2018-08-03
| |
* | Fix typo + get rid of redundant member variables for block sizesGravatar Eugene Zhulenev2018-08-01
| |
| * Merged in paultucker/eigen (pull request PR-431)Gravatar Benoit Steiner2018-08-01
| |\ | | | | | | | | | | | | | | | Optional ThreadPoolDevice allocator Approved-by: Benoit Steiner <benoit.steiner.goog@gmail.com>
* | | Merged latest changes from upstream/eigenGravatar Eugene Zhulenev2018-08-01
|\| |
| * | Merged in codeplaysoftware/eigen-upstream-pure/eigen_variadic_assert (pull ↵Gravatar Benoit Steiner2018-08-01
| |\ \ | | | | | | | | | | | | | | | | | | | | request PR-447) Adding variadic version of assert which can take a parameter pack as its input.
| * \ \ Merged in ↵Gravatar Benoit Steiner2018-08-01
| |\ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | codeplaysoftware/eigen-upstream-pure/separating_internal_memory_allocation (pull request PR-446) Distinguishing between internal memory allocation/deallocation from explicit user memory allocation/deallocation.
| | * | | Correcting the position of allocate_temp/deallocate_temp in TensorDeviceGpu.hGravatar Mehdi Goli2018-08-01
| | | | |
| * | | | Merged in codeplaysoftware/eigen-upstream-pure/using_PacketType_class (pull ↵Gravatar Benoit Steiner2018-08-01
| |\ \ \ \ | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | request PR-449) Enabling per device specialisation of packetSize.
| | | | * | Using the suggested modification.Gravatar Mehdi Goli2018-08-01
| | | | | |
| | * | | | Enabling per device specialisation of packetsize.Gravatar Mehdi Goli2018-08-01
| | | | | |
| | | | * | variadic version of assert which can take a parameter pack as its input.Gravatar Mehdi Goli2018-08-01
| | | |/ / | | |/| |
| | | * | Distinguishing between internal memory allocation/deallocation from explicit ↵Gravatar Mehdi Goli2018-08-01
| | |/ / | | | | | | | | | | | | user memory allocation/deallocation.
| * / / Converting ad-hoc inline keyword to EIGEN_STRONG_INLINE MACRO.Gravatar Mehdi Goli2018-08-01
| |/ /
* | | Add block evaluation support to TensorOpsGravatar Eugene Zhulenev2018-07-31
| | |
| * | Merged in yuefengz/eigen (pull request PR-370)Gravatar Benoit Steiner2018-07-31
| |\ \ | | | | | | | | | | | | Use device's allocate function instead of internal::aligned_malloc.
| | | * Change getAllocator() to allocator() in ThreadPoolDevice.Gravatar Paul Tucker2018-07-31
| | | |
| * | | Merged in ezhulenev/eigen/tiling_3 (pull request PR-438)Gravatar Gael Guennebaud2018-07-31
| |\ \ \ | |/ / / |/| | | | | | | Tiled tensor executor
| * | | Speedup trivial tensor broadcasting on GPU by enforcing unaligned loads. See ↵Gravatar Gael Guennebaud2018-07-31
| | | | | | | | | | | | | | | | PR 437.
* | | | Rename Index to StorageIndex + use Eigen::Array and Eigen::Map when possibleGravatar Eugene Zhulenev2018-07-27
| | | |
* | | | Add tiled evaluation support to TensorExecutorGravatar Eugene Zhulenev2018-07-25
| | | |
| * | | Reduce the number of template specializations of classes related to tensor ↵Gravatar Rasmus Munk Larsen2018-07-27
| | | | | | | | | | | | | | | | contraction to reduce binary size.
* | | | TensorBlockIOGravatar Eugene Zhulenev2018-07-23
|/ / /
* | | Initial support of TensorBlockGravatar Eugene Zhulenev2018-07-20
| | |
| | * Add test coverage for ThreadPoolDevice optional allocator.Gravatar Paul Tucker2018-07-19
| | |
* | | PR430: Convert count to the reducer type in MeanReducerGravatar Eugene Zhulenev2018-07-19
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Without explicit conversion Tensorflow fails to compile, pset1 template deduction fails. cannot convert '((const Eigen::internal::MeanReducer<Eigen::half>*)this) ->Eigen::internal::MeanReducer<Eigen::half>::packetCount_' (type 'const DenseIndex {aka const long int}') to type 'const type& {aka const Eigen::half&}' return pdiv(vaccum, pset1<Packet>(packetCount_)); Honestly I’m not sure why it works in Eigen tests, because Eigen::half constructor is explicit, and why it stopped working in TF, I didn’t find any relevant changes since previous Eigen upgrade. static_cast<T>(packetCount_) - breaks cxx11_tensor_reductions test for Eigen::half, also quite surprising.
| | * Actually add optional Allocator* arg to ThreadPoolDevice().Gravatar Paul Tucker2018-07-16
| | |
| | * Add optional Allocator argument to ThreadPoolDevice constructor.Gravatar Paul Tucker2018-07-16
| | | | | | | | | | | | | | | | | | When supplied, this allocator will be used in place of internal::aligned_malloc. This permits e.g. use of a NUMA-node specific allocator where the thread-pool is also restricted a single NUMA-node.
* | | Reduce number of allocations in TensorContractionThreadPool.Gravatar Eugene Zhulenev2018-07-16
| | |
* | | bug #1569: fix Tensor<half>::mean() on AVX with respective unit test.Gravatar Gael Guennebaud2018-07-19
| | |
* | | Assert that no output kernel is defined for GPU contractionGravatar Eugene Zhulenev2018-07-18
| | |
* | | Specify default output kernel for TensorContractionOpGravatar Eugene Zhulenev2018-07-18
| | |
| * | Use device's allocate function instead of internal::aligned_malloc. This ↵Gravatar Yuefeng Zhou2018-02-20
| | | | | | | | | | | | would make it easier to track memory usage in device instances.
* | | Added a move constructor and move assignment operator to Tensor and wrote ↵Gravatar Viktor Csomor2018-02-07
| | | | | | | | | | | | some tests.
* | | Fix TensorContractionOp evaluators for GPU and SYCLGravatar Eugene Zhulenev2018-07-17
| | |
* | | applying EIGEN_DECLARE_TEST to *gpu* testsGravatar Deven Desai2018-07-17
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Also, a few minor fixes for GPU tests running in HIP mode. 1. Adding an include for hip/hip_runtime.h in the Macros.h file For HIP __host__ and __device__ are macros which are defined in hip headers. Their definitions need to be included before their use in the file. 2. Fixing the compile failure in TensorContractionGpu introduced by the commit to "Fuse computations into the Tensor contractions using output kernel" 3. Fixing a HIP/clang specific compile error by making the struct-member assignment explicit