aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported
Commit message (Collapse)AuthorAge
...
* Added special_packetmath test and tweaked bounds on tests.Gravatar Srinivas Vasudevan2020-01-11
| | | | | Refactor shared packetmath code to header file. (Squashed from PR !38)
* Properly initialize b vector in SplineFittingGravatar Matthew Powelson2020-01-09
| | | InterpolateWithDerivative does not initialize the be vector correctly. This issue is discussed In stackoverflow question 48382939.
* Bug #1785: Introduce numext::rint.Gravatar Ilya Tokar2020-01-07
| | | | | | This provides a new op that matches std::rint and previous behavior of pround. Also adds corresponding unsupported/../Tensor op. Performance is the same as e. g. floor (tested SSE/AVX).
* [SYCL Backend]Gravatar mehdi-goli2020-01-07
| | | | | | | * Adding Missing operations for vector comparison in SYCL. This caused compiler error for vector comparison when compiling SYCL * Fixing the compiler error for placement new in TensorForcedEval.h This caused compiler error when compiling SYCL backend * Reducing the SYCL warning by removing the abort function inside the kernel * Adding Strong inline to functions inside SYCL interop.
* Fix for HIP breakage - 191220Gravatar Deven Desai2019-12-20
| | | | | | | | | | | | | | | | | | | | | | | | | | | The breakage was introduced by the following commit : https://gitlab.com/libeigen/eigen/commit/ae07801dd8d295657f28b006e1e4999edf835052 After the commit, HIPCC errors out on some tests with the following error ``` Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_device_1.dir/cxx11_tensor_device_1_generated_cxx11_tensor_device.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_device.cu:17: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:100: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:129:12: error: no matching constructor for initialization of 'Eigen::internal::TensorBlockResourceRequirements' return {merge(lhs.shape_type, rhs.shape_type), // shape_type ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit copy constructor) not viable: requires 1 argument, but 3 were provided struct TensorBlockResourceRequirements { ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit move constructor) not viable: requires 1 argument, but 3 were provided /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit copy constructor) not viable: requires 5 arguments, but 3 were provided /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlock.h:75:8: note: candidate constructor (the implicit default constructor) not viable: requires 0 arguments, but 3 were provided ... ... ``` The fix is to explicitly decalre the (implicitly called) constructor as a device func
* Bug #1796: Make matrix squareroot usable for Map and Ref typesGravatar Christoph Hertzberg2019-12-20
|
* Reduce code duplication and avoid confusing DoxygenGravatar Christoph Hertzberg2019-12-19
|
* Hide recursive meta templates from DoxygenGravatar Christoph Hertzberg2019-12-19
|
* Use double-braces initialization (as everywhere else in the test-suite).Gravatar Christoph Hertzberg2019-12-19
|
* Fix trivial shadow warningGravatar Christoph Hertzberg2019-12-19
|
* Fix TensorPadding bug in squeezed reads from inner dimension Gravatar Eugene Zhulenev2019-12-19
|
* Return const data pointer from TensorRef evaluator.data()Gravatar Eugene Zhulenev2019-12-18
|
* Tensor block evaluation cost modelGravatar Eugene Zhulenev2019-12-18
|
* fix compilation due to new HIP scalar accessorGravatar Jeff Daily2019-12-17
|
* Reduce block evaluation overhead for small tensor expressionsGravatar Eugene Zhulenev2019-12-17
|
* Initialize non-trivially constructible types when allocating a temp buffer.Gravatar Eugene Zhulenev2019-12-12
|
* Squeeze reads from two inner dimensions in TensorPaddingGravatar Eugene Zhulenev2019-12-11
|
* Add back accidentally deleted default constructor to ↵Gravatar Eugene Zhulenev2019-12-11
| | | | TensorExecutorTilingContext.
* Remove block memory allocation required by removed block evaluation APIGravatar Eugene Zhulenev2019-12-10
|
* Remove V2 suffix from TensorBlockGravatar Eugene Zhulenev2019-12-10
|
* Remove TensorBlock.h and old TensorBlock/BlockMapperGravatar Eugene Zhulenev2019-12-10
|
* Fix for HIP breakage detected on 191210Gravatar Deven Desai2019-12-10
| | | | | | | | The following commit introduces compile errors when running eigen with hipcc https://gitlab.com/libeigen/eigen/commit/2918f85ba976dbfbf72f7d4c1961a577f5850148 hipcc errors out because it requies the device attribute on the methods within the TensorBlockV2ResourceRequirements struct instroduced by the commit above. The fix is to add the device attribute to those methods
* Do not use std::vector in getResourceRequirementsGravatar Eugene Zhulenev2019-12-09
|
* Undo the block size change.Gravatar Artem Belevich2019-12-09
| | | | .z *is* used by the EigenContractionKernelInternal().
* Add async evaluation support to TensorSelectOpGravatar Eugene Zhulenev2019-12-09
|
* fix AlignedVector3 inconsisent interface with other Vector classes, default ↵Gravatar Janek Kozicki2019-12-06
| | | | constructor and operator- were missing.
* Add recursive work splitting to EvalShardedByInnerDimContextGravatar Eugene Zhulenev2019-12-05
|
* Improve performance of contraction kernelsGravatar Artem Belevich2019-12-05
| | | | | | | | | | * Force-inline implementations. They pass around pointers to shared memory blocks. Without inlining compiler must operate via generic pointers. Inlining allows compiler to detect that we're operating on shared memory which allows generation of substantially faster code. * Fixed a long-standing typo which resulted in launching 8x more kernels than we needed (.z dimension of the block is unused by the kernel).
* Add missing initialization in cxx11_tensor_trace.cpp.Gravatar Rasmus Munk Larsen2019-12-04
|
* Capture TensorMap by value inside tensor expression ASTGravatar Eugene Zhulenev2019-12-03
|
* Remove __host__ annotation for device-only function.Gravatar Rasmus Munk Larsen2019-12-03
|
* Use EIGEN_DEVICE_FUNC macro instead of __device__.Gravatar Rasmus Munk Larsen2019-12-03
|
* [SYCL] Rebasing the SYCL support branch on top of the Einge upstream master ↵Gravatar Mehdi Goli2019-11-28
| | | | | | | | | | | | | | | | | | | | | | branch. * Unifying all loadLocalTile from lhs and rhs to an extract_block function. * Adding get_tensor operation which was missing in TensorContractionMapper. * Adding the -D method missing from cmake for Disable_Skinny Contraction operation. * Wrapping all the indices in TensorScanSycl into Scan parameter struct. * Fixing typo in Device SYCL * Unifying load to private register for tall/skinny no shared * Unifying load to vector tile for tensor-vector/vector-tensor operation * Removing all the LHS/RHS class for extracting data from global * Removing Outputfunction from TensorContractionSkinnyNoshared. * Combining the local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining General Tensor-Vector and VectorTensor contraction into one kernel. * Making double buffering optional for Tensor contraction when local memory is version is used. * Modifying benchmark to accept custom Reduction Sizes * Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host * Adding Test for SYCL * Modifying SYCL CMake
* Add async evaluation support to TensorReverseGravatar Eugene Zhulenev2019-11-26
|
* Add async evaluation support to TensorPadding/TensorImagePatch/TensorShufflingGravatar Eugene Zhulenev2019-11-26
|
* STYLE: Convert CMake-language commands to lower caseGravatar Hans Johnson2019-10-31
| | | | | Ancient CMake versions required upper-case commands. Later command names became case-insensitive. Now the preferred style is lower-case.
* STYLE: Remove CMake-language block-end command argumentsGravatar Hans Johnson2019-10-31
| | | | | | Ancient versions of CMake required else(), endif(), and similar block termination commands to have arguments matching the command starting the block. This is no longer the preferred style.
* bug #1747: one more fix for MSVC regarding the Bessel implementation.Gravatar Gael Guennebaud2019-11-15
|
* bug #1281: fix AutoDiffScalar's make_coherent for nested expression of ↵Gravatar Gael Guennebaud2019-11-14
| | | | constant ADs.
* Remove legacy block evaluation supportGravatar Eugene Zhulenev2019-11-12
|
* Fix a race in async tensor evaluation: Don't run on_done() until after ↵Gravatar Rasmus Munk Larsen2019-11-11
| | | | device.deallocate() / evaluator.cleanup() complete, since the device might be destroyed after on_done() runs.
* Break loop dependence in TensorGenerator block accessGravatar Eugene Zhulenev2019-11-11
|
* Fix data race in css11_tensor_notification test.Gravatar Rasmus Munk Larsen2019-11-08
|
* Add EIGEN_HAS_INTRINSIC_INT128 macroGravatar Rasmus Munk Larsen2019-11-06
| | | | Add a new EIGEN_HAS_INTRINSIC_INT128 macro, and use this instead of __SIZEOF_INT128__. This fixes related issues with TensorIntDiv.h when building with Clang for Windows, where support for 128-bit integer arithmetic is advertised but broken in practice.
* Rollback or PR-746 and partial rollback of ↵Gravatar Rasmus Munk Larsen2019-11-05
| | | | | | | | https://bitbucket.org/eigen/eigen/commits/668ab3fc474e54c7919eda4fbaf11f3a99246494 . std::array is still not supported in CUDA device code on Windows.
* Merged in ezhulenev/eigen-01 (pull request PR-746)Gravatar Rasmus Larsen2019-11-04
|\ | | | | | | Remove internal::smart_copy and replace with std::copy
* | Cleanup includes in Tensor module after switch to C++11 and aboveGravatar Eugene Zhulenev2019-10-29
| |
| * Remove internal::smart_copy and replace with std::copyGravatar Eugene Zhulenev2019-10-29
|/
* Fix CXX11Meta compilation with MSVCGravatar Eugene Zhulenev2019-10-28
|
* Prevent potential ODR in TensorExecutorGravatar Eugene Zhulenev2019-10-28
|