aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
* Remove internal::smart_copy and replace with std::copyGravatar Eugene Zhulenev2019-10-29
|
* Fix CXX11Meta compilation with MSVCGravatar Eugene Zhulenev2019-10-28
|
* Prevent potential ODR in TensorExecutorGravatar Eugene Zhulenev2019-10-28
|
* This PR fixes:Gravatar Mehdi Goli2019-10-23
| | | | | * The specialization of array class in the different namespace for GCC<=6.4 * The implicit call to `std::array` constructor using the initializer list for GCC <=6.1
* Merged in deven-amd/eigen-hip-fix-191018 (pull request PR-738)Gravatar Rasmus Larsen2019-10-22
|\ | | | | | | Fix for the HIP build+test errors.
* | Add block evaluation V2 to TensorAsyncExecutor.Gravatar Rasmus Munk Larsen2019-10-22
| | | | | | | | Add async evaluation to a number of ops.
| * Fix for the HIP build+test errors.Gravatar Deven Desai2019-10-22
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The errors were introduced by this commit : After the above mentioned commit, some of the tests started failing with the following error ``` Built target cxx11_tensor_reduction Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:117: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:155:5: error: the field type is not amp-compatible DestinationBufferKind m_kind; ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:211:3: error: the field type is not amp-compatible DestinationBuffer m_destination; ^ ``` For some reason HIPCC does not like device code to contain enum types which do not have the base-type explicitly declared. The fix is trivial, explicitly state "int" as the basetype
* | Drop support for c++03 in Eigen tensor. Get rid of some code used to emulate ↵Gravatar Rasmus Munk Larsen2019-10-18
|/ | | | c++11 functionality with older compilers.
* Propagate block evaluation preference through rvalue tensor expressionsGravatar Eugene Zhulenev2019-10-17
|
* Cleanup Tensor block destination and materialized block storage allocationGravatar Eugene Zhulenev2019-10-16
|
* TensorBroadcasting support for random/uniform blocksGravatar Eugene Zhulenev2019-10-16
|
* Block evaluation for TensorGenerator/TensorReverse/TensorShufflingGravatar Eugene Zhulenev2019-10-14
|
* bug #1747: fix compilation with MSVCGravatar Gael Guennebaud2019-10-14
|
* Block evaluation for TensorGenerator + TensorReverse + fixed bug in tensor ↵Gravatar Eugene Zhulenev2019-10-10
| | | | reverse op
* Merged in ezhulenev/eigen-01 (pull request PR-726)Gravatar Rasmus Larsen2019-10-10
|\ | | | | | | Block evaluation for TensorChipping + fixed bugs in TensorPadding and TensorSlicing
* | bug #1752: make is_convertible equivalent to the std c++11 equivalent and ↵Gravatar Gael Guennebaud2019-10-10
| | | | | | | | fallback to std::is_convertible when c++11 is enabled.
* | bug #1752: disable some is_convertible tests for recent compilers.Gravatar Gael Guennebaud2019-10-10
| |
| * Block evaluation for TensorChipping + fixed bugs in TensorPadding and ↵Gravatar Eugene Zhulenev2019-10-09
|/ | | | TensorSlicing
* Implement c++03 compatible fix for changeset ↵Gravatar Gael Guennebaud2019-10-09
| | | | 7a43af1a335da2c0489b4119a33ee1cbff0c15d6
* Explicitly bypass resize and memmoves when there is already the exact right ↵Gravatar Gael Guennebaud2019-10-08
| | | | number of elements available.
* Disable an expected warning in unit testGravatar Gael Guennebaud2019-10-08
|
* fix one more possible conflicts with real/imagGravatar Gael Guennebaud2019-10-08
|
* PR 719: fix real/imag namespace conflictGravatar Gael Guennebaud2019-10-08
|
* Fix compilation of FFTW unit testGravatar Gael Guennebaud2019-10-08
|
* Add block evaluation to TensorEvalTo and fix few small bugsGravatar Eugene Zhulenev2019-10-07
|
* Fixing incorrect size in Tensor documentation.Gravatar Brian Zhao2019-10-04
|
* Use "pdiv" rather than operator/ to support packet types.Gravatar Rasmus Munk Larsen2019-10-04
|
* Merged in ezhulenev/eigen-01 (pull request PR-723)Gravatar Rasmus Larsen2019-10-04
|\ | | | | | | | | | | Add block evaluation to TensorReshaping/TensorCasting/TensorPadding/TensorSelect Approved-by: Rasmus Larsen <rmlarsen@google.com>
| * Fix compilation warnings and errors with clang in TensorBlockV2 code and testsGravatar Eugene Zhulenev2019-10-04
| |
* | Address comments on Chebyshev evaluation code:Gravatar Rasmus Munk Larsen2019-10-02
| | | | | | | | | | 1. Use pmadd when possible. 2. Add casts to avoid c++03 warnings.
| * Add block evaluation to TensorReshaping/TensorCasting/TensorPadding/TensorSelectGravatar Eugene Zhulenev2019-10-02
|/
* Add beta to TensorContractionKernel and make memset optionalGravatar Eugene Zhulenev2019-10-02
|
* Prevent infinite loop in the nvcc compiler while unrolling the recurrent ↵Gravatar Rasmus Munk Larsen2019-10-01
| | | | templates for Chebyshev polynomial evaluation.
* Fix perf issue in SimplicialLDLT::solve for complexes (again, m_diag is real)Gravatar Gael Guennebaud2019-10-01
|
* Fix speed issue with SimplicialLDLT for complexes: the diagonal is real!Gravatar Gael Guennebaud2019-09-30
|
* Move implementation of vectorized error function erf() to ↵Gravatar Rasmus Munk Larsen2019-09-27
| | | | SpecialFunctionsImpl.h.
* Fix cxx11_tensor_block_io testGravatar Eugene Zhulenev2019-09-25
|
* Fix erf in c++03Gravatar Eugene Zhulenev2019-09-25
|
* Fix compilation warnings and errors with clang in TensorBlockV2Gravatar Eugene Zhulenev2019-09-25
|
* Fix for the HIP build+test errors.Gravatar Deven Desai2019-09-25
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The errors were introduced by this commit : https://bitbucket.org/eigen/eigen/commits/d38e6fbc27abe0c354ffe90928f6741c378e76e1 After the above mentioned commit, some of the tests started failing with the following error ``` Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:70: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsHalf.h:28:22: error: call to 'erf' is ambiguous return Eigen::half(Eigen::numext::erf(static_cast<float>(a))); ^~~~~~~~~~~~~~~~~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1600:7: note: candidate function [with T = float] float erf(const float &x) { return ::erff(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = float] erf(const Scalar& x) { ^ In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:75: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/arch/GPU/GpuSpecialFunctions.h:87:23: error: call to 'erf' is ambiguous return make_double2(erf(a.x), erf(a.y)); ^~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1603:8: note: candidate function [with T = double] double erf(const double &x) { return ::erf(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = double] erf(const Scalar& x) { ^ In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:29: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/../SpecialFunctions:75: /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/arch/GPU/GpuSpecialFunctions.h:87:33: error: call to 'erf' is ambiguous return make_double2(erf(a.x), erf(a.y)); ^~~ /home/rocm-user/eigen/unsupported/test/../../Eigen/src/Core/MathFunctions.h:1603:8: note: candidate function [with T = double] double erf(const double &x) { return ::erf(x); } ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/../src/SpecialFunctions/SpecialFunctionsImpl.h:1897:5: note: candidate function [with Scalar = double] erf(const Scalar& x) { ^ 3 errors generated. ``` This PR fixes the compile error by removing the "old" implementation for "erf" (assuming that the "new" implementation is what we want going forward. from a GPU point-of-view both implementations are the same). This PR also fixes what seems like a cut-n-paste error in the aforementioned commit
* Fix a bug in a packed block type in TensorContractionThreadPoolGravatar Eugene Zhulenev2019-09-24
|
* Merged in rmlarsen/eigen (pull request PR-704)Gravatar Rasmus Larsen2019-09-24
|\ | | | | | | Add generic PacketMath implementation of the Error Function (erf).
| * Add TODO to cleanup FMA cost modelling.Gravatar Rasmus Munk Larsen2019-09-24
| |
* | Choose TensorBlock StridedLinearCopy type staticallyGravatar Eugene Zhulenev2019-09-24
| |
* | Add new TensorBlock api implementation + testsGravatar Eugene Zhulenev2019-09-24
| |
* | Tensor block evaluation V2 support for unary/binary/broadcstingGravatar Eugene Zhulenev2019-09-24
| |
* | bug #1746: Removed implementation of standard copy-constructor and standard ↵Gravatar Christoph Hertzberg2019-09-24
| | | | | | | | copy-assign-operator from PermutationMatrix and Transpositions to allow malloc-less std::move. Added unit-test to rvalue_types
* | Fix implicit conversion warnings and use pnegate to negate packetsGravatar Christoph Hertzberg2019-09-23
| |
* | Fix (or mask away) conversion warnings introduced in ↵Gravatar Christoph Hertzberg2019-09-23
| | | | | | | | | | | | 553caeb6a3bb545aef895f8fc9f219be44679017 .
* | Add support for asynchronous evaluation of tensor casting expressions.Gravatar Rasmus Munk Larsen2019-09-19
| |