aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
* Add async evaluation support to TensorPadding/TensorImagePatch/TensorShufflingGravatar Eugene Zhulenev2019-11-26
|
* Add an explicit example for auto and re-evaluationGravatar Gael Guennebaud2019-11-20
|
* COMP: Simplify install commands for EigenGravatar Hans Johnson2019-11-17
| | | | | | | | | | | | | | | | | | | | | Confirm that install directory is identical before and after this simplifying patch. ```bash hg clone <<Eigen>> mkdir eigen-bld cd eigen-bld cmake ../Eigen -DCMAKE_INSTALL_PREFIX:PATH=/tmp/bef make install find /tmp/pre_eigen_modernize >/tmp/bef # Apply this patch cmake ../Eigen -DCMAKE_INSTALL_PREFIX:PATH=/tmp/aft make install find /tmp/post_eigen_modernize |sed 's/post_e/pre_e/g' >/tmp/aft diff /tmp/bef /tmp/aft ```
* COMP: target_compile_definitions requires cmake 2.8.11Gravatar Hans Johnson2019-11-17
| | | | | | | | | Features committed in 2016 have required cmake verison 2.8.11. `sergiu Tue Nov 22 12:25:06 2016 +0100: target_compile_definitions` Set the minimum cmake version to the minimum version that is capable of compiling or installing the code base.
* Fix duplicate symbol linking error.Gravatar Gael Guennebaud2019-11-20
|
* STYLE: Convert CMake-language commands to lower caseGravatar Hans Johnson2019-10-31
| | | | | Ancient CMake versions required upper-case commands. Later command names became case-insensitive. Now the preferred style is lower-case.
* STYLE: Remove CMake-language block-end command argumentsGravatar Hans Johnson2019-10-31
| | | | | | Ancient versions of CMake required else(), endif(), and similar block termination commands to have arguments matching the command starting the block. This is no longer the preferred style.
* 1. Fix a bug in psqrt and make it return 0 for +inf arguments.Gravatar Rasmus Munk Larsen2019-11-15
| | | | | | | | | | | | | | | | 2. Simplify handling of special cases by taking advantage of the fact that the builtin vrsqrt approximation handles negative, zero and +inf arguments correctly. This speeds up the SSE and AVX implementations by ~20%. 3. Make the Newton-Raphson formula used for rsqrt more numerically robust: Before: y = y * (1.5 - x/2 * y^2) After: y = y * (1.5 - y * (x/2) * y) Forming y^2 can overflow for very large or very small (denormalized) values of x, while x*y ~= 1. For AVX512, this makes it possible to compute accurate results for denormal inputs down to ~1e-42 in single precision. 4. Add a faster double precision implementation for Knights Landing using the vrsqrt28 instruction and a single Newton-Raphson iteration. Benchmark results: https://bitbucket.org/snippets/rmlarsen/5LBq9o
* bug #1744: fix compilation with MSVC 2017 and AVX512, plog1p/pexpm1 require ↵Gravatar Gael Guennebaud2019-11-15
| | | | plog/pexp, but the later was disabled on some compilers
* bug #1747: one more fix for MSVC regarding the Bessel implementation.Gravatar Gael Guennebaud2019-11-15
|
* bug #1281: fix AutoDiffScalar's make_coherent for nested expression of ↵Gravatar Gael Guennebaud2019-11-14
| | | | constant ADs.
* Fix case issue with Lapack unit testsGravatar Gael Guennebaud2019-11-14
|
* bug #1774: fix VectorwiseOp::begin()/end() return types regarding constness.Gravatar Gael Guennebaud2019-11-14
|
* PR 751: Fixed compilation issue when compiling using MSVC with /arch:AVX512 flagGravatar Sakshi Goynar2019-10-31
|
* Enable CompleteOrthogonalDecomposition::pseudoInverse with non-square ↵Gravatar Gael Guennebaud2019-11-13
| | | | fixed-size matrices.
* Move to my.cdash.orgGravatar Gael Guennebaud2019-11-13
|
* Remove legacy block evaluation supportGravatar Eugene Zhulenev2019-11-12
|
* Disable AVX on broken xcode versions. See PR 748.Gravatar Gael Guennebaud2019-11-12
| | | | Patch adapted from Hans Johnson's PR 748.
* Fix a race in async tensor evaluation: Don't run on_done() until after ↵Gravatar Rasmus Munk Larsen2019-11-11
| | | | device.deallocate() / evaluator.cleanup() complete, since the device might be destroyed after on_done() runs.
* Break loop dependence in TensorGenerator block accessGravatar Eugene Zhulenev2019-11-11
|
* Fix data race in css11_tensor_notification test.Gravatar Rasmus Munk Larsen2019-11-08
|
* Add EIGEN_HAS_INTRINSIC_INT128 macroGravatar Rasmus Munk Larsen2019-11-06
| | | | Add a new EIGEN_HAS_INTRINSIC_INT128 macro, and use this instead of __SIZEOF_INT128__. This fixes related issues with TensorIntDiv.h when building with Clang for Windows, where support for 128-bit integer arithmetic is advertised but broken in practice.
* Rollback or PR-746 and partial rollback of ↵Gravatar Rasmus Munk Larsen2019-11-05
| | | | | | | | https://bitbucket.org/eigen/eigen/commits/668ab3fc474e54c7919eda4fbaf11f3a99246494 . std::array is still not supported in CUDA device code on Windows.
* Merged in ezhulenev/eigen-01 (pull request PR-746)Gravatar Rasmus Larsen2019-11-04
|\ | | | | | | Remove internal::smart_copy and replace with std::copy
* | Cleanup includes in Tensor module after switch to C++11 and aboveGravatar Eugene Zhulenev2019-10-29
| |
| * Remove internal::smart_copy and replace with std::copyGravatar Eugene Zhulenev2019-10-29
|/
* Fix CXX11Meta compilation with MSVCGravatar Eugene Zhulenev2019-10-28
|
* Prevent potential ODR in TensorExecutorGravatar Eugene Zhulenev2019-10-28
|
* This PR fixes:Gravatar Mehdi Goli2019-10-23
| | | | | * The specialization of array class in the different namespace for GCC<=6.4 * The implicit call to `std::array` constructor using the initializer list for GCC <=6.1
* Merged in deven-amd/eigen-hip-fix-191018 (pull request PR-738)Gravatar Rasmus Larsen2019-10-22
|\ | | | | | | Fix for the HIP build+test errors.
* | Add block evaluation V2 to TensorAsyncExecutor.Gravatar Rasmus Munk Larsen2019-10-22
| | | | | | | | Add async evaluation to a number of ops.
| * Fix for the HIP build+test errors.Gravatar Deven Desai2019-10-22
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The errors were introduced by this commit : After the above mentioned commit, some of the tests started failing with the following error ``` Built target cxx11_tensor_reduction Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:117: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:155:5: error: the field type is not amp-compatible DestinationBufferKind m_kind; ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:211:3: error: the field type is not amp-compatible DestinationBuffer m_destination; ^ ``` For some reason HIPCC does not like device code to contain enum types which do not have the base-type explicitly declared. The fix is trivial, explicitly state "int" as the basetype
* | Drop support for c++03 in Eigen tensor. Get rid of some code used to emulate ↵Gravatar Rasmus Munk Larsen2019-10-18
|/ | | | c++11 functionality with older compilers.
* Propagate block evaluation preference through rvalue tensor expressionsGravatar Eugene Zhulenev2019-10-17
|
* Cleanup Tensor block destination and materialized block storage allocationGravatar Eugene Zhulenev2019-10-16
|
* TensorBroadcasting support for random/uniform blocksGravatar Eugene Zhulenev2019-10-16
|
* Block evaluation for TensorGenerator/TensorReverse/TensorShufflingGravatar Eugene Zhulenev2019-10-14
|
* bug #1747: fix compilation with MSVCGravatar Gael Guennebaud2019-10-14
|
* Block evaluation for TensorGenerator + TensorReverse + fixed bug in tensor ↵Gravatar Eugene Zhulenev2019-10-10
| | | | reverse op
* Merged in ezhulenev/eigen-01 (pull request PR-726)Gravatar Rasmus Larsen2019-10-10
|\ | | | | | | Block evaluation for TensorChipping + fixed bugs in TensorPadding and TensorSlicing
* | bug #1752: make is_convertible equivalent to the std c++11 equivalent and ↵Gravatar Gael Guennebaud2019-10-10
| | | | | | | | fallback to std::is_convertible when c++11 is enabled.
* | bug #1752: disable some is_convertible tests for recent compilers.Gravatar Gael Guennebaud2019-10-10
| |
| * Block evaluation for TensorChipping + fixed bugs in TensorPadding and ↵Gravatar Eugene Zhulenev2019-10-09
|/ | | | TensorSlicing
* Implement c++03 compatible fix for changeset ↵Gravatar Gael Guennebaud2019-10-09
| | | | 7a43af1a335da2c0489b4119a33ee1cbff0c15d6
* Explicitly bypass resize and memmoves when there is already the exact right ↵Gravatar Gael Guennebaud2019-10-08
| | | | number of elements available.
* Disable an expected warning in unit testGravatar Gael Guennebaud2019-10-08
|
* fix one more possible conflicts with real/imagGravatar Gael Guennebaud2019-10-08
|
* PR 719: fix real/imag namespace conflictGravatar Gael Guennebaud2019-10-08
|
* Fix compilation of FFTW unit testGravatar Gael Guennebaud2019-10-08
|
* Add block evaluation to TensorEvalTo and fix few small bugsGravatar Eugene Zhulenev2019-10-07
|