aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
* Clamp tanh approximation outside [-c, c] where c is the smallest value where ↵Gravatar Rasmus Munk Larsen2019-12-12
| | | | the approximation is exactly +/-1. Without FMA, c = 7.90531110763549805, with FMA c = 7.99881172180175781.
* Fix implementation of complex expm1. Add tests that fail with previous ↵Gravatar Srinivas Vasudevan2019-12-12
| | | | implementation, but pass with the current one.
* Initialize non-trivially constructible types when allocating a temp buffer.Gravatar Eugene Zhulenev2019-12-12
|
* Squeeze reads from two inner dimensions in TensorPaddingGravatar Eugene Zhulenev2019-12-11
|
* Add back accidentally deleted default constructor to ↵Gravatar Eugene Zhulenev2019-12-11
| | | | TensorExecutorTilingContext.
* Added io testGravatar Joel Holdsworth2019-12-11
|
* IO: Fixed printing of char and unsigned char matricesGravatar Joel Holdsworth2019-12-11
|
* Added Eigen::numext typedefs for uint8_t, int8_t, uint16_t and int16_tGravatar Joel Holdsworth2019-12-11
|
* Bug 1786: fix compilation with MSVCGravatar Gael Guennebaud2019-12-11
|
* Remove block memory allocation required by removed block evaluation APIGravatar Eugene Zhulenev2019-12-10
|
* Remove V2 suffix from TensorBlockGravatar Eugene Zhulenev2019-12-10
|
* Remove TensorBlock.h and old TensorBlock/BlockMapperGravatar Eugene Zhulenev2019-12-10
|
* Fix for HIP breakage detected on 191210Gravatar Deven Desai2019-12-10
| | | | | | | | The following commit introduces compile errors when running eigen with hipcc https://gitlab.com/libeigen/eigen/commit/2918f85ba976dbfbf72f7d4c1961a577f5850148 hipcc errors out because it requies the device attribute on the methods within the TensorBlockV2ResourceRequirements struct instroduced by the commit above. The fix is to add the device attribute to those methods
* Do not use std::vector in getResourceRequirementsGravatar Eugene Zhulenev2019-12-09
|
* Undo the block size change.Gravatar Artem Belevich2019-12-09
| | | | .z *is* used by the EigenContractionKernelInternal().
* Add async evaluation support to TensorSelectOpGravatar Eugene Zhulenev2019-12-09
|
* fix AlignedVector3 inconsisent interface with other Vector classes, default ↵Gravatar Janek Kozicki2019-12-06
| | | | constructor and operator- were missing.
* Add recursive work splitting to EvalShardedByInnerDimContextGravatar Eugene Zhulenev2019-12-05
|
* Improve performance of contraction kernelsGravatar Artem Belevich2019-12-05
| | | | | | | | | | * Force-inline implementations. They pass around pointers to shared memory blocks. Without inlining compiler must operate via generic pointers. Inlining allows compiler to detect that we're operating on shared memory which allows generation of substantially faster code. * Fixed a long-standing typo which resulted in launching 8x more kernels than we needed (.z dimension of the block is unused by the kernel).
* update hg to git hashesGravatar Gael Guennebaud2019-12-05
|
* Add missing initialization in cxx11_tensor_trace.cpp.Gravatar Rasmus Munk Larsen2019-12-04
|
* Replace calls to "hg" by calls to "git"Gravatar Gael Guennebaud2019-12-04
|
* Update old links to bitbucket to point to gitlab.comGravatar Gael Guennebaud2019-12-04
|
* Added tag before-git-migration for changeset ↵Gravatar Gael Guennebaud2019-12-04
| | | | a7c7d329d89e8484be58df6078a586c44523db37
* Merged in ezhulenev/eigen-01 (pull request PR-769)Gravatar Rasmus Larsen2019-12-04
|\ | | | | | | Capture TensorMap by value inside tensor expression AST
* \ Merged in ↵Gravatar Rasmus Larsen2019-12-04
|\ \ | | | | | | | | | | | | | | | | | | | | | anshuljl/eigen-2/Anshul-Jaiswal/update-configurevectorizationh-to-not-op-1573079916090 (pull request PR-754) Update ConfigureVectorization.h to not optimize fp16 routines when compiling with cuda. Approved-by: Deven Desai <deven.desai.amd@gmail.com>
| | * Capture TensorMap by value inside tensor expression ASTGravatar Eugene Zhulenev2019-12-03
| |/ |/|
* | Remove __host__ annotation for device-only function.Gravatar Rasmus Munk Larsen2019-12-03
| |
* | Use EIGEN_DEVICE_FUNC macro instead of __device__.Gravatar Rasmus Munk Larsen2019-12-03
| |
* | Fix QuaternionBase::cast for quaternion map and wrapper.Gravatar Gael Guennebaud2019-12-03
| |
* | bug #1776: fix vector-wise STL iterator's operator-> using a proxy as ↵Gravatar Gael Guennebaud2019-12-03
| | | | | | | | | | | | pointer type. This changeset fixes also the value_type definition.
* | Revert the specialization for scalar_logistic_op<float> introduced in:Gravatar Rasmus Munk Larsen2019-12-02
| | | | | | | | | | | | | | https://bitbucket.org/eigen/eigen/commits/77b447c24e3344e43ff64eb932d4bb35a2db01ce While providing a 50% speedup on Haswell+ processors, the large relative error outside [-18, 18] in this approximation causes problems, e.g., when computing gradients of activation functions like softplus in neural networks.
* | Merged in ezhulenev/eigen-02 (pull request PR-767)Gravatar Rasmus Larsen2019-12-02
|\ \ | | | | | | | | | Fix shadow warnings in AlignedBox and SparseBlock
* | | Fix for the HIP build+test errors.Gravatar Deven Desai2019-12-02
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Recent changes have introduced the following build error when compiling with HIPCC --------- unsupported/test/../../Eigen/src/Core/GenericPacketMath.h:254:58: error: 'ldexp': no overloaded function has restriction specifiers that are compatible with the ambient context 'pldexp' --------- The fix for the error is to pick the math function(s) from the global namespace (where they are declared as device functions in the HIP header files) when compiling with HIPCC.
* | | Merged in codeplaysoftware/eigen/SYCL-Backend (pull request PR-691)Gravatar Rasmus Larsen2019-11-28
|\ \ \ | | | | | | | | | | | | | | | | | | | | SYCL Backend Approved-by: Rasmus Larsen <rmlarsen@google.com>
| * | | [SYCL] Rebasing the SYCL support branch on top of the Einge upstream master ↵Gravatar Mehdi Goli2019-11-28
|/ / / | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | branch. * Unifying all loadLocalTile from lhs and rhs to an extract_block function. * Adding get_tensor operation which was missing in TensorContractionMapper. * Adding the -D method missing from cmake for Disable_Skinny Contraction operation. * Wrapping all the indices in TensorScanSycl into Scan parameter struct. * Fixing typo in Device SYCL * Unifying load to private register for tall/skinny no shared * Unifying load to vector tile for tensor-vector/vector-tensor operation * Removing all the LHS/RHS class for extracting data from global * Removing Outputfunction from TensorContractionSkinnyNoshared. * Combining the local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining General Tensor-Vector and VectorTensor contraction into one kernel. * Making double buffering optional for Tensor contraction when local memory is version is used. * Modifying benchmark to accept custom Reduction Sizes * Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host * Adding Test for SYCL * Modifying SYCL CMake
| * / Fix shadow warnings in AlignedBox and SparseBlockGravatar Eugene Zhulenev2019-11-27
|/ /
* | Add missing EIGEN_DEVICE_FUNC attribute to template specializations for pexp ↵Gravatar Rasmus Munk Larsen2019-11-27
| | | | | | | | to fix GPU build.
* | Fix warning due to missing cast for exponent arguments for std::frexp and ↵Gravatar Rasmus Munk Larsen2019-11-26
| | | | | | | | std::lexp.
* | Merged in realjhol/eigen/fix-warnings (pull request PR-760)Gravatar Rasmus Larsen2019-11-26
|\ \ | | | | | | | | | Fix warnings
* | | Add async evaluation support to TensorReverseGravatar Eugene Zhulenev2019-11-26
| | |
* | | Add async evaluation support to TensorPadding/TensorImagePatch/TensorShufflingGravatar Eugene Zhulenev2019-11-26
| | |
| * | SparseRef: Fixed alignment warning on ARM GCCGravatar Joel Holdsworth2019-11-07
| | |
| | * Update ConfigureVectorization.h to not optimize fp16 routines when compiling ↵Gravatar Anshul Jaiswal2019-11-06
| | | | | | | | | | | | with cuda.
| * | test/packetmath: Silence alignment warningsGravatar Joel Holdsworth2019-11-05
|/ /
* | Add an explicit example for auto and re-evaluationGravatar Gael Guennebaud2019-11-20
| |
* | COMP: Simplify install commands for EigenGravatar Hans Johnson2019-11-17
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Confirm that install directory is identical before and after this simplifying patch. ```bash hg clone <<Eigen>> mkdir eigen-bld cd eigen-bld cmake ../Eigen -DCMAKE_INSTALL_PREFIX:PATH=/tmp/bef make install find /tmp/pre_eigen_modernize >/tmp/bef # Apply this patch cmake ../Eigen -DCMAKE_INSTALL_PREFIX:PATH=/tmp/aft make install find /tmp/post_eigen_modernize |sed 's/post_e/pre_e/g' >/tmp/aft diff /tmp/bef /tmp/aft ```
* | COMP: target_compile_definitions requires cmake 2.8.11Gravatar Hans Johnson2019-11-17
| | | | | | | | | | | | | | | | | | Features committed in 2016 have required cmake verison 2.8.11. `sergiu Tue Nov 22 12:25:06 2016 +0100: target_compile_definitions` Set the minimum cmake version to the minimum version that is capable of compiling or installing the code base.
* | Fix duplicate symbol linking error.Gravatar Gael Guennebaud2019-11-20
| |
* | STYLE: Convert CMake-language commands to lower caseGravatar Hans Johnson2019-10-31
| | | | | | | | | | Ancient CMake versions required upper-case commands. Later command names became case-insensitive. Now the preferred style is lower-case.