aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch
Commit message (Expand)AuthorAge
* Get rid of redundant `pabs` instruction in complex square root.Gravatar Rasmus Munk Larsen2021-06-29
* Commit 52a5f982 broke conjhelper functionality for HIP GPUs.Gravatar Rohit Santhanam2021-06-25
* Small cleanup: Get rid of the macros EIGEN_HAS_SINGLE_INSTRUCTION_CJMADD and ...Gravatar Rasmus Munk Larsen2021-06-24
* Get rid of code duplication for conj_helper. For packets where LhsType=RhsTyp...Gravatar Rasmus Munk Larsen2021-06-24
* Remove pset, replace with ploadu.Gravatar Antonio Sanchez2021-06-16
* EIGEN_STRONG_INLINE was NOT inlining in some critical needed areas (6.6X slow...Gravatar Chip-Kerchner2021-06-16
* Add missing ppc pcmp_lt_or_nan<Packet8bf>Gravatar Antonio Sanchez2021-06-15
* Use bit_cast to create -0.0 for floating point types to avoid compiler optimi...Gravatar Rasmus Munk Larsen2021-06-11
* Add missing NEON ptranspose implementations.Gravatar Antonio Sanchez2021-05-25
* Changing the storage of the SSE complex packets to that of the wrapper. This ...Gravatar guoqiangqi2021-05-10
* Revert addition of unused `paddsub<Packet2cf>`. This fixes #2242Gravatar Christoph Hertzberg2021-05-06
* Better CUDA complex division.Gravatar Antonio Sanchez2021-04-29
* Add missing pcmp_lt_or_nan for NEON Packet4bf.Gravatar Antonio Sanchez2021-04-27
* Tests added and AVX512 bug fixed for pcmp_lt_or_nanGravatar Jakub Lichman2021-04-25
* Fix taking address of rvalue compiler issue with TensorFlow (plus other warni...Gravatar Chip-Kerchner2021-04-21
* HasExp added for AVX512 Packet8dGravatar Jakub Lichman2021-04-20
* Fix ldexp for AVX512 (#2215)Gravatar Antonio Sanchez2021-04-20
* Avoid using uninitialized inputs and if available, use slightly more efficien...Gravatar Christoph Hertzberg2021-04-13
* Fix address of temporary object errors in clang11.Gravatar Chip Kerchner2021-04-02
* Eliminate `round_impl` double-promotion warnings for c++03.Gravatar Antonio Sanchez2021-03-25
* Fixed performance issues for complex VSX and P10 MMA in gebp_kernel (level 3).Gravatar Chip Kerchner2021-03-25
* Revert "Uses _mm512_abs_pd for Packet8d pabs"Gravatar Christoph Hertzberg2021-03-23
* Remove yet another comma at end of enumGravatar David Tellenbach2021-03-18
* Uses _mm512_abs_pd for Packet8d pabsGravatar Steve Bronder2021-03-18
* Augment NumTraits with min/max_exponent() again.Gravatar Antonio Sanchez2021-03-16
* Fix another warning on missing commasGravatar David Tellenbach2021-03-17
* Revert "Augment NumTraits with min/max_exponent()."Gravatar David Tellenbach2021-03-17
* Augment NumTraits with min/max_exponent().Gravatar Antonio Sanchez2021-03-17
* Silence warning on comma at end of enumerator listGravatar David Tellenbach2021-03-17
* Add fmod(half, half).Gravatar Antonio Sanchez2021-03-15
* Fix pround and add printGravatar Chip Kerchner2021-03-15
* Fix NVCC+ICC issues.Gravatar Antonio Sanchez2021-03-15
* Add increment/decrement operators to Eigen::half.Gravatar Antonio Sanchez2021-03-15
* Fix ambiguous call to CUDA __half constructor.Gravatar Antonio Sanchez2021-03-08
* Fix typo: DEVICE -> GPUGravatar Antonio Sanchez2021-03-08
* Fix non-trivial Half constructor for CUDA.Gravatar Antonio Sanchez2021-03-08
* Changing the Eigen::half implementation for HIPGravatar Deven Desai2021-03-05
* Fix rint SSE/NEON again, using optimization barrier.Gravatar Antonio Sanchez2021-03-05
* Revert "Fix rint for SSE/NEON."Gravatar Antonio Sánchez2021-03-03
* Fix rint for SSE/NEON.Gravatar Antonio Sanchez2021-03-03
* Add print for SSE/NEON, use NEON rounding intrinsics if available.Gravatar Antonio Sanchez2021-02-27
* Make half/bfloat16 constructor take inputs by value, fix powerpc test.Gravatar Antonio Sanchez2021-02-27
* Fix double-promotion warningsGravatar Christoph Hertzberg2021-02-27
* Fix NEON sqrt for 32-bit, add prsqrt.Gravatar Antonio Sanchez2021-02-26
* Fix floor/ceil for NEON fp16.Gravatar Antonio Sanchez2021-02-25
* Fix SSE/NEON pfloor/pceil for saturated values.Gravatar Antonio Sanchez2021-02-25
* Fix clang compile when no MMA flags are set. Simplify MMA compiler detection.Gravatar Chip-Kerchner2021-02-24
* Having forward template function declarations in a P10 file causes bad code i...Gravatar Chip-Kerchner2021-02-24
* Fixes to support old and new versions of the compilers for built-ins. Cast t...Gravatar Chip-Kerchner2021-02-24
* Disable fast psqrt for NEON.Gravatar Antonio Sanchez2021-02-23