path: root/Eigen/src/Core/arch
Commit message (Author, Date)
* Reverse arguments for pmin in AVX. (Rasmus Munk Larsen, 2017-01-25)
* Fix NaN propagation for AVX512. (Rasmus Munk Larsen, 2017-01-24)
* Make NaN propagatation consistent between the pmax/pmin and std::max/std::min... (Rasmus Munk Larsen, 2017-01-24)
* Add std:: namespace prefix to all (hopefully) instances if size_t/ptrdfiff_t (Gael Guennebaud, 2017-01-23)
* Avoid using horizontal adds since they're not very efficient. (Benoit Steiner, 2016-12-21)
* Use native AVX512 types instead of Eigen Packets whenever possible. (Benoit Steiner, 2016-12-21)
* Fixed compilation errors with gcc6 when compiling the AVX512 intrinsics (Benoit Steiner, 2016-12-19)
* Use 32 registers on ARM64 (Benoit Jacob, 2016-12-19)
* bug #1360: fix sign issue with pmull on altivec (Gael Guennebaud, 2016-12-18)
* Fix unused warning (Gael Guennebaud, 2016-12-18)
* bug #1363: fix mingw's ABI issue (Gael Guennebaud, 2016-12-15)
* Fix expm1 CUDA implementation (do not shadow exp CUDA implementation). (Srinivas Vasudevan, 2016-12-05)
* Fix small nit where I changed name of plog1p to pexpm1. (Srinivas Vasudevan, 2016-12-02)
* Added support for expm1 in Eigen. (Srinivas Vasudevan, 2016-12-02)
* Add a default constructor for the "fake" __half class when not using the (Rasmus Munk Larsen, 2016-11-29)
* Fix compilation with gcc and old ABI version (Gael Guennebaud, 2016-11-23)
* Optimize predux<Packet8f> (AVX) (Gael Guennebaud, 2016-11-22)
* Disable usage of SSE3 _mm_hadd_ps that is extremely slow. (Gael Guennebaud, 2016-11-22)
* Optimize predux<Packet4d> (AVX) (Gael Guennebaud, 2016-11-22)
* Disable usage of SSE3 haddpd that is extremely slow. (Gael Guennebaud, 2016-11-22)
* implement float/std::complex<float> for ZVector as well, minor fixes to ZVector (Konstantinos Margaritis, 2016-11-17)
* Optimized the computation of exp, sqrt, ceil anf floor for fp16 on Pascal GPUs (Benoit Steiner, 2016-11-16)
* Merged eigen/eigen into default (Benoit Steiner, 2016-11-03)
|\
| * Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX. (Gael Guennebaud, 2016-11-02)
| * Gate the code that refers to cuda fp16 primitives more thoroughly (Benoit Steiner, 2016-11-01)
| * Add missing inline keywords (Gael Guennebaud, 2016-10-25)
| * Fixed a typo (Benoit Steiner, 2016-10-25)
| * Add a pinsertlast function replacing the last entry of a packet by a scalar. (Gael Guennebaud, 2016-10-25)
* | Deleted redundant implementation of predux (Benoit Steiner, 2016-10-12)
* | Merged eigen/eigen into default (Benoit Steiner, 2016-10-12)
|\|
* | Take advantage of AVX512 instructions whenever possible to speedup the proces... (Benoit Steiner, 2016-10-12)
| * Merged in rmlarsen/eigen (pull request PR-230) (Gael Guennebaud, 2016-10-12)
| |\
| | * Fix copy-paste error: Must use _mm256_cmp_ps for AVX. (Rasmus Munk Larsen, 2016-10-12)
| * | bug #1325: fix compilation on NEON with clang (Gael Guennebaud, 2016-10-12)
* | | Renamed predux_half into predux_downto4 (Benoit Steiner, 2016-10-06)
* | | Fixed incorrect comment (Benoit Steiner, 2016-10-06)
* | | Fixed compilation error with gcc >= 5.3 (Benoit Steiner, 2016-10-06)
* | | Silenced a compilation warning (Benoit Steiner, 2016-10-06)
| * | Added missing AVX intrinsics for fp16: in particular, implemented predux whic... (Benoit Steiner, 2016-10-06)
* | | Deleted unecessary CMakeLists.txt file (Benoit Steiner, 2016-10-05)
* | | Silenced a compilation warning. (Benoit Steiner, 2016-10-05)
* | | Merged latest updates from trunk (Benoit Steiner, 2016-10-05)
|\| |
* | | Silenced a few compilation warnings (Benoit Steiner, 2016-10-05)
| * | Properly characterize the CUDA packet primitives for fp16 as device only (Benoit Steiner, 2016-10-04)
| | * Update comment for fast sqrt. (Rasmus Munk Larsen, 2016-10-04)
| | * Update comment for fast sqrt. (Rasmus Munk Larsen, 2016-10-04)
| | * Fix a bug in the implementation of Carmack's fast sqrt algorithm in Eigen (en... (Rasmus Munk Larsen, 2016-10-04)
| |/
| * Added support for constand std::complex numbers on GPU (Benoit Steiner, 2016-10-03)
| * Added missing typedefs (Benoit Steiner, 2016-09-20)
| * Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_d... (RJ Ryan, 2016-09-20)