aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch/SSE/PacketMath.h
Commit message (Expand)AuthorAge
...
* Various fixes for packet ops.Gravatar Rasmus Munk Larsen2019-06-20
* Add masked_store_available to unpacket_traitsGravatar Eugene Zhulenev2019-05-02
* Fix regression in changeset ae33e866c750c6c24ada5c6f7f3ec15815d0e683Gravatar Gael Guennebaud2019-05-02
* Fix compilation with PGI version 19Gravatar Andy May2019-04-25
* Adding lowlevel APIs for optimized RHS packet load in TensorFlowGravatar Anuj Rawat2019-04-20
* Fix conflicts and mergeGravatar Gael Guennebaud2019-01-30
|\
* \ Rename pones -> ptrue. Use _CMP_TRUE_UQ where appropriate.Gravatar Rasmus Munk Larsen2019-01-09
|\ \
* | | Collapsed revisionGravatar Rasmus Munk Larsen2019-01-09
| * | Add packet up "pones". Write pnot(a) as pxor(pones(a), a).Gravatar Rasmus Munk Larsen2019-01-09
|/ /
* | Merged eigen/eigen into defaultGravatar Rasmus Larsen2019-01-09
|\ \
| * | bug #1652: implements a much more accurate version of vectorized sin/cos. Thi...Gravatar Gael Guennebaud2019-01-09
* | | Add support for pcmp_eq and pnot, including for complex types.Gravatar Rasmus Munk Larsen2019-01-07
|/ /
| * Introducing "vectorized" byte on unpacket_traits structsGravatar Gustavo Lima Chaves2018-12-19
|/
* Properly set the number of registers for AVX512Gravatar Gael Guennebaud2018-12-11
* Enable FMA with MSVC (through /arch:AVX2). To make this possible, I also has ...Gravatar Gael Guennebaud2018-12-07
* AVX512f includes FMA but GCC does not define __FMA__ with -mavx512f onlyGravatar Gael Guennebaud2018-12-06
* Implement AVX512 vectorization of std::complex<float/double>Gravatar Gael Guennebaud2018-12-06
* same for pmaxGravatar Gael Guennebaud2018-11-28
* pmin/pmax o SSE: make sure to use AVX instruction with AVX enabled, and disab...Gravatar Gael Guennebaud2018-11-28
* Use explicit packet type in SSE/PacketMath pldexpGravatar Eugene Zhulenev2018-11-27
* bug #1631: fix compilation with ARM NEON and clang, and cleanup the weird psh...Gravatar Gael Guennebaud2018-11-27
* Update pshiftleft to pass the shift as a true compile-time integer.Gravatar Gael Guennebaud2018-11-27
* Unify SSE/AVX psin functions.Gravatar Gael Guennebaud2018-11-27
* Unify SSE and AVX pexp for double.Gravatar Gael Guennebaud2018-11-26
* Unify SSE and AVX implementation of pexpGravatar Gael Guennebaud2018-11-26
* First step toward a unification of packet log implementation, currently only ...Gravatar Gael Guennebaud2018-11-26
* Make SSE/AVX pandnot(A,B) consistent with generic version, i.e., "A and not B"Gravatar Gael Guennebaud2018-11-26
* bug #1605: workaround ABI issue with vector types (aka __m128) versus scalar ...Gravatar Gael Guennebaud2018-10-01
* remove double ;;Gravatar Gael Guennebaud2018-07-12
* Fix compilation with MSVC by reverting to char* for _mm_prefetch except for P...Gravatar Gael Guennebaud2018-06-07
* Fix compilation and SSE support with PGI compilerGravatar Gael Guennebaud2018-05-29
* Make NaN propagatation consistent between the pmax/pmin and std::max/std::min...Gravatar Rasmus Munk Larsen2017-01-24
* bug #1363: fix mingw's ABI issueGravatar Gael Guennebaud2016-12-15
* Disable usage of SSE3 _mm_hadd_ps that is extremely slow.Gravatar Gael Guennebaud2016-11-22
* Disable usage of SSE3 haddpd that is extremely slow.Gravatar Gael Guennebaud2016-11-22
* Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX.Gravatar Gael Guennebaud2016-11-02
* Add missing inline keywordsGravatar Gael Guennebaud2016-10-25
* Fixed a typoGravatar Benoit Steiner2016-10-25
* Add a pinsertlast function replacing the last entry of a packet by a scalar.Gravatar Gael Guennebaud2016-10-25
* bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with som...Gravatar Gael Guennebaud2016-09-08
* Implement pmadd for float and double to make it consistent with the vectorize...Gravatar Gael Guennebaud2016-08-23
* Remove now-unused protate PacketMath funcGravatar Benoit Jacob2016-05-24
* Optimized implementation of the tanh function for SSEGravatar Benoit Steiner2016-02-10
* Remove custom unaligned loads for SSE. They were only useful for core2 CPU.Gravatar Gael Guennebaud2016-02-08
* Fix "," in non SSE4 modeGravatar Gael Guennebaud2015-11-05
* Add round, ceil and floor for SSE4.1/AVX (Bug #70)Gravatar Alexandre Avenel2015-11-01
* bug #1085: workaround gcc default ABI issueGravatar Gael Guennebaud2015-10-10
* _mm_hadd_epi32 is for SSSE3 only (and not SSE3)Gravatar Gael Guennebaud2015-10-07
* Handle various TODOs in SSE vectorization (remove splitted storeu, enable SSE...Gravatar Gael Guennebaud2015-10-06
* Fix prototype of plset and generalize linspace functor.Gravatar Gael Guennebaud2015-08-07