aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch/AVX
Commit message (Expand)AuthorAge
* Move implementation of vectorized error function erf() to SpecialFunctionsImp...Gravatar Rasmus Munk Larsen2019-09-27
* Add generic PacketMath implementation of the Error Function (erf).Gravatar Rasmus Munk Larsen2019-09-19
* Add Bessel functions to SpecialFunctions.Gravatar Srinivas Vasudevan2019-09-14
* Add packetized versions of i0e and i1e special functions.Gravatar Srinivas Vasudevan2019-09-11
* Merging from eigen/eigen.Gravatar Srinivas Vasudevan2019-09-03
|\
* | Add ndtri function, the inverse of the normal distribution function.Gravatar Srinivas Vasudevan2019-08-12
| * Fix missing header inclusion and colliding definitions for half type casting,...Gravatar Rasmus Munk Larsen2019-08-30
| * Clean up float16 a.k.a. Eigen::half support in Eigen. Move the definition of ...Gravatar Rasmus Munk Larsen2019-08-27
| * Implement vectorized versions of log1p and expm1 in Eigen using Kahan's formu...Gravatar Rasmus Munk Larsen2019-08-12
|/
* Add masked_store_available to unpacket_traitsGravatar Eugene Zhulenev2019-05-02
* Add masked pstoreu to AVX and AVX512 PacketMathGravatar Eugene Zhulenev2019-05-02
* Use packet ops instead of AVX2 intrinsicsGravatar Eugene Zhulenev2019-04-23
* Adding lowlevel APIs for optimized RHS packet load in TensorFlowGravatar Anuj Rawat2019-04-20
* Apply SSE's pmin/pmax fix for GCC <= 5 to AVX's pmin/pmaxGravatar Gael Guennebaud2019-03-10
* Fix conflicts and mergeGravatar Gael Guennebaud2019-01-30
|\
* | Fix compilation error for logical packet ops with older compilers.Gravatar Rasmus Munk Larsen2019-01-16
* | Rename pones -> ptrue. Use _CMP_TRUE_UQ where appropriate.Gravatar Rasmus Munk Larsen2019-01-09
|\ \
* | | Collapsed revisionGravatar Rasmus Munk Larsen2019-01-09
| * | Simplify a bit.Gravatar Rasmus Munk Larsen2019-01-09
| * | Add packet up "pones". Write pnot(a) as pxor(pones(a), a).Gravatar Rasmus Munk Larsen2019-01-09
|/ /
* | Merged eigen/eigen into defaultGravatar Rasmus Larsen2019-01-09
|\ \
| * | bug #1652: implements a much more accurate version of vectorized sin/cos. Thi...Gravatar Gael Guennebaud2019-01-09
* | | Add support for pcmp_eq and pnot, including for complex types.Gravatar Rasmus Munk Larsen2019-01-07
|/ /
| * Introducing "vectorized" byte on unpacket_traits structsGravatar Gustavo Lima Chaves2018-12-19
|/
* Properly set the number of registers for AVX512Gravatar Gael Guennebaud2018-12-11
* bug #1641: fix testing of pandnot and fix pandnot for complex on SSE/AVX/AVX512Gravatar Gael Guennebaud2018-12-08
* Enable FMA with MSVC (through /arch:AVX2). To make this possible, I also has ...Gravatar Gael Guennebaud2018-12-07
* bug #1637: workaround register spilling in gebp with clang>=6.0+AVX+FMAGravatar Gael Guennebaud2018-12-07
* AVX512f includes FMA but GCC does not define __FMA__ with -mavx512f onlyGravatar Gael Guennebaud2018-12-06
* Implement AVX512 vectorization of std::complex<float/double>Gravatar Gael Guennebaud2018-12-06
* Add missing padd for Packet8i (it was implicitly generated by clang and gcc)Gravatar Gael Guennebaud2018-11-30
* Several improvements regarding packet-bitwise operations:Gravatar Gael Guennebaud2018-11-30
* CleanupGravatar Gael Guennebaud2018-11-30
* Extend the generic psin_float code to handle cosine and make SSE and AVX use ...Gravatar Gael Guennebaud2018-11-30
* Disable fma gcc's workaround for gcc >= 8 (based on GEMM benchmarks)Gravatar Gael Guennebaud2018-11-28
* bug #1631: fix compilation with ARM NEON and clang, and cleanup the weird psh...Gravatar Gael Guennebaud2018-11-27
* Update pshiftleft to pass the shift as a true compile-time integer.Gravatar Gael Guennebaud2018-11-27
* Unify SSE/AVX psin functions.Gravatar Gael Guennebaud2018-11-27
* cleanupGravatar Gael Guennebaud2018-11-26
* Unify SSE and AVX pexp for double.Gravatar Gael Guennebaud2018-11-26
* Unify SSE and AVX implementation of pexpGravatar Gael Guennebaud2018-11-26
* First step toward a unification of packet log implementation, currently only ...Gravatar Gael Guennebaud2018-11-26
* Make SSE/AVX pandnot(A,B) consistent with generic version, i.e., "A and not B"Gravatar Gael Guennebaud2018-11-26
* Fix compilation with MSVC by reverting to char* for _mm_prefetch except for P...Gravatar Gael Guennebaud2018-06-07
* Fix compilation and SSE support with PGI compilerGravatar Gael Guennebaud2018-05-29
* comment unreachable codeGravatar Gael Guennebaud2018-04-03
* Rename predux_downto4 to be more accurate on its semantic.Gravatar Gael Guennebaud2018-04-03
* bug #1436: fix compilation of Jacobi rotations with ARM NEON, some specializa...Gravatar Gael Guennebaud2017-06-15
* Reverse arguments for pmin in AVX.Gravatar Rasmus Munk Larsen2017-01-25
* Make NaN propagatation consistent between the pmax/pmin and std::max/std::min...Gravatar Rasmus Munk Larsen2017-01-24