aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch/AVX512
Commit message (Expand)AuthorAge
...
* Implement vectorized versions of log1p and expm1 in Eigen using Kahan's formu...Gravatar Rasmus Munk Larsen2019-08-12
* Various fixes for packet ops.Gravatar Rasmus Munk Larsen2019-06-20
* Add masked_store_available to unpacket_traitsGravatar Eugene Zhulenev2019-05-02
* Add masked pstoreu to AVX and AVX512 PacketMathGravatar Eugene Zhulenev2019-05-02
* Adding lowlevel APIs for optimized RHS packet load in TensorFlowGravatar Anuj Rawat2019-04-20
* fix alignment in ploadquadGravatar Gael Guennebaud2019-02-22
* AVX512: implement faster ploadquad<Packet16f> thus speeding up GEMMGravatar Gael Guennebaud2019-02-21
* bug #1678: workaround MSVC compilation issues with AVX512Gravatar Gael Guennebaud2019-02-15
* Fix conflicts and mergeGravatar Gael Guennebaud2019-01-30
|\
* | Renaming some more `I` identifiersGravatar Christoph Hertzberg2019-01-26
* | Fix compilation error for logical packet ops with older compilers.Gravatar Rasmus Munk Larsen2019-01-16
* | AVX512: fix pgather/pscatter for Packet4cd and unaligned pointersGravatar Gael Guennebaud2019-01-14
* | AVX512 (r)sqrt(double) was mistakenly disabled with clang and othersGravatar Gael Guennebaud2019-01-14
* | Resolve.Gravatar Rasmus Munk Larsen2019-01-11
|\ \
| * \ Merged eigen/eigen into defaultGravatar Rasmus Larsen2019-01-11
| |\ \
| | * | Remove reinterpret_cast from AVX512 complex implementationGravatar Mark D Ryan2019-01-11
* | | | Rename pones -> ptrue. Use _CMP_TRUE_UQ where appropriate.Gravatar Rasmus Munk Larsen2019-01-09
|\ \ \ \
| | * | | Collapsed revisionGravatar Rasmus Munk Larsen2019-01-09
* | | | | Collapsed revisionGravatar Rasmus Munk Larsen2019-01-09
| |/ / / |/| | |
| * | | Simplify a bit.Gravatar Rasmus Munk Larsen2019-01-09
| * | | Add packet up "pones". Write pnot(a) as pxor(pones(a), a).Gravatar Rasmus Munk Larsen2019-01-09
|/ / /
* | | Merged eigen/eigen into defaultGravatar Rasmus Larsen2019-01-09
|\| |
| * | fix plog(+inf) with AVX512Gravatar Gael Guennebaud2019-01-09
| * | Add dedicated implementations of predux_any for AVX512, NEON, and Altivec/VSEGravatar Gael Guennebaud2019-01-09
| * | Add missing pcmp_lt and others for AVX512Gravatar Gael Guennebaud2019-01-09
* | | Add support for pcmp_eq and pnot, including for complex types.Gravatar Rasmus Munk Larsen2019-01-07
|/ /
* | PR560: Fix the AVX512f only buildsGravatar Mark D Ryan2019-01-03
* | One more stupid AVX 512 fix (I don't have direct access to AVX512 machines)Gravatar Gael Guennebaud2018-12-24
* | Add EIGEN_STRONG_INLINE where requiredGravatar Gael Guennebaud2018-12-24
* | Add missing pcmp_lt_or_nan for AVX512Gravatar Gael Guennebaud2018-12-23
| * Introducing "vectorized" byte on unpacket_traits structsGravatar Gustavo Lima Chaves2018-12-19
|/
* Properly set the number of registers for AVX512Gravatar Gael Guennebaud2018-12-11
* bug #1641: fix testing of pandnot and fix pandnot for complex on SSE/AVX/AVX512Gravatar Gael Guennebaud2018-12-08
* AVX512f includes FMA but GCC does not define __FMA__ with -mavx512f onlyGravatar Gael Guennebaud2018-12-06
* Fix compilation with avx512f only, i.e., no AVX512DQGravatar Gael Guennebaud2018-12-06
* Implement AVX512 vectorization of std::complex<float/double>Gravatar Gael Guennebaud2018-12-06
* Several improvements regarding packet-bitwise operations:Gravatar Gael Guennebaud2018-11-30
* Add psin/pcos on AVX512 -> almost for free, at last!Gravatar Gael Guennebaud2018-11-30
* Fix pandnot order in AVX512Gravatar Gael Guennebaud2018-11-30
* Fix float-to-double warningGravatar Gael Guennebaud2018-10-16
* Fix warning with AVX512fGravatar Gael Guennebaud2018-10-11
* Fix avx512 plog(NaN) to return NaN instead of +infGravatar Gael Guennebaud2018-10-11
* Enable avx512 plog with clangGravatar Gael Guennebaud2018-10-11
* fix alignment issue in ploaddup for AVX512Gravatar Gael Guennebaud2018-09-28
* Fix warnings in AVX512Gravatar Gael Guennebaud2018-09-20
* Use Intel cast intrinsics, since MSVC does not allow direct casting.Gravatar Christoph Hertzberg2018-08-24
* Re-enable FMA for fast sqrt functionsGravatar Mark D Ryan2018-07-30
* Fix AVX512 implementations of psqrtGravatar Mark D Ryan2018-06-25
* Fix compilation with MSVC by reverting to char* for _mm_prefetch except for P...Gravatar Gael Guennebaud2018-06-07
* fix AVX512 plogGravatar Jayaram Bobba2018-04-20