aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch
Commit message (Expand)AuthorAge
* Generalized predux4 to support AVX512 packets, and renamed it predux_half.Gravatar Benoit Steiner2016-02-01
* Don't disable the AVX implementations of plset when compiling with AVX512 ena...Gravatar Benoit Steiner2016-01-14
* Added alignment requirement to the AVX512 packet traits.Gravatar Benoit Steiner2016-01-14
* Fixed the signature of the plset primitives for AVX512Gravatar Benoit Steiner2016-01-14
* Fixed the AVX512 signature of the ptranspose primitivesGravatar Benoit Steiner2016-01-14
* pscatter/pgather use Index instead of int to specify the strideGravatar Benoit Steiner2016-01-14
* Deleted unnecessary commas and semicolonsGravatar Benoit Steiner2016-01-14
* Implemented most of the packet primitives for AVX512Gravatar Benoit Steiner2015-12-21
* Free memory allocated using posix_memalign() with free() instead of std::free()Gravatar Benoit Steiner2015-12-21
* Preliminary support for AVX512Gravatar Benoit Steiner2015-12-10
* bug #1103: fix neon vectorization of pmul(Packet1cd,Packet1cd)Gravatar Gael Guennebaud2015-12-10
* Fix "," in non SSE4 modeGravatar Gael Guennebaud2015-11-05
* Fix AVX round/ceil/floor, and fix respective unit testGravatar Gael Guennebaud2015-11-04
* Merged in aavenel/eigen (pull request PR-142)Gravatar Gael Guennebaud2015-11-04
|\
* | Made the CUDA implementation of ploadt_ro compatible with cuda implementation...Gravatar Benoit Steiner2015-11-03
| * Add round, ceil and floor for SSE4.1/AVX (Bug #70)Gravatar Alexandre Avenel2015-11-01
|/
* bug #1085: workaround gcc default ABI issueGravatar Gael Guennebaud2015-10-10
* _mm_hadd_epi32 is for SSSE3 only (and not SSE3)Gravatar Gael Guennebaud2015-10-07
* Handle various TODOs in SSE vectorization (remove splitted storeu, enable SSE...Gravatar Gael Guennebaud2015-10-06
* bug #1069: fix AVX support on MSVC (use of non portable C-style cast)Gravatar Gael Guennebaud2015-09-28
* Added support for predux_mul for CUDA devicesGravatar Benoit Steiner2015-09-08
* Implement plog and pexp for AltiVec.Gravatar Doug Kwan2015-07-30
* Fix prototype of plset and generalize linspace functor.Gravatar Gael Guennebaud2015-08-07
* Include SSE packetmath when AVX is enabled, and enable AVX's sine function on...Gravatar Gael Guennebaud2015-08-07
* Let unpacket_traits<> exposes the required alignment and make use of it every...Gravatar Gael Guennebaud2015-08-07
* Fix shadow warnings triggered by clangGravatar Gael Guennebaud2015-06-09
* Abandon blocking size lookup table approach. Not performing as well in real w...Gravatar Benoit Jacob2015-05-19
* also uninitialized here, see previous csetGravatar Benoit Jacob2015-05-15
* Fix uninitialized var warning. The compiler was clearing the register anyway,...Gravatar Benoit Jacob2015-05-15
* Merged in doug_kwan/eigen (pull request PR-103)Gravatar Konstantinos Margaritis2015-05-05
|\
* | Added a double-precision implementation of the exp() function for AVX.Gravatar Benoit Steiner2015-05-04
* | Pulled latest update from the eigen main codebaseGravatar Benoit Steiner2015-03-24
|\ \
| * | Fixed the CUDA packet primitivesGravatar Benoit Steiner2015-03-24
| * | use unsigned short instead of uint16_t which doesn't exist in c++98Gravatar Benoit Jacob2015-03-17
| * | Update Nexus 5 lookup table from combining now 2 runs of the benchmark, using...Gravatar Benoit Jacob2015-03-16
| * | Provide a empirical lookup table for blocking sizes measured on a Nexus 5. On...Gravatar Benoit Jacob2015-03-15
| | * Fix bug in pdiv<Packet1cd> which swaps 32-bit halves of a pair ofGravatar Doug Kwan2015-03-11
| |/
* | Fixed the optimized AVX implementation of the fast rsqrt functionGravatar Benoit Steiner2015-03-02
* | Added an optimized version of rsqrt for SSE and AVX that is used when EIGEN_F...Gravatar Benoit Steiner2015-03-02
* | Pulled latest updates from trunkGravatar Benoit Steiner2015-02-27
|\ \
* | | Switch to truncated casting when converting floating point types to integer. ...Gravatar Benoit Steiner2015-02-27
* | | Added support for vectorized type casting of tensorsGravatar Benoit Steiner2015-02-27
* | | Added support for fast reciprocal square root computation.Gravatar Benoit Steiner2015-02-26
| | * must also disable complex<double> when disabling double vectorizationGravatar Benoit Jacob2015-03-03
| | * Work around an ICE in Clang 3.5 in the iOS toolchain with double NEON intrins...Gravatar Benoit Jacob2015-03-03
| | * HalfPacket also needed to be disabled for double, on ARMv8.Gravatar Benoit Jacob2015-03-02
| |/
| * remove trailing commaGravatar Benoit Jacob2015-02-27
| * Disable Packet2f/2i halfpacket support in NEON.Gravatar Benoit Jacob2015-02-27
|/
* Marked the CUDA packet primitives as EIGEN_DEVICE_FUNC since they'll end up b...Gravatar Benoit Steiner2015-02-19
* bug #955 - Implement a rotating kernel alternative in the 3px4 gebp pathGravatar Benoit Jacob2015-02-18