| Commit message | Author | Age |
* | Generalized predux4 to support AVX512 packets, and renamed it predux_half. | Benoit Steiner | 2016-02-01 |
* | Don't disable the AVX implementations of plset when compiling with AVX512 ena... | Benoit Steiner | 2016-01-14 |
* | Added alignment requirement to the AVX512 packet traits. | Benoit Steiner | 2016-01-14 |
* | Fixed the signature of the plset primitives for AVX512 | Benoit Steiner | 2016-01-14 |
* | Fixed the AVX512 signature of the ptranspose primitives | Benoit Steiner | 2016-01-14 |
* | pscatter/pgather use Index instead of int to specify the stride | Benoit Steiner | 2016-01-14 |
* | Deleted unnecessary commas and semicolons | Benoit Steiner | 2016-01-14 |
* | Implemented most of the packet primitives for AVX512 | Benoit Steiner | 2015-12-21 |
* | Free memory allocated using posix_memalign() with free() instead of std::free() | Benoit Steiner | 2015-12-21 |
* | Preliminary support for AVX512 | Benoit Steiner | 2015-12-10 |
* | bug #1103: fix neon vectorization of pmul(Packet1cd,Packet1cd) | Gael Guennebaud | 2015-12-10 |
* | Fix "," in non SSE4 mode | Gael Guennebaud | 2015-11-05 |
* | Fix AVX round/ceil/floor, and fix respective unit test | Gael Guennebaud | 2015-11-04 |
* | Merged in aavenel/eigen (pull request PR-142) | Gael Guennebaud | 2015-11-04 |
|\ |
* | | Made the CUDA implementation of ploadt_ro compatible with cuda implementation... | Benoit Steiner | 2015-11-03 |
| * | Add round, ceil and floor for SSE4.1/AVX (Bug #70) | Alexandre Avenel | 2015-11-01 |
|/ |
* | bug #1085: workaround gcc default ABI issue | Gael Guennebaud | 2015-10-10 |
* | _mm_hadd_epi32 is for SSSE3 only (and not SSE3) | Gael Guennebaud | 2015-10-07 |
* | Handle various TODOs in SSE vectorization (remove splitted storeu, enable SSE... | Gael Guennebaud | 2015-10-06 |
* | bug #1069: fix AVX support on MSVC (use of non portable C-style cast) | Gael Guennebaud | 2015-09-28 |
* | Added support for predux_mul for CUDA devices | Benoit Steiner | 2015-09-08 |
* | Implement plog and pexp for AltiVec. | Doug Kwan | 2015-07-30 |
* | Fix prototype of plset and generalize linspace functor. | Gael Guennebaud | 2015-08-07 |
* | Include SSE packetmath when AVX is enabled, and enable AVX's sine function on... | Gael Guennebaud | 2015-08-07 |
* | Let unpacket_traits<> exposes the required alignment and make use of it every... | Gael Guennebaud | 2015-08-07 |
* | Fix shadow warnings triggered by clang | Gael Guennebaud | 2015-06-09 |
* | Abandon blocking size lookup table approach. Not performing as well in real w... | Benoit Jacob | 2015-05-19 |
* | also uninitialized here, see previous cset | Benoit Jacob | 2015-05-15 |
* | Fix uninitialized var warning. The compiler was clearing the register anyway,... | Benoit Jacob | 2015-05-15 |
* | Merged in doug_kwan/eigen (pull request PR-103) | Konstantinos Margaritis | 2015-05-05 |
|\ |
* | | Added a double-precision implementation of the exp() function for AVX. | Benoit Steiner | 2015-05-04 |
* | | Pulled latest update from the eigen main codebase | Benoit Steiner | 2015-03-24 |
|\ \ |
| * | | Fixed the CUDA packet primitives | Benoit Steiner | 2015-03-24 |
| * | | use unsigned short instead of uint16_t which doesn't exist in c++98 | Benoit Jacob | 2015-03-17 |
| * | | Update Nexus 5 lookup table from combining now 2 runs of the benchmark, using... | Benoit Jacob | 2015-03-16 |
| * | | Provide a empirical lookup table for blocking sizes measured on a Nexus 5. On... | Benoit Jacob | 2015-03-15 |
| | * | Fix bug in pdiv<Packet1cd> which swaps 32-bit halves of a pair of | Doug Kwan | 2015-03-11 |
| |/ |
* | | Fixed the optimized AVX implementation of the fast rsqrt function | Benoit Steiner | 2015-03-02 |
* | | Added an optimized version of rsqrt for SSE and AVX that is used when EIGEN_F... | Benoit Steiner | 2015-03-02 |
* | | Pulled latest updates from trunk | Benoit Steiner | 2015-02-27 |
|\ \ |
* | | | Switch to truncated casting when converting floating point types to integer. ... | Benoit Steiner | 2015-02-27 |
* | | | Added support for vectorized type casting of tensors | Benoit Steiner | 2015-02-27 |
* | | | Added support for fast reciprocal square root computation. | Benoit Steiner | 2015-02-26 |
| | * | must also disable complex<double> when disabling double vectorization | Benoit Jacob | 2015-03-03 |
| | * | Work around an ICE in Clang 3.5 in the iOS toolchain with double NEON intrins... | Benoit Jacob | 2015-03-03 |
| | * | HalfPacket also needed to be disabled for double, on ARMv8. | Benoit Jacob | 2015-03-02 |
| |/ |
| * | remove trailing comma | Benoit Jacob | 2015-02-27 |
| * | Disable Packet2f/2i halfpacket support in NEON. | Benoit Jacob | 2015-02-27 |
|/ |
* | Marked the CUDA packet primitives as EIGEN_DEVICE_FUNC since they'll end up b... | Benoit Steiner | 2015-02-19 |
* | bug #955 - Implement a rotating kernel alternative in the 3px4 gebp path | Benoit Jacob | 2015-02-18 |