| Commit message | Author | Age |
... | |
* | Various fixes for packet ops. | Rasmus Munk Larsen | 2019-06-20 |
* | Add masked_store_available to unpacket_traits | Eugene Zhulenev | 2019-05-02 |
* | Fix regression in changeset ae33e866c750c6c24ada5c6f7f3ec15815d0e683 | Gael Guennebaud | 2019-05-02 |
* | Fix compilation with PGI version 19 | Andy May | 2019-04-25 |
* | Adding lowlevel APIs for optimized RHS packet load in TensorFlow | Anuj Rawat | 2019-04-20 |
* | Fix conflicts and merge | Gael Guennebaud | 2019-01-30 |
|\ |
* \ | Rename pones -> ptrue. Use _CMP_TRUE_UQ where appropriate. | Rasmus Munk Larsen | 2019-01-09 |
|\ \ |
* | | | Collapsed revision | Rasmus Munk Larsen | 2019-01-09 |
| * | | Add packet up "pones". Write pnot(a) as pxor(pones(a), a). | Rasmus Munk Larsen | 2019-01-09 |
|/ / |
* | | Merged eigen/eigen into default | Rasmus Larsen | 2019-01-09 |
|\ \ |
| * | | bug #1652: implements a much more accurate version of vectorized sin/cos. Thi... | Gael Guennebaud | 2019-01-09 |
* | | | Add support for pcmp_eq and pnot, including for complex types. | Rasmus Munk Larsen | 2019-01-07 |
|/ / |
| * | Introducing "vectorized" byte on unpacket_traits structs | Gustavo Lima Chaves | 2018-12-19 |
|/ |
* | Properly set the number of registers for AVX512 | Gael Guennebaud | 2018-12-11 |
* | Enable FMA with MSVC (through /arch:AVX2). To make this possible, I also has ... | Gael Guennebaud | 2018-12-07 |
* | AVX512f includes FMA but GCC does not define __FMA__ with -mavx512f only | Gael Guennebaud | 2018-12-06 |
* | Implement AVX512 vectorization of std::complex<float/double> | Gael Guennebaud | 2018-12-06 |
* | same for pmax | Gael Guennebaud | 2018-11-28 |
* | pmin/pmax o SSE: make sure to use AVX instruction with AVX enabled, and disab... | Gael Guennebaud | 2018-11-28 |
* | Use explicit packet type in SSE/PacketMath pldexp | Eugene Zhulenev | 2018-11-27 |
* | bug #1631: fix compilation with ARM NEON and clang, and cleanup the weird psh... | Gael Guennebaud | 2018-11-27 |
* | Update pshiftleft to pass the shift as a true compile-time integer. | Gael Guennebaud | 2018-11-27 |
* | Unify SSE/AVX psin functions. | Gael Guennebaud | 2018-11-27 |
* | Unify SSE and AVX pexp for double. | Gael Guennebaud | 2018-11-26 |
* | Unify SSE and AVX implementation of pexp | Gael Guennebaud | 2018-11-26 |
* | First step toward a unification of packet log implementation, currently only ... | Gael Guennebaud | 2018-11-26 |
* | Make SSE/AVX pandnot(A,B) consistent with generic version, i.e., "A and not B" | Gael Guennebaud | 2018-11-26 |
* | bug #1605: workaround ABI issue with vector types (aka __m128) versus scalar ... | Gael Guennebaud | 2018-10-01 |
* | remove double ;; | Gael Guennebaud | 2018-07-12 |
* | Fix compilation with MSVC by reverting to char* for _mm_prefetch except for P... | Gael Guennebaud | 2018-06-07 |
* | Fix compilation and SSE support with PGI compiler | Gael Guennebaud | 2018-05-29 |
* | Make NaN propagatation consistent between the pmax/pmin and std::max/std::min... | Rasmus Munk Larsen | 2017-01-24 |
* | bug #1363: fix mingw's ABI issue | Gael Guennebaud | 2016-12-15 |
* | Disable usage of SSE3 _mm_hadd_ps that is extremely slow. | Gael Guennebaud | 2016-11-22 |
* | Disable usage of SSE3 haddpd that is extremely slow. | Gael Guennebaud | 2016-11-22 |
* | Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX. | Gael Guennebaud | 2016-11-02 |
* | Add missing inline keywords | Gael Guennebaud | 2016-10-25 |
* | Fixed a typo | Benoit Steiner | 2016-10-25 |
* | Add a pinsertlast function replacing the last entry of a packet by a scalar. | Gael Guennebaud | 2016-10-25 |
* | bug #1195: move NumTraits::Div<>::Cost to internal::scalar_div_cost (with som... | Gael Guennebaud | 2016-09-08 |
* | Implement pmadd for float and double to make it consistent with the vectorize... | Gael Guennebaud | 2016-08-23 |
* | Remove now-unused protate PacketMath func | Benoit Jacob | 2016-05-24 |
* | Optimized implementation of the tanh function for SSE | Benoit Steiner | 2016-02-10 |
* | Remove custom unaligned loads for SSE. They were only useful for core2 CPU. | Gael Guennebaud | 2016-02-08 |
* | Fix "," in non SSE4 mode | Gael Guennebaud | 2015-11-05 |
* | Add round, ceil and floor for SSE4.1/AVX (Bug #70) | Alexandre Avenel | 2015-11-01 |
* | bug #1085: workaround gcc default ABI issue | Gael Guennebaud | 2015-10-10 |
* | _mm_hadd_epi32 is for SSSE3 only (and not SSE3) | Gael Guennebaud | 2015-10-07 |
* | Handle various TODOs in SSE vectorization (remove splitted storeu, enable SSE... | Gael Guennebaud | 2015-10-06 |
* | Fix prototype of plset and generalize linspace functor. | Gael Guennebaud | 2015-08-07 |