path: root/Eigen/src/Core/arch/SSE/PacketMath.h
Commit message (Author, Date)
* Enable vectorization of pack_rhs with a column-major RHS. (Gael Guennebaud, 2014-04-25)
* Enable fused madd for Altivec (Gael Guennebaud, 2014-04-24)
* Workaround gcc's default ABI not being able to distinghish between vector typ... (Gael Guennebaud, 2014-04-22)
* New gebp kernel handling up to 3 packets x 4 register-level blocks. Huge spee... (Gael Guennebaud, 2014-04-16)
* Optimized SSE unaligned loads and stores when compiling a 64bit target with a... (Benoit Steiner, 2014-04-14)
* Add a mechanism to recursively access to half-size packet types (Gael Guennebaud, 2014-03-28)
* Implemented the SSE version of the gather and scatter packet primitives. (Benoit Steiner, 2014-03-27)
* Created the ptranspose packet primitive that can transpose an array of N pack... (Benoit Steiner, 2014-03-26)
* Merged latest updates from the parent branch (Benoit Steiner, 2014-03-26)
|\
| * Implement new 1 packet x 8 gebp kernel (Gael Guennebaud, 2014-03-26)
| * add pbroadcast2/4 generic intrinsics (Gael Guennebaud, 2014-03-26)
* | Added support for FMA instructions (Benoit Steiner, 2014-02-24)
* | Added support for AVX to Eigen. (Benoit Steiner, 2014-01-29)
| * Revert previous change and introduce a new workaround regarding gcc generatin... (Gael Guennebaud, 2014-03-20)
| * Makes gcc to generate a pshufd instruction for pset1 (Gael Guennebaud, 2014-03-20)
|/
* Remove useless register keyword, and optimize predux_min/max for SSE4 (Gael Guennebaud, 2014-01-25)
* Fix bug #642: add vectorization of sqrt for doubles, and make sqrt really saf... (Gael Guennebaud, 2013-08-19)
* Add missing pconj specializations (Gael Guennebaud, 2013-05-17)
* Add SSE4 min/max for integers (Gael Guennebaud, 2013-03-20)
* add SSE pexp function for double, make use of _mm_floor_p* for pexp with SSE4.1 (Gael Guennebaud, 2012-07-27)
* Automatic relicensing to MPL2 using Keirs script. Manual fixup follows. (Benoit Jacob, 2012-07-13)
* Get rid of include directives inside namespace blocks (bug #339). (Jitse Niesen, 2012-04-15)
* proper C++ casting (Gael Guennebaud, 2012-01-31)
* fix static inline versus inline static issues (the former is the correct order) (Gael Guennebaud, 2012-01-31)
* no comment (Gael Guennebaud, 2011-09-21)
* quick workaround of MSVC9' ICE in pset1 (Gael Guennebaud, 2011-09-21)
* re-enable fast pset1-pstore by introducing a new higher level pstore1 function (Gael Guennebaud, 2011-03-02)
* fix bug #203: revert to using _mm_set1_p[sd] (Benoit Jacob, 2011-02-28)
* remove now-useless comments (Benoit Jacob, 2011-02-27)
* fix bug #201: Clang too has intrinsics bugs preventing us to use custom unali... (Benoit Jacob, 2011-02-27)
* bug #195 - fix this once and for all: just never use _mm_load_sd on gcc/i386,... (Benoit Jacob, 2011-02-27)
* fix bug #195: fast unaligned load for integer using _mm_load_sd failed when t... (Benoit Jacob, 2011-02-24)
* MSVC does not like using uninitialized SSE variables, so we have to pass all ... (Hauke Heibel, 2011-02-12)
* implement optimized ploadu for MSVC10: this also fix bad code generation in g... (Gael Guennebaud, 2011-02-12)
* The pfirst hack is needed also on msvc 2010 as it gets completely nuts, even ... (Benoit Jacob, 2011-02-09)
* Initial fixes for bug #85. (Hauke Heibel, 2010-10-25)
* bug #86 : use internal:: namespace instead of ei_ prefix (Benoit Jacob, 2010-10-25)
* fix determination of number of registers on sse: (Benoit Jacob, 2010-08-13)
* * fix compilation of mixed scalar product (Gael Guennebaud, 2010-07-19)
* * _mm_loaddup_pd is slow (Gael Guennebaud, 2010-07-19)
* wip: extend the gebp kernel to optimize complex and mixed products (Gael Guennebaud, 2010-07-19)
* mixing types in product step 2: (Gael Guennebaud, 2010-07-11)
* sync (Gael Guennebaud, 2010-07-10)
|\
| * let ei_pset1 use _mm_loaddup_pd. Not a significant speed improvement, but als... (Benoit Jacob, 2010-07-09)
| * disable MSVC optimization when the underlying compiler is ICC (Gael Guennebaud, 2010-07-09)
* | scalars fitting in a single packet requires more work, step 1 (Gael Guennebaud, 2010-07-08)
|/
* optimize pmul for complex<double> (Gael Guennebaud, 2010-07-07)
* s/IsVectorized/Vectorizable (Gael Guennebaud, 2010-07-07)
* * add a IsVectorized mechanism (instead of packet-size>1...) (Gael Guennebaud, 2010-07-06)
* add support for vectorized conjugated products (Gael Guennebaud, 2010-07-06)