aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src
Commit message (Collapse)AuthorAge
...
* | Enable vectorization of pack_rhs with a column-major RHS.Gravatar Gael Guennebaud2014-04-25
| | | | | | | | Rename and generalize Kernel<*> to PacketBlock<*,N>.
* | Enable fused madd for AltivecGravatar Gael Guennebaud2014-04-24
| |
* | Implement ptranspose on altivec and fix pgather/pscatterGravatar Gael Guennebaud2014-04-24
| |
* | Fixed the NEON implementation of predux_max<Packet4i>.Gravatar Benoit Steiner2014-04-23
| |
* | Created a NEON version of the ptranspose packet primitivesGravatar Benoit Steiner2014-04-23
| |
* | Add Altivec implementation of pgather/pscatter (not tested)Gravatar Gael Guennebaud2014-04-23
| |
* | Fix EIGEN_MAKE_UNALIGNED_ARRAY_ASSERT macroGravatar Gael Guennebaud2014-04-22
| |
* | merge with default branchGravatar Gael Guennebaud2014-04-22
|\ \
* | | Workaround gcc's default ABI not being able to distinghish between vector ↵Gravatar Gael Guennebaud2014-04-22
| | | | | | | | | | | | types of different sizes.
* | | Fix 128bit packet size assumptions in unit tests.Gravatar Gael Guennebaud2014-04-18
| | |
* | | Fix alignment assertion.Gravatar Gael Guennebaud2014-04-18
| | |
* | | Fix calls to lazy products (lazy product does not like matrices with 0 length)Gravatar Gael Guennebaud2014-04-18
| | |
* | | Smarter block size computationGravatar Gael Guennebaud2014-04-18
| | |
* | | Fix typo (was working with clang\!)Gravatar Gael Guennebaud2014-04-18
| | |
* | | Fixes for fixed sizes and non vectorizable typesGravatar Gael Guennebaud2014-04-17
| | |
* | | mergeGravatar Gael Guennebaud2014-04-17
|\ \ \
| * | | Implemented the pgather/pscatter packet primitives for the arm/NEON architectureGravatar Benoit Steiner2014-04-17
| | | |
* | | | Optimize AVX pset1 for complexes and ploaddupGravatar Gael Guennebaud2014-04-17
| | | |
* | | | Fix and optimize mixed productsGravatar Gael Guennebaud2014-04-17
| | | |
* | | | Optimize ploaddup for AVXGravatar Gael Guennebaud2014-04-17
|/ / /
* | | Fallback to lazy products for very small ones.Gravatar Gael Guennebaud2014-04-16
| | |
* | | Enable alloca on MAC OSXGravatar Gael Guennebaud2014-04-16
| | |
* | | New gebp kernel handling up to 3 packets x 4 register-level blocks. Huge ↵Gravatar Gael Guennebaud2014-04-16
| | | | | | | | | | | | | | | | | | speeup on Haswell. This changeset also introduce new vector functions: ploadquad and predux4.
| * | Check IMKL version for compatibility with EigenGravatar Mark Borgerding2014-04-15
| | |
| * | bug #793: detect NaN and INF in EigenSolver instead of aborting with an assert.Gravatar Gael Guennebaud2014-04-14
| | |
| * | Add isfinite overload for complexes.Gravatar Gael Guennebaud2014-04-14
| | |
* | | Optimized SSE unaligned loads and stores when compiling a 64bit target with ↵Gravatar Benoit Steiner2014-04-14
| | | | | | | | | | | | a recent version of gcc (ie gcc 4.8).
| * | bug #790: fix overflow in real_2x2_jacobi_svdGravatar Gael Guennebaud2014-04-14
| | |
| * | bug #793: fix overflow in EigenSolver and add respective regression unit testGravatar Gael Guennebaud2014-04-14
| | |
| * | Updated my previous fix to avoid introducing a compilation warning on ARM ↵Gravatar Benoit Steiner2014-04-10
| | | | | | | | | | | | platforms.
| * | Silenced a compilation warning produced by nvcc.Gravatar Benoit Steiner2014-04-10
| |/
| * doc: Add references to Cholesky methods in SelfAdjointView.Gravatar Jitse Niesen2014-04-07
| |
* | Deleted some dead code.Gravatar Benoit Steiner2014-04-04
| |
| * Fix bug #784: Assert if assigning a product to a triangularView does not ↵Gravatar Christoph Hertzberg2014-04-04
| | | | | | | | match the size.
| * bug #782: Workaround for gcc <= 4.4 compilation error on the NEON PacketMath ↵Gravatar Benoit Steiner2014-04-03
| | | | | | | | code.
* | Finally, prefetching seems to help getting more stable performanceGravatar Gael Guennebaud2014-03-31
| |
* | Workaround alignment warningsGravatar Gael Guennebaud2014-03-30
| |
* | Optimize gebp kernel:Gravatar Gael Guennebaud2014-03-30
| | | | | | | | | | 1 - increase peeling level along the depth dimention (+5% for large matrices, i.e., >1000) 2 - improve pipelining when dealing with latest rows of the lhs
* | Vectorized the loop peeling of the inner loop of the block-panel matrix ↵Gravatar Benoit Steiner2014-03-28
| | | | | | | | multiplication code. This speeds up the multiplication of matrices which size is not a multiple of the packet size.
* | Add a mechanism to recursively access to half-size packet typesGravatar Gael Guennebaud2014-03-28
| |
* | merge with default branchGravatar Gael Guennebaud2014-03-28
|\|
* | Enable vectorization of gemv for PacketSize>4 through unaligned loads (still ↵Gravatar Gael Guennebaud2014-03-28
| | | | | | | | better than no vectorization)
* | Merged latest changes from parent.Gravatar Benoit Steiner2014-03-27
|\ \
* | | Implemented the SSE version of the gather and scatter packet primitives.Gravatar Benoit Steiner2014-03-27
| | |
* | | Implemented the AVX version of the gather and scatter packet primitives.Gravatar Benoit Steiner2014-03-27
| | |
* | | Introduced pscatter/pgather packet primitives. They will be used to optimize ↵Gravatar Benoit Steiner2014-03-27
| | | | | | | | | | | | the loop peeling code of the block-panel matrix multiplication kernel.
| * | enforce the use of vfmadd231ps for pmadd (gcc and clang stupidely generates ↵Gravatar Gael Guennebaud2014-03-27
|/ / | | | | | | the other fmadd variants plus some register moves...)
* | Fixed compilation error when FMA instructions are enabled.Gravatar Benoit Steiner2014-03-27
| |
* | Silenced "unused variable" warnings when compiling with FMA.Gravatar Benoit Steiner2014-03-27
| |
* | Vectorized the packing of a col-major matrix used as the right hand side ↵Gravatar Benoit Steiner2014-03-27
| | | | | | | | argument in a matrix-matrix product when AVX instructions are used. No vectorization takes place when SSE instructions are used, however this doesn't seem to impact performance.