Commit message | Author | Age | |
---|---|---|---|
* | matrix product: move the alpha factor to gebp instead of the packing, | 2010-07-12 | |
| | | | | clean some temporaries, etc. | ||
* | mixing types step 3: | 2010-07-11 | |
| | | | | | - improve support of colmajor by vector and matrix - matrix - now all configurations are well handled, but the performance is not always very good | ||
* | make colmaj * vector use pointers only | 2010-07-11 | |
| | |||
* | mixing types in product step 2: | 2010-07-11 | |
| | | | | | | | | * pload* and pset1 are now templated on the packet type * gemv routines are now embedded into a structure with a consistent API with respect to gemm * some configurations of vector * matrix and matrix * matrix work fine, some need more work... | ||
* | sync | 2010-07-10 | |
|\ | |||
| * | * generalize rowmajor by vector | 2010-07-10 | |
| | | | | | | | | * fix weird compilation error when constructing a matrix with a row by matrix product | ||
| * | fix compilation: make the check_coordinates* functions const | 2010-07-10 | |
| | | |||
| * | let ei_pset1 use _mm_loaddup_pd. Not a significant speed improvement, but ↵ | 2010-07-09 | |
| | | | | | | | | also not a speed regression, and replaces 3 instructions by 1 single instruction. | ||
| * | Added NEON/Complex.h, ~3.5x faster than scalar std::complex<float> | 2010-07-10 | |
| | | | | | | | | minor fix in AltiVec Complex.h | ||
| * | disable MSVC optimization when the underlying compiler is ICC | 2010-07-09 | |
| | | |||
| * | move ei_conj_if to a more appropriate file | 2010-07-09 | |
| | | |||
| * | forgot to commit ei_p4f_FORWARD; | 2010-07-09 | |
| | | |||
| * | Altivec port of Complex.h. | 2010-07-09 | |
| | | | | | | | | | | | | | | | | Note: For some reason g++ 4.4 is >200% slower than g++ 4.3 on altivec code. The same benchmark (bench_gemm) was tested, on the same hardware/OS (G4/Debian testing), with same CFLAGS. With some code reorganizing I managed to get some minor gain on 4.4, but I just could not reach 4.3 speed. This is most likely a bug, but I'm waiting to see if it's fixed on 4.5. I'll look into this a bit more. | ||
* | | fix a few weird issues with gcc 4.3 32bits and complex<float> | 2010-07-09 | |
| | | |||
* | | fix SliceVectorizedTraversal for packetsize==1 | 2010-07-08 | |
| | | |||
* | | scalars fitting in a single packet requires more work, step 1 | 2010-07-08 | |
| | | | | | | | | | | * add an Alignable trait * update LinearVectorization assignment | ||
* | | compilation fix | 2010-07-08 | |
| | | |||
| * | enabling aligned loads/store for complex<double> is much more tricky, | 2010-07-07 | |
| | | | | | | | | so the temporary fix is to always perform unaligned load/store | ||
* | | an attempt to fix wrong unaligned store | 2010-07-07 | |
| | | |||
* | | support for real * complex matrix product - step 1 (works for some special ↵ | 2010-07-07 | |
|/ | | | | cases) | ||
* | sync | 2010-07-07 | |
|\ | |||
* | | optimized conjugate products for SSE3 | 2010-07-07 | |
| | | |||
* | | optimize for SSE3 => significant speed up !! | 2010-07-07 | |
| | | |||
* | | optimize pmul for complex<double> | 2010-07-07 | |
| | | |||
* | | optimize gemv for complex<double> and fix gcc alignment issue in 32bits | 2010-07-07 | |
| | | |||
* | | cleaning | 2010-07-07 | |
| | | |||
* | | s/IsVectorized/Vectorizable | 2010-07-07 | |
| | | |||
* | | add a compile time error if someone calls packet on Diagonal (instead of ↵ | 2010-07-07 | |
| | | | | | | | | infinite runtime loop) | ||
* | | fix vectorization rule of diagonal-product | 2010-07-07 | |
| | | |||
* | | fix row cwise-prod column in coeff based products... | 2010-07-07 | |
| | | | | | | | | I really don't know why this worked so far... | ||
* | | add a conj_product functor and optimize dot products | 2010-07-07 | |
| | | |||
* | | fix mixing types in DiagonalProduct | 2010-07-07 | |
| | | |||
* | | * add an IsVectorized mechanism (instead of packet-size>1...) | 2010-07-06 | |
| | | | | | | | | * vectorize complex<double> | ||
* | | bug fix: forgot to conjugate the scalar factor when needed | 2010-07-06 | |
| | | |||
* | | reduce code generation and minor speed up | 2010-07-06 | |
| | | |||
* | | add support for vectorized conjugated products | 2010-07-06 | |
| | | |||
| * | Various documentation improvements. | 2010-07-06 | |
| | | | | | | | | | | | | | | * Add short documentation for Array class * Put all classes explicitly in Core module (where applicable) * Section on Modules in Quick Reference Guide * Put Page 7 after Page 6 in Contents :) | ||
| * | Avoid calling resizeLike, if EIGEN_NO_AUTOMATIC_RESIZING is defined | 2010-07-06 | |
| | | |||
| * | indentation | 2010-07-06 | |
| | | |||
* | | * extend the Has* packet traits and make all functors use them | 2010-07-05 | |
| | | | | | | | | * extend the packing routines to support conjugation | ||
| * | oops I did not see that | 2010-07-05 | |
| | | |||
* | | add initial support for the vectorization of complex<float> | 2010-07-05 | |
| | | |||
| * | add check for non x86 platforms, we get a compile error on arm/powerpc ↵ | 2010-07-05 | |
|/ | | | | | | without the check (there is no known -yet- method to get cpuid, without resorting to the kernel /sys interface) | ||
* | fix warning "type qualifiers ignored on function return type" for long long ↵ | 2010-07-05 | |
| | | | | scalar types | ||
* | fix unaligned workspace in sybb | 2010-07-05 | |
| | |||
* | comment the EIGEN_EMPTY_STRUCT_CTOR workaround for gcc 4.3 | 2010-07-04 | |
| | |||
* | suppress warning and add a fixme about this transpose argument | 2010-07-03 | |
| | |||
* | fix openmp for row major destination | 2010-07-03 | |
| | |||
* | fix bug with openmp | 2010-07-03 | |
| | |||
* | fix compilation issue with clang | 2010-07-01 | |
| |