Commit message | Author | Age | ||
---|---|---|---|---|
... | ||||
* | fix compilation | Gael Guennebaud | 2011-02-21 | |
* | bug #86 : use internal:: namespace instead of ei_ prefix | Benoit Jacob | 2010-10-25 | |
* | mixing types in product step 2: | Gael Guennebaud | 2010-07-11 | |
| | pload* and pset1 are now templated on the packet type; gemv routines are now embedded in a structure whose API is consistent with gemm; some configurations of vector * matrix and matrix * matrix work fine, some need more work... | | | |
* | Added NEON/Complex.h, ~3.5x faster than scalar std::complex<float> | Konstantinos Margaritis | 2010-07-10 | |
| | minor fix in AltiVec Complex.h | | | |
* | Altivec port of Complex.h. | Konstantinos Margaritis | 2010-07-09 | |
| | Note: for some reason, g++ 4.4 is >200% slower than g++ 4.3 on AltiVec code. The same benchmark (bench_gemm) was run on the same hardware and OS (G4, Debian testing) with the same CFLAGS. After some code reorganization I got a minor gain on 4.4, but I could not reach 4.3 speed. This is most likely a bug; I'm waiting to see whether it is fixed in 4.5, and I'll look into it a bit more. | | | |