Commit message (Collapse) | Author | Age | ||
---|---|---|---|---|
... | ||||
* | mostly cleanups and modernizing code | 2016-06-19 | ||
| | ||||
* | fixed pexp<Packet2d>, was failing tests | 2016-06-19 | ||
| | ||||
* | define Packet2d constants with VSX only | 2016-04-28 | ||
| | ||||
* | remove accidentally pasted code | 2016-04-28 | ||
| | ||||
* | improve state of MathFunctions as well | 2016-04-28 | ||
| | ||||
* | bring Altivec/VSX to a better state, implement some of the missing functions | 2016-04-28 | ||
| | ||||
* | minor fix, add to copyright | 2016-04-28 | ||
| | ||||
* | Implement plog and pexp for AltiVec. | 2015-07-30 | ||
| | ||||
* | Fix prototype of plset and generalize linspace functor. | 2015-08-07 | ||
| | ||||
* | Let unpacket_traits<> exposes the required alignment and make use of it ↵ | 2015-08-07 | ||
| | | | | everywhere | |||
* | Fix bug in pdiv<Packet1cd> which swaps 32-bit halves of a pair of | 2015-03-11 | ||
| | | | | doubles instead of swapping the doubles. | |||
* | The usage of DenseIndex is deprecated, so let's replace DenseIndex by Index | 2015-02-16 | ||
| | ||||
* | bug #936, patch 2/3: Remove EIGEN_VECTORIZE_FMA, was redundant with ↵ | 2015-01-30 | ||
| | | | | EIGEN_HAS_SINGLE_INSTRUCTION_MADD | |||
* | bug #936, patch 1.5/3: rename _FUSED_ macros to _SINGLE_INSTRUCTION_, | 2015-01-31 | ||
| | | | | | | | | | because this is what they are about. "Fused" means "no intermediate rounding between the mul and the add, only one rounding at the end". Instead, what we are concerned about here is whether a temporary register is needed, i.e. whether the MUL and ADD are separate instructions. Concretely, on ARM NEON, a single-instruction mul-add is always available: VMLA. But a true fused mul-add is only available on VFPv4: VFMA. | |||
* | bug #936, patch 1/3: some cleanup and renaming for consistency. | 2015-01-30 | ||
| | ||||
* | fixed to make big-endian VSX work as well | 2014-10-01 | ||
| | ||||
* | prefetch are noops on VSX, actually disable the prefetch trait | 2014-09-21 | ||
| | ||||
* | fix compile error on big endian altivec | 2014-09-21 | ||
| | ||||
* | prefetch are noops on VSX | 2014-09-21 | ||
| | ||||
* | VSX supports vec_div, implement where appropriate (float/doubles) | 2014-09-21 | ||
| | ||||
* | VSX port passes packetmath_[1-5] tests! | 2014-09-20 | ||
| | ||||
* | 32-bit floats/ints, 64-bit doubles pass packetmath tests, complex 32/64-bit ↵ | 2014-09-19 | ||
| | | | | remaining | |||
* | First time it compiles, but fails to pass the tests. | 2014-09-09 | ||
| | ||||
* | Initial VSX commit | 2014-08-29 | ||
| | ||||
* | Simplification of some Altivec constants, reuse existing constants and avoid ↵ | 2014-07-22 | ||
| | | | | loading from RAM esp in the case of p16uc_COMPLEX_TRANSPOSE* | |||
* | Added HasDiv=1 to Altivec PacketMath.h, now vectorization_logic test passes. | 2014-07-15 | ||
| | | | | Added comments to the constants, indicative of the actual values | |||
* | Fix many long to int implicit conversions | 2014-07-08 | ||
| | ||||
* | Implement pbroadcast4 on altivec | 2014-04-25 | ||
| | ||||
* | Enable vectorization of pack_rhs with a column-major RHS. | 2014-04-25 | ||
| | | | | Rename and generalize Kernel<*> to PacketBlock<*,N>. | |||
* | Enable fused madd for Altivec | 2014-04-24 | ||
| | ||||
* | Implement ptranspose on altivec and fix pgather/pscatter | 2014-04-24 | ||
| | ||||
* | Add Altivec implementation of pgather/pscatter (not tested) | 2014-04-23 | ||
| | ||||
* | New gebp kernel handling up to 3 packets x 4 register-level blocks. Huge ↵ | 2014-04-16 | ||
| | | | | | | speeup on Haswell. This changeset also introduce new vector functions: ploadquad and predux4. | |||
* | Add a mechanism to recursively access to half-size packet types | 2014-03-28 | ||
| | ||||
* | Fix ploaddup and lin-spaced with AltiVec. | 2013-09-10 | ||
| | ||||
* | Add missing pconj specializations | 2013-05-17 | ||
| | ||||
* | Automatic relicensing to MPL2 using Keirs script. Manual fixup follows. | 2012-07-13 | ||
| | ||||
* | Get rid of include directives inside namespace blocks (bug #339). | 2012-04-15 | ||
| | ||||
* | fix static inline versus inline static issues (the former is the correct order) | 2012-01-31 | ||
| | ||||
* | better fix for gcc 4.6.0 / ptrdiff_t, as suggested by Benoit | 2011-05-05 | ||
| | ||||
* | Fix compilation with gcc-4.6.0, patch provided by Anton Gladky ↵ | 2011-05-05 | ||
| | | | | | | <gladky.anton@gmail.com>, working on debian packaging. | |||
* | fix AltiVec ploaddup | 2011-02-24 | ||
| | ||||
* | fix pset1 for complex | 2011-02-23 | ||
| | ||||
* | implement ploaddup for altivec and add respective unit test | 2011-02-23 | ||
| | ||||
* | fix a couple of issues with pcplxflip | 2011-02-23 | ||
| | ||||
* | implement pcplxflip for altivec | 2011-02-23 | ||
| | ||||
* | Altivec: fix infinite loop (ei_ -> internal:: change) | 2011-02-23 | ||
| | ||||
* | add missing AlignedOnScalar | 2011-02-22 | ||
| | ||||
* | altivec compilation fix | 2011-02-22 | ||
| | ||||
* | more compilation fixes for altivec | 2011-02-21 | ||
| |