| Commit message | Author | Age |
* | Enable partial support for half floats on Kepler GPUs. | Benoit Steiner | 2016-03-03 |
* | Enable the conversion between floats and half floats on older GPUs that suppo... | Benoit Steiner | 2016-03-03 |
* | Declare the half float type as arithmetic. | Benoit Steiner | 2016-02-22 |
* | Implemented the ptranspose function on half floats | Benoit Steiner | 2016-02-21 |
* | Added the ability to compute the absolute value of a half float | Benoit Steiner | 2016-02-21 |
* | Moved some of the fp16 operators outside the Eigen namespace to workaround so... | Benoit Steiner | 2016-02-20 |
* | Added support for tensor reductions on half floats | Benoit Steiner | 2016-02-19 |
* | Implemented the scalar division of 2 half floats | Benoit Steiner | 2016-02-19 |
* | Added support for operators +=, -=, *= and /= on CUDA half floats | Benoit Steiner | 2016-02-19 |
* | Implemented protate() for CUDA | Benoit Steiner | 2016-02-19 |
* | Added support for simple coefficient wise tensor expression using half floats... | Benoit Steiner | 2016-02-19 |
* | FP16 on CUDA are only available starting with cuda 7.5. Disable them when usi... | Benoit Steiner | 2016-02-18 |
* | Added preliminary support for half floats on CUDA GPU. For now we can simply ... | Benoit Steiner | 2016-02-19 |
* | Improved implementation of ptanh for SSE and AVX | Benoit Steiner | 2016-02-18 |
* | Avoid implicit cast from double to float. | Benoit Steiner | 2016-02-10 |
* | Optimized implementation of the tanh function for SSE | Benoit Steiner | 2016-02-10 |
* | Optimized implementation of the hyperbolic tangent function for AVX | Benoit Steiner | 2016-02-10 |
* | Make the GCC workaround for sqrt GCC-only; detect Emscripten as non-GCC | Benoit Jacob | 2016-02-10 |
* | Work around Emscripten bug - https://github.com/kripken/emscripten/issues/4088 | Benoit Jacob | 2016-02-10 |
* | Remove custom unaligned loads for SSE. They were only useful for core2 CPU. | Gael Guennebaud | 2016-02-08 |
* | merge | Gael Guennebaud | 2016-01-28 |
|\ |
* | | Fix compilation on old gcc+AVX | Gael Guennebaud | 2016-01-21 |
* | | Add numext::sqrt function to enable custom optimized implementation. | Gael Guennebaud | 2016-01-21 |
* | | Workaround clang -Wdocumentation warning about "/*<" | Gael Guennebaud | 2015-12-30 |
| * | Merged eigen/eigen into default | Eugene Brevdo | 2015-12-24 |
| |\
| |/
|/| |
| * | Add digamma for CPU + CUDA. Includes tests. | Eugene Brevdo | 2015-12-24 |
* | | Workaround compilers that do not even define _mm256_set_m128. | Gael Guennebaud | 2015-12-24 |
|/ |
* | Fixed a typo in previous change. | Benoit Steiner | 2015-12-21 |
* | Added support for CUDA architectures that don't support 3.5 capabilities | Benoit Steiner | 2015-12-21 |
* | Fixed a typo. | Benoit Steiner | 2015-12-18 |
* | bug #1140: remove custom definition and use of _mm256_setr_m128 | Gael Guennebaud | 2015-12-18 |
* | Merged in ebrevdo/eigen (pull request PR-148) | Gael Guennebaud | 2015-12-11 |
|\ |
* | | bug #1103: fix neon vectorization of pmul(Packet1cd,Packet1cd) | Gael Guennebaud | 2015-12-10 |
| * | Add special functions to Eigen: lgamma, erf, erfc. | Eugene Brevdo | 2015-12-07 |
|/ |
* | Fix "," in non SSE4 mode | Gael Guennebaud | 2015-11-05 |
* | Fix AVX round/ceil/floor, and fix respective unit test | Gael Guennebaud | 2015-11-04 |
* | Merged in aavenel/eigen (pull request PR-142) | Gael Guennebaud | 2015-11-04 |
|\ |
* | | Made the CUDA implementation of ploadt_ro compatible with cuda implementation... | Benoit Steiner | 2015-11-03 |
| * | Add round, ceil and floor for SSE4.1/AVX (Bug #70) | Alexandre Avenel | 2015-11-01 |
|/ |
* | bug #1085: workaround gcc default ABI issue | Gael Guennebaud | 2015-10-10 |
* | _mm_hadd_epi32 is for SSSE3 only (and not SSE3) | Gael Guennebaud | 2015-10-07 |
* | Handle various TODOs in SSE vectorization (remove splitted storeu, enable SSE... | Gael Guennebaud | 2015-10-06 |
* | bug #1069: fix AVX support on MSVC (use of non portable C-style cast) | Gael Guennebaud | 2015-09-28 |
* | Added support for predux_mul for CUDA devices | Benoit Steiner | 2015-09-08 |
* | Implement plog and pexp for AltiVec. | Doug Kwan | 2015-07-30 |
* | Fix prototype of plset and generalize linspace functor. | Gael Guennebaud | 2015-08-07 |
* | Include SSE packetmath when AVX is enabled, and enable AVX's sine function on... | Gael Guennebaud | 2015-08-07 |
* | Let unpacket_traits<> expose the required alignment and make use of it every... | Gael Guennebaud | 2015-08-07 |
* | Fix shadow warnings triggered by clang | Gael Guennebaud | 2015-06-09 |
* | Abandon blocking size lookup table approach. Not performing as well in real w... | Benoit Jacob | 2015-05-19 |