Commit message | Author | Age | |
---|---|---|---|
* | Rename predux_downto4 to be more accurate on its semantic. | 2018-04-03 | |
| | |||
* | Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH aliases | 2017-07-17 | |
| | |||
* | Declared the plset, ploadt_ro, and ploaddup packet primitives as usable within a gpu kernel | 2017-02-27 | |
| | |||
* | Added support for expm1 in Eigen. | 2016-12-02 | |
| | |||
* | Merged eigen/eigen into default | 2016-11-03 | |
|\ | |||
| * | Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX. | 2016-11-02 | |
| | | |||
| * | Add missing inline keywords | 2016-10-25 | |
| | | |||
| * | Add a pinsertlast function replacing the last entry of a packet by a scalar. | 2016-10-25 | |
| | | (useful to vectorize LinSpaced) | | |
* | | Renamed predux_half into predux_downto4 | 2016-10-06 | |
| | | |||
* | | Merged latest updates from trunk | 2016-10-05 | |
|\| | |||
| * | bug #1249: enable use of __builtin_prefetch for GCC, clang, and ICC only. | 2016-07-25 | |
| | | |||
| * | bug #1232: refactor special functions as a new SpecialFunctions module, currently in unsupported/. | 2016-07-08 | |
| | | |||
| * | Add TernaryFunctors and the betainc SpecialFunction. | 2016-06-02 | |
| | | TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu. Collapsed revision: * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments. | | |
| * | Expose log1p to Array. | 2016-06-01 | |
| | | |||
| * | Remove now-unused protate PacketMath func | 2016-05-24 | |
| | | |||
* | | Pull latest updates from upstream | 2016-04-11 | |
|\| | |||
| * | Fixed the packet_traits for half floats. | 2016-04-08 | |
| | | |||
| * | Added polygamma function. | 2016-04-01 | |
| | | |||
| * | Added zeta function. | 2016-04-01 | |
| | | |||
| * | Resolve bad merge. | 2016-03-08 | |
| | | |||
| * | Added support for vectorized type casting of int to char. | 2016-02-03 | |
| | | |||
* | | Generalized predux4 to support AVX512 packets, and renamed it predux_half. | 2016-02-01 | |
| | | Disabled the implementation of pabs for avx512 since the corresponding intrinsics are not shipped with gcc | | |
| * | Fixed compilation warning | 2016-01-28 | |
| | | |||
| * | Add digamma for CPU + CUDA. Includes tests. | 2015-12-24 | |
| | | |||
| * | Cleanup | 2015-12-08 | |
| | | |||
| * | Fixed a couple of typos | 2015-12-07 | |
| | | Cleaned up the code a bit. | | |
| * | Add special functions to Eigen: lgamma, erf, erfc. | 2015-12-07 | |
|/ | Includes CUDA support and unit tests. | | |
* | added scalar_sign_op (both real,complex) | 2015-11-24 | |
| | |||
* | Fix prototype of plset and generalize linspace functor. | 2015-08-07 | |
| | |||
* | Let unpacket_traits<> exposes the required alignment and make use of it everywhere | 2015-08-07 | |
| | |||
* | First part of a big refactoring of alignment control to enable the handling of arbitrarily aligned buffers. It includes: | 2015-08-06 | |
| | - AlignedBit flag is deprecated. Alignment is now specified by the evaluator through the 'Alignment' enum, e.g., evaluator<Xpr>::Alignment. Its value is in Bytes. - Add several enums to specify alignment: Aligned8, Aligned16, Aligned32, Aligned64, Aligned128. AlignedMax corresponds to EIGEN_MAX_ALIGN_BYTES. Such enums are used to define the above Alignment value, and as the 'Options' template parameter of Map<> and Ref<>. - The Aligned enum is now deprecated. It is now an alias for Aligned16. - Currently, traits<Matrix<>>, traits<Array<>>, traits<Ref<>>, traits<Map<>>, and traits<Block<>> also expose the Alignment enum. | | |
* | Added support for prefetching on cuda devices | 2015-07-17 | |
| | |||
* | bug #80: merge with d_hood branch on adding more coefficient-wise unary array functors | 2015-06-10 | |
|\ | |||
| * | Remove packet isNaN, isInf, isFinite | 2015-03-17 | |
| | | |||
| * | Rename isinf to isInf | 2015-03-17 | |
| | | |||
| * | Add isfinite array support as isFinite | 2015-03-17 | |
| | | |||
| * | Rename isnan to isNaN | 2015-03-17 | |
| | | |||
| * | Add hyperbolic trigonometric functions from std array support | 2015-03-11 | |
| | | |||
| * | Add log10 array support | 2015-03-11 | |
| | | |||
| * | Additional unary coeff-wise functors (isnan, round, arg, e.g.) | 2015-03-11 | |
| | | |||
* | | Improved the default implementation of prsqrt | 2015-02-28 | |
| | | |||
* | | Pulled latest updates from trunk | 2015-02-27 | |
|\| | |||
* | | Added support for vectorized type casting of tensors | 2015-02-27 | |
| | | |||
* | | Added support for fast reciprocal square root computation. | 2015-02-26 | |
| | | |||
| * | Reimplement the selection between rotating and non-rotating kernels | 2015-02-27 | |
| | | using templates instead of macros and if()'s. That was needed to fix the build of unit tests on ARM, which I had broken. My bad for not testing earlier. | | |
| * | Replace a static assert by a runtime one, fixes the build of unit tests on ARM | 2015-02-27 | |
|/ | Also safely assert in the non-implemented path that should never be taken in practice, and would return wrong results. | | |
* | bug #955 - Implement a rotating kernel alternative in the 3px4 gebp path | 2015-02-18 | |
| | This is substantially faster on ARM, where it's important to minimize the number of loads. This is specific to the case where all packet types are of size 4. I made my best attempt to minimize how dirty this is... opinions welcome. Eventually one could have a generic rotated kernel, but it would take some work to get there. Also, on sandy bridge, in my experience, it's not beneficial (even about 1% slower). | | |
* | The usage of DenseIndex is deprecated, so let's replace DenseIndex by Index | 2015-02-16 | |
| | |||
* | Pulled the latest changes from the trunk | 2015-02-06 | |
|\ | |||
| * | Introduce unified macros to identify compiler, OS, and architecture. They are all defined in util/Macros.h and prefixed with EIGEN_COMP_, EIGEN_OS_, and EIGEN_ARCH_ respectively. | 2014-11-04 | |
| | | |||