aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/GenericPacketMath.h
Commit message (Collapse)AuthorAge
* Rename predux_downto4 to be more accurate on its semantic.Gravatar Gael Guennebaud2018-04-03
|
* Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH ↵Gravatar Gael Guennebaud2017-07-17
| | | | aliases
* Declared the plset, ploadt_ro, and ploaddup packet primitives as usable ↵Gravatar Benoit Steiner2017-02-27
| | | | within a gpu kernel
* Added support for expm1 in Eigen.Gravatar Srinivas Vasudevan2016-12-02
|
* Merged eigen/eigen into defaultGravatar Benoit Steiner2016-11-03
|\
| * Add pinsertfirst function and implement pinsertlast for complex on SSE/AVX.Gravatar Gael Guennebaud2016-11-02
| |
| * Add missing inline keywordsGravatar Gael Guennebaud2016-10-25
| |
| * Add a pinsertlast function replacing the last entry of a packet by a scalar.Gravatar Gael Guennebaud2016-10-25
| | | | | | | | (useful to vectorize LinSpaced)
* | Renamed predux_half into predux_downto4Gravatar Benoit Steiner2016-10-06
| |
* | Merged latest updates from trunkGravatar Benoit Steiner2016-10-05
|\|
| * bug #1249: enable use of __builtin_prefetch for GCC, clang, and ICC only.Gravatar Gael Guennebaud2016-07-25
| |
| * bug #1232: refactor special functions as a new SpecialFunctions module, ↵Gravatar Gael Guennebaud2016-07-08
| | | | | | | | currently in unsupported/.
| * Add TernaryFunctors and the betainc SpecialFunction.Gravatar Eugene Brevdo2016-06-02
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | TernaryFunctors and their executors allow operations on 3-tuples of inputs. API fully implemented for Arrays and Tensors based on binary functors. Ported the cephes betainc function (regularized incomplete beta integral) to Eigen, with support for CPU and GPU, floats, doubles, and half types. Added unit tests in array.cpp and cxx11_tensor_cuda.cu Collapsed revision * Merged helper methods for betainc across floats and doubles. * Added TensorGlobalFunctions with betainc(). Removed betainc() from TensorBase. * Clean up CwiseTernaryOp checks, change igamma_helper to cephes_helper. * betainc: merge incbcf and incbd into incbeta_cfe. and more cleanup. * Update TernaryOp and SpecialFunctions (betainc) based on review comments.
| * Expose log1p to Array.Gravatar Gael Guennebaud2016-06-01
| |
| * Remove now-unused protate PacketMath funcGravatar Benoit Jacob2016-05-24
| |
* | Pull latest updates from upstreamGravatar Benoit Steiner2016-04-11
|\|
| * Fixed the packet_traits for half floats.Gravatar Benoit Steiner2016-04-08
| |
| * Added polygamma function.Gravatar Till Hoffmann2016-04-01
| |
| * Added zeta function.Gravatar Till Hoffmann2016-04-01
| |
| * Resolve bad merge.Gravatar Eugene Brevdo2016-03-08
| |
| * Added support for vectorized type casting of int to char.Gravatar Benoit Steiner2016-02-03
| |
* | Generalized predux4 to support AVX512 packets, and renamed it predux_half.Gravatar Benoit Steiner2016-02-01
| | | | | | | | Disabled the implementation of pabs for avx512 since the corresponding intrinsics are not shipped with gcc
| * Fixed compilation warningGravatar Benoit Steiner2016-01-28
| |
| * Add digamma for CPU + CUDA. Includes tests.Gravatar Eugene Brevdo2015-12-24
| |
| * CleanupGravatar Benoit Steiner2015-12-08
| |
| * Fixed a couple of typosGravatar Benoit Steiner2015-12-07
| | | | | | | | Cleaned up the code a bit.
| * Add special functions to Eigen: lgamma, erf, erfc.Gravatar Eugene Brevdo2015-12-07
|/ | | | Includes CUDA support and unit tests.
* added scalar_sign_op (both real,complex)Gravatar Mark Borgerding2015-11-24
|
* Fix prototype of plset and generalize linspace functor.Gravatar Gael Guennebaud2015-08-07
|
* Let unpacket_traits<> exposes the required alignment and make use of it ↵Gravatar Gael Guennebaud2015-08-07
| | | | everywhere
* First part of a big refactoring of alignment control to enable the handling ↵Gravatar Gael Guennebaud2015-08-06
| | | | | | | | | of arbitrarily aligned buffers. It includes: - AlignedBit flag is deprecated. Alignment is now specified by the evaluator through the 'Alignment' enum, e.g., evaluator<Xpr>::Alignment. Its value is in Bytes. - Add several enums to specify alignment: Aligned8, Aligned16, Aligned32, Aligned64, Aligned128. AlignedMax corresponds to EIGEN_MAX_ALIGN_BYTES. Such enums are used to define the above Alignment value, and as the 'Options' template parameter of Map<> and Ref<>. - The Aligned enum is now deprecated. It is now an alias for Aligned16. - Currently, traits<Matrix<>>, traits<Array<>>, traits<Ref<>>, traits<Map<>>, and traits<Block<>> also expose the Alignment enum.
* Added support for prefetching on cuda devicesGravatar Benoit Steiner2015-07-17
|
* bug #80: merge with d_hood branch on adding more coefficient-wise unary ↵Gravatar Gael Guennebaud2015-06-10
|\ | | | | | | array functors
| * Remove packet isNaN, isInf, isFiniteGravatar Deanna Hood2015-03-17
| |
| * Rename isinf to isInfGravatar Deanna Hood2015-03-17
| |
| * Add isfinite array support as isFiniteGravatar Deanna Hood2015-03-17
| |
| * Rename isnan to isNaNGravatar Deanna Hood2015-03-17
| |
| * Add hyperbolic trigonometric functions from std array supportGravatar Deanna Hood2015-03-11
| |
| * Add log10 array supportGravatar Deanna Hood2015-03-11
| |
| * Additional unary coeff-wise functors (isnan, round, arg, e.g.)Gravatar Deanna Hood2015-03-11
| |
* | Improved the default implementation of prsqrtGravatar Benoit Steiner2015-02-28
| |
* | Pulled latest updates from trunkGravatar Benoit Steiner2015-02-27
|\|
* | Added support for vectorized type casting of tensorsGravatar Benoit Steiner2015-02-27
| |
* | Added support for fast reciprocal square root computation.Gravatar Benoit Steiner2015-02-26
| |
| * Reimplement the selection between rotating and non-rotating kernelsGravatar Benoit Jacob2015-02-27
| | | | | | | | | | | | using templates instead of macros and if()'s. That was needed to fix the build of unit tests on ARM, which I had broken. My bad for not testing earlier.
| * Replace a static assert by a runtime one, fixes the build of unit tests on ARMGravatar Benoit Jacob2015-02-27
|/ | | | | Also safely assert in the non-implemented path that should never be taken in practice, and would return wrong results.
* bug #955 - Implement a rotating kernel alternative in the 3px4 gebp pathGravatar Benoit Jacob2015-02-18
| | | | | | | | This is substantially faster on ARM, where it's important to minimize the number of loads. This is specific to the case where all packet types are of size 4. I made my best attempt to minimize how dirty this is... opinions welcome. Eventually one could have a generic rotated kernel, but it would take some work to get there. Also, on sandy bridge, in my experience, it's not beneficial (even about 1% slower).
* The usage of DenseIndex is deprecated, so let's replace DenseIndex by IndexGravatar Gael Guennebaud2015-02-16
|
* Pulled the latest changes from the trunkGravatar Benoit Steiner2015-02-06
|\
| * Introduce unified macros to identify compiler, OS, and architecture. They ↵Gravatar Gael Guennebaud2014-11-04
| | | | | | | | are all defined in util/Macros.h and prefixed with EIGEN_COMP_, EIGEN_OS_, and EIGEN_ARCH_ respectively.