aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/arch/CUDA
Commit message (Expand)AuthorAge
* Better CUDA complex division.Gravatar Antonio Sanchez2021-04-29
* Fix NVCC+ICC issues.Gravatar Antonio Sanchez2021-03-15
* Specialize std::complex operators for use on GPU device.Gravatar Antonio Sanchez2021-01-22
* Improved std::complex sqrt and rsqrt.Gravatar Antonio Sanchez2021-01-17
* Only specialize complex `sqrt_impl` for CUDA if not MSVC.Gravatar Antonio Sanchez2021-01-11
* Fix MSVC complex sqrt and packetmath test.Gravatar Antonio Sanchez2021-01-08
* Add CUDA complex sqrt.Gravatar Antonio Sanchez2020-12-22
* renaming CUDA* to GPU* for some header filesGravatar Deven Desai2018-07-11
* moving Half headers from CUDA dir to GPU dir, removing the HIP versionsGravatar Deven Desai2018-06-13
* Derivative of the incomplete Gamma function and the sample of a Gamma random ...Gravatar Michael Figurnov2018-06-06
* Exponentially scaled modified Bessel functions of order zero and one.Gravatar Michael Figurnov2018-05-31
* bug #1520: workaround some -Wfloat-equal warnings by calling std::equal_toGravatar Gael Guennebaud2018-04-11
* Move up the specialization of std::numeric_limitsGravatar Daniel Trebbien2018-02-18
* Replace __float2half_rn with __float2halfGravatar nluehr2017-11-28
* Fix incorrect integer cast in predux<half2>().Gravatar nluehr2017-11-21
* Restore `__device__`Gravatar Henry Schreiner2017-10-21
* Fixing missing inlines on device functions for newer CUDA cardsGravatar Henry Schreiner2017-10-20
* Add C++11 max_digits10 for half.Gravatar Gael Guennebaud2017-09-06
* Added support for CUDA 9.0.Gravatar Benoit Steiner2017-08-31
* bug #1462: remove all occurences of the deprecated __CUDACC_VER__ macro by in...Gravatar Gael Guennebaud2017-08-24
* Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH al...Gravatar Gael Guennebaud2017-07-17
* Added missing __device__ qualifierGravatar Benoit Steiner2017-06-13
|\
| * Added missing __device__ qualifierGravatar Benoit Steiner2017-06-13
* | fix compilation of Half in C++98 (issue introduced in previous commit)Gravatar Gael Guennebaud2017-06-09
* | Add missing std::numeric_limits specialization for half, and complete NumTrai...Gravatar Gael Guennebaud2017-06-09
* | Fixed nested angle barckets >> issue when compiling with cuda 8Gravatar Abhijit Kundu2017-04-27
|/
* remove UTF8 symbolsGravatar Gael Guennebaud2017-03-07
* Made most of the packet math primitives usable within CUDA kernel when compil...Gravatar Benoit Steiner2017-02-28
* Avoid unecessary float to double conversions.Gravatar Benoit Steiner2017-02-27
* Fix expm1 CUDA implementation (do not shadow exp CUDA implementation).Gravatar Srinivas Vasudevan2016-12-05
* Fix small nit where I changed name of plog1p to pexpm1.Gravatar Srinivas Vasudevan2016-12-02
* Added support for expm1 in Eigen.Gravatar Srinivas Vasudevan2016-12-02
* Add a default constructor for the "fake" __half class when not using theGravatar Rasmus Munk Larsen2016-11-29
* Optimized the computation of exp, sqrt, ceil anf floor for fp16 on Pascal GPUsGravatar Benoit Steiner2016-11-16
* Merged eigen/eigen into defaultGravatar Benoit Steiner2016-11-03
|\
| * Gate the code that refers to cuda fp16 primitives more thoroughlyGravatar Benoit Steiner2016-11-01
* | Deleted redundant implementation of preduxGravatar Benoit Steiner2016-10-12
* | Merged eigen/eigen into defaultGravatar Benoit Steiner2016-10-12
|\|
* | Take advantage of AVX512 instructions whenever possible to speedup the proces...Gravatar Benoit Steiner2016-10-12
| * Added missing AVX intrinsics for fp16: in particular, implemented predux whic...Gravatar Benoit Steiner2016-10-06
|/
* Properly characterize the CUDA packet primitives for fp16 as device onlyGravatar Benoit Steiner2016-10-04
* Added support for constand std::complex numbers on GPUGravatar Benoit Steiner2016-10-03
* Added missing typedefsGravatar Benoit Steiner2016-09-20
* Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_d...Gravatar RJ Ryan2016-09-20
* use ::hlog if available.Gravatar Gael Guennebaud2016-08-29
* bug #1167: simplify installation of header files using cmake's install(DIRECT...Gravatar Gael Guennebaud2016-08-29
* Fix compilation with MSVC by using our portable numext::log1p implementation.Gravatar Gael Guennebaud2016-08-22
* Fix compilation on CUDA 8 by removing call to h2log1pGravatar Igor Babuschkin2016-08-15
* Add log1p support for CUDA and half floatsGravatar Igor Babuschkin2016-08-08
* Fixed the constructors of the new half_base class.Gravatar Benoit Steiner2016-08-04