path: root/Eigen/src/Core/arch/CUDA
Commit message (Author, Age)
* Optimized the computation of exp, sqrt, ceil and floor for fp16 on Pascal GPUs (Benoit Steiner, 2016-11-16)
* Merged eigen/eigen into default (Benoit Steiner, 2016-11-03)
|\
| * Gate the code that refers to cuda fp16 primitives more thoroughly (Benoit Steiner, 2016-11-01)
* | Deleted redundant implementation of predux (Benoit Steiner, 2016-10-12)
* | Merged eigen/eigen into default (Benoit Steiner, 2016-10-12)
|\|
* | Take advantage of AVX512 instructions whenever possible to speedup the proces... (Benoit Steiner, 2016-10-12)
| * Added missing AVX intrinsics for fp16: in particular, implemented predux whic... (Benoit Steiner, 2016-10-06)
|/
* Properly characterize the CUDA packet primitives for fp16 as device only (Benoit Steiner, 2016-10-04)
* Added support for constant std::complex numbers on GPU (Benoit Steiner, 2016-10-03)
* Added missing typedefs (Benoit Steiner, 2016-09-20)
* Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_d... (RJ Ryan, 2016-09-20)
* use ::hlog if available. (Gael Guennebaud, 2016-08-29)
* bug #1167: simplify installation of header files using cmake's install(DIRECT... (Gael Guennebaud, 2016-08-29)
* Fix compilation with MSVC by using our portable numext::log1p implementation. (Gael Guennebaud, 2016-08-22)
* Fix compilation on CUDA 8 by removing call to h2log1p (Igor Babuschkin, 2016-08-15)
* Add log1p support for CUDA and half floats (Igor Babuschkin, 2016-08-08)
* Fixed the constructors of the new half_base class. (Benoit Steiner, 2016-08-04)
* Fixed the isnan, isfinite and isinf operations on GPU (Benoit Steiner, 2016-08-04)
* Move Eigen::half_impl::half to Eigen::half while preserving the free function... (Gael Guennebaud, 2016-08-04)
* half implementation has been moved to half_impl namespace (Benoit Steiner, 2016-07-29)
* bug #1266: half implementation has been moved to half_impl namespace (Christoph Hertzberg, 2016-07-29)
* Fix CUDA compilation (Gael Guennebaud, 2016-07-22)
* More cleaning in half: (Gael Guennebaud, 2016-07-22)
* Fix CUDA compilation (Gael Guennebaud, 2016-07-21)
* Add missing log10 and random generator for half. (Gael Guennebaud, 2016-07-21)
* Some cleanup in Half: standard functions should be defined in the namespace o... (Gael Guennebaud, 2016-07-21)
* Move CUDA's special functions to SpecialFunctions module. (Gael Guennebaud, 2016-07-11)
* bug #1232: refactor special functions as a new SpecialFunctions module, curre... (Gael Guennebaud, 2016-07-08)
* Enable the vectorization of adds and mults of fp16 (Benoit Steiner, 2016-06-07)
* Add TernaryFunctors and the betainc SpecialFunction. (Eugene Brevdo, 2016-06-02)
* Improved support for CUDA 8.0 (Benoit Steiner, 2016-05-31)
* Disable the use of MMX instructions since the code is broken on many platforms (Benoit Steiner, 2016-05-27)
* Deleted extra namespace (Benoit Steiner, 2016-05-26)
* Disable usage of MMX with msvc. (Gael Guennebaud, 2016-05-26)
* Add missing inclusion of mmintrin.h (Gael Guennebaud, 2016-05-26)
* Silenced a compilation warning (Benoit Steiner, 2016-05-25)
* Specify the rounding mode in the correct location (Benoit Steiner, 2016-05-25)
* Explicitly specify the rounding mode when converting floats to fp16 (Benoit Steiner, 2016-05-25)
* Disable the use of MMX instructions on x86_64 since too many compilers only s... (Benoit Steiner, 2016-05-25)
* Fix compilation with ICC. (Gael Guennebaud, 2016-05-25)
* Cleaned up the fp16 code a little more (Benoit Steiner, 2016-05-24)
* Cleaned up the fp16 code (Benoit Steiner, 2016-05-24)
* Remove now-unused protate PacketMath func (Benoit Jacob, 2016-05-24)
* Don't attempt to use MMX instructions with visualstudio since they're only pa... (Benoit Steiner, 2016-05-24)
* Worked around missing clang intrinsic (Benoit Steiner, 2016-05-24)
* Use the generic ploadquad intrinsics since it does the job (Benoit Steiner, 2016-05-24)
* Worked around missing clang intrinsics (Benoit Steiner, 2016-05-24)
* Added missing EIGEN_DEVICE_FUNC qualifier (Benoit Steiner, 2016-05-23)
* Use the Index type instead of integers to specify the strides in pgather/psca... (Benoit Steiner, 2016-05-23)
* Added missing alignment in the fp16 packet traits (Benoit Steiner, 2016-05-23)