aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorReductionCuda.h
Commit message (Expand)AuthorAge
* Fixed a race condition that could affect some reductions on CUDA devices.Gravatar Benoit Steiner2016-01-15
* Use warp shuffles instead of shared memory access to speedup the inner reduct...Gravatar Benoit Steiner2016-01-14
* Fixed a boundary condition bug in the outer reduction kernelGravatar Benoit Steiner2016-01-14
* Silenced a few compilation warnings.Gravatar Benoit Steiner2016-01-11
* Deleted unused variable.Gravatar Benoit Steiner2016-01-11
* Silenced a nvcc compilation warningGravatar Benoit Steiner2016-01-11
* Silenced several compilation warnings triggered by nvcc.Gravatar Benoit Steiner2016-01-11
* Merged in jeremy_barnes/eigen/shader-model-3.0 (pull request PR-152)Gravatar Benoit Steiner2016-01-11
|\
* | Re-enabled the optimized reduction CUDA code.Gravatar Benoit Steiner2016-01-11
| * Alternative way of forcing instantiation of device kernels withoutGravatar Jeremy Barnes2016-01-10
|/
* Prevent nvcc from miscompiling the cuda metakernel. Unfortunately this reintr...Gravatar Benoit Steiner2016-01-08
* Improved the performance of reductions on CUDA devicesGravatar Benoit Steiner2016-01-04
* Optimized the configuration of the outer reduction cuda kernelGravatar Benoit Steiner2015-12-22
* Added missing defineGravatar Benoit Steiner2015-12-22
* Made sure the optimized gpu reduction code is actually compiled.Gravatar Benoit Steiner2015-12-22
* Optimized outer reduction on GPUs.Gravatar Benoit Steiner2015-12-22
* Doubled the speed of full reductions on GPUs.Gravatar Benoit Steiner2015-12-18
* Code cleanupGravatar Benoit Steiner2015-11-06