index
:
eigen
master
C++ library for linear algebra
about
summary
refs
log
tree
commit
diff
homepage
log msg
author
committer
range
path:
root
/
unsupported
/
Eigen
/
CXX11
/
src
/
Tensor
/
TensorReductionCuda.h
Commit message (
Expand
)
Author
Age
*
Fixed compilation warning
Benoit Steiner
2016-03-18
*
Improved the performance of large outer reductions on cuda
Benoit Steiner
2016-02-29
*
Made the signature of the inner and outer reducers consistent
Benoit Steiner
2016-02-29
*
Optimized the performance of narrow reductions on CUDA devices
Benoit Steiner
2016-02-29
*
Fixed a race condition that could affect some reductions on CUDA devices.
Benoit Steiner
2016-01-15
*
Use warp shuffles instead of shared memory access to speedup the inner reduct...
Benoit Steiner
2016-01-14
*
Fixed a boundary condition bug in the outer reduction kernel
Benoit Steiner
2016-01-14
*
Silenced a few compilation warnings.
Benoit Steiner
2016-01-11
*
Deleted unused variable.
Benoit Steiner
2016-01-11
*
Silenced a nvcc compilation warning
Benoit Steiner
2016-01-11
*
Silenced several compilation warnings triggered by nvcc.
Benoit Steiner
2016-01-11
*
Merged in jeremy_barnes/eigen/shader-model-3.0 (pull request PR-152)
Benoit Steiner
2016-01-11
|
\
*
|
Re-enabled the optimized reduction CUDA code.
Benoit Steiner
2016-01-11
|
*
Alternative way of forcing instantiation of device kernels without
Jeremy Barnes
2016-01-10
|
/
*
Prevent nvcc from miscompiling the cuda metakernel. Unfortunately this reintr...
Benoit Steiner
2016-01-08
*
Improved the performance of reductions on CUDA devices
Benoit Steiner
2016-01-04
*
Optimized the configuration of the outer reduction cuda kernel
Benoit Steiner
2015-12-22
*
Added missing define
Benoit Steiner
2015-12-22
*
Made sure the optimized gpu reduction code is actually compiled.
Benoit Steiner
2015-12-22
*
Optimized outer reduction on GPUs.
Benoit Steiner
2015-12-22
*
Doubled the speed of full reductions on GPUs.
Benoit Steiner
2015-12-18
*
Code cleanup
Benoit Steiner
2015-11-06