aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorReduction.h
Commit message (Expand)AuthorAge
...
* Converting all sycl buffers to uninitialised device only buffers; adding memc...Gravatar Mehdi Goli2016-11-08
* Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CX...Gravatar Mehdi Goli2016-11-04
* Fixing the code indentation in the TensorReduction.h file.Gravatar Mehdi Goli2016-10-14
* Reducing the code by generalising sycl backend functions/structs.Gravatar Mehdi Goli2016-10-14
* Fixed a bug impacting some outer reductions on GPUGravatar Benoit Steiner2016-09-12
* Don't attempt to optimize partial reductions when the optimized implementatio...Gravatar Benoit Steiner2016-08-08
* Improved partial reductions in more casesGravatar Benoit Steiner2016-07-22
* Fix warningsGravatar Gael Guennebaud2016-07-08
* Fix warningGravatar Gael Guennebaud2016-07-07
* Use array_prod to compute the number of elements contained in the input tenso...Gravatar Benoit Steiner2016-06-04
* Improved the performance of full reductions.Gravatar Benoit Steiner2016-06-03
* Resolved merge conflictsGravatar Benoit Steiner2016-05-26
* Merged latest reduction improvementsGravatar Benoit Steiner2016-05-26
|\
* | Improved the performance of inner reductions.Gravatar Benoit Steiner2016-05-26
* | Merged in rmlarsen/eigen (pull request PR-188)Gravatar Benoit Steiner2016-05-23
|\ \
* | | Make EIGEN_HAS_CONSTEXPR user configurableGravatar Gael Guennebaud2016-05-20
* | | Make EIGEN_HAS_VARIADIC_TEMPLATES user configurableGravatar Gael Guennebaud2016-05-20
| * | Minor cleanups: 1. Get rid of unused variables. 2. Get rid of last uses of EI...Gravatar Rasmus Munk Larsen2016-05-18
|/ /
| * Allow vectorized padding on GPU. This helps speed things up a little.Gravatar Benoit Steiner2016-05-17
|/
* Improved the portability of the tensor codeGravatar Benoit Steiner2016-05-11
* Properly gate the use of half2.Gravatar Benoit Steiner2016-05-10
* Improved the performance of full reductions on GPU:Gravatar Benoit Steiner2016-05-09
* Eigen Tensor cost model part 2: Thread scheduling for standard evaluators and...Gravatar Rasmus Munk Larsen2016-04-14
* Eigen cost model part 1. This implements a basic recursive framework to estim...Gravatar Rasmus Munk Larsen2016-04-14
* Fixed compilation warnings on armGravatar Benoit Steiner2016-03-28
* Avoid unnecessary conversionsGravatar Benoit Steiner2016-03-23
* Fixed compilation warningGravatar Benoit Steiner2016-03-23
* Use a single Barrier instead of a collection of Notifications to reduce the t...Gravatar Benoit Steiner2016-03-22
* Avoid implicit castGravatar Benoit Steiner2016-03-09
* Avoid unnecessary conversion from 32bit int to 64bit unsigned intGravatar Benoit Steiner2016-03-09
* Replace std::vector with our own implementation, as using the stl when compil...Gravatar Benoit Steiner2016-03-08
* Simplified the full reduction codeGravatar Benoit Steiner2016-03-08
* Decoupled the packet type definition from the definition of the tensor ops. A...Gravatar Benoit Steiner2016-03-08
* Made the signature of the inner and outer reducers consistentGravatar Benoit Steiner2016-02-29
* Optimized the performance of narrow reductions on CUDA devicesGravatar Benoit Steiner2016-02-29
* Fixed a typo in the reduction code that could prevent large full reductionsx ...Gravatar Benoit Steiner2016-02-24
* Fixed a number of compilation warnings generated by the cuda testsGravatar Benoit Steiner2016-01-31
* Fixed a couple of compilation warnings.Gravatar Benoit Steiner2016-01-28
* Fixed some compilation problems with nvcc + clangGravatar Benoit Steiner2016-01-27
* Record whether the underlying tensor storage can be accessed directly during ...Gravatar Benoit Steiner2016-01-19
* Properly record the rank of reduced tensors in the tensor traits.Gravatar Benoit Steiner2016-01-13
* Merged in jeremy_barnes/eigen/shader-model-3.0 (pull request PR-152)Gravatar Benoit Steiner2016-01-11
|\
* | Fixed a bug in the dispatch of optimized reduction kernels.Gravatar Benoit Steiner2016-01-11
* | Re-enabled the optimized reduction CUDA code.Gravatar Benoit Steiner2016-01-11
| * Alternative way of forcing instantiation of device kernels withoutGravatar Jeremy Barnes2016-01-10
|/
* Simplified the dispatch code.Gravatar Benoit Steiner2016-01-08
* Reworked the dispatch of optimized cuda reduction kernels to workaround a nvc...Gravatar Benoit Steiner2016-01-08
* Improved the performance of reductions on CUDA devicesGravatar Benoit Steiner2016-01-04
* Optimized outer reduction on GPUs.Gravatar Benoit Steiner2015-12-22
* Silenced some compilation warnings triggered by nvccGravatar Benoit Steiner2015-12-17