Commit message (author, date)
* Avoid float to double conversion (Benoit Steiner, 2016-05-17)
* Fixed compilation errors triggered by old versions of gcc (Benoit Steiner, 2016-05-12)
* Worked around a compilation error triggered by nvcc when compiling a tensor c... (Benoit Steiner, 2016-05-12)
* Fixed potential race condition in the non blocking thread pool (Benoit Steiner, 2016-05-12)
* Replace implicit cast with an explicit one (Benoit Steiner, 2016-05-12)
* Worked around compilation errors with older versions of gcc (Benoit Steiner, 2016-05-11)
* Improved the portability of the tensor code (Benoit Steiner, 2016-05-11)
* Fixed a couple of bugs related to the Pascal family of GPUs (Benoit Steiner, 2016-05-11)
* Avoid unnecessary conversions between floats and doubles (Benoit Steiner, 2016-05-11)
* Added more tests for half floats (Benoit Steiner, 2016-05-11)
* Added the ability to load fp16 using the texture path. (Benoit Steiner, 2016-05-11)
* Misc fixes for fp16 (Benoit Steiner, 2016-05-11)
* Made predux_min and predux_max on fp16 less noisy (Benoit Steiner, 2016-05-11)
* __ldg is only available with cuda architectures >= 3.5 (Benoit Steiner, 2016-05-11)
* Fixed a typo (Benoit Steiner, 2016-05-11)
* Do not copy for trivial 1x1 case. This also avoids a "maybe-uninitialized" wa... (Christoph Hertzberg, 2016-05-11)
* Added missing EIGEN_DEVICE_FUNC (Benoit Steiner, 2016-05-11)
|\
| * Added missing EIGEN_DEVICE_FUNC qualifiers (Benoit Steiner, 2016-05-11)
* | Removed deprecated flag (which apparently was ignored anyway) (Christoph Hertzberg, 2016-05-11)
* | fixed some double-promotion and sign-compare warnings (Christoph Hertzberg, 2016-05-11)
* | Split unit test (Christoph Hertzberg, 2016-05-11)
* | Don't flood test output with successful VERIFY_IS_NOT_EQUAL tests. (Christoph Hertzberg, 2016-05-11)
* | Fix help output of buildtests and check scripts (Christoph Hertzberg, 2016-05-11)
* | bug #1207: Add and fix logical-op warnings (Christoph Hertzberg, 2016-05-11)
* | bug #1213: Give names to anonymous enums (Christoph Hertzberg, 2016-05-06)
|/
* Fixed a typo in my previous commit (Benoit Steiner, 2016-05-11)
* Fix potential race condition in the CUDA reduction code. (Benoit Steiner, 2016-05-11)
* Added a few tests to validate the generation of random tensors on GPU. (Benoit Steiner, 2016-05-11)
* Explicitely initialize all the atomic variables. (Benoit Steiner, 2016-05-11)
* Workaround maybe-uninitialized warning (Christoph Hertzberg, 2016-05-11)
* Workaround "misleading-indentation" warnings (Christoph Hertzberg, 2016-05-11)
* Properly gate the use of half2. (Benoit Steiner, 2016-05-10)
* Extended the tests for ptanh (Benoit Steiner, 2016-05-10)
* Added support for fp16 to the sigmoid functor. (Benoit Steiner, 2016-05-10)
* Small improvement to the full reduction of fp16 (Benoit Steiner, 2016-05-10)
* Added packet primitives to compute exp, log, sqrt and rsqrt on fp16. This imp... (Benoit Steiner, 2016-05-10)
* Added a test to validate the new non blocking thread pool (Benoit Steiner, 2016-05-10)
* Simplified the reduction code a little. (Benoit Steiner, 2016-05-10)
* Fixed compilation warning (Benoit Steiner, 2016-05-09)
* Improved the performance of full reductions on GPU: (Benoit Steiner, 2016-05-09)
* Added the ability to use a scratch buffer in cuda kernels (Benoit Steiner, 2016-05-09)
* Added a new parallelFor api to the thread pool device. (Benoit Steiner, 2016-05-09)
* Optimized the non blocking thread pool: (Benoit Steiner, 2016-05-09)
* Pulled latest updates from trunk (Benoit Steiner, 2016-05-07)
|\
* | Worked around a bug in nvcc on tegra x1 (Benoit Steiner, 2016-05-07)
* | Merged latest updates from trunk (Benoit Steiner, 2016-05-06)
|\ \
* | | Added support for packet processing of fp16 on kepler and maxwell gpus (Benoit Steiner, 2016-05-06)
| * | Avoid double promotion (Benoit Steiner, 2016-05-06)
* | | Marked a few tensor operations as read only (Benoit Steiner, 2016-05-05)
|/ /
* | Added a test to validate full reduction on tensor of half floats (Benoit Steiner, 2016-05-05)