aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Expand)AuthorAge
* #if defined(EIGEN_USE_NONBLOCKING_THREAD_POOL) is now #if !defined(EIGEN_USE_...Gravatar Benoit Steiner2016-05-17
* Fixed compilation errorGravatar Benoit Steiner2016-05-17
* Fixed compilation error in the tensor thread poolGravatar Benoit Steiner2016-05-17
* Merge upstream.Gravatar Rasmus Munk Larsen2016-05-17
|\
* | Roll back changes to core. Move include of TensorFunctors.h up to satisfy dep...Gravatar Rasmus Munk Larsen2016-05-17
| * Merged eigen/eigen into defaultGravatar Rasmus Larsen2016-05-17
|/|
| * Enable the use of the packet api to evaluate tensor broadcasts. This speed th...Gravatar Benoit Steiner2016-05-17
| * Allow vectorized padding on GPU. This helps speed things up a littleGravatar Benoit Steiner2016-05-17
| * Pulled latest updates from trunk.Gravatar Benoit Steiner2016-05-17
| |\
| * | Don't rely on c++11 extension when we don't have to.Gravatar Benoit Steiner2016-05-17
| * | Avoid float to double conversionGravatar Benoit Steiner2016-05-17
| | * Added missing costPerCoeff methodGravatar Benoit Steiner2016-05-16
| | * Turn on the cost model by default. This results in some significant speedups ...Gravatar Benoit Steiner2016-05-16
| | * Fixed syntax errorGravatar Benoit Steiner2016-05-16
| | * Turnon the new thread pool by default since it scales much better over multip...Gravatar Benoit Steiner2016-05-13
| | * New multithreaded contraction that doesn't rely on the thread pool to run the...Gravatar Benoit Steiner2016-05-13
| | * Added benchmarks for contraction on CPU.Gravatar Benoit Steiner2016-05-13
| | * Removed unnecessary thread synchronizationGravatar Benoit Steiner2016-05-13
| |/
| * Fixed compilation errors triggered by old versions of gccGravatar Benoit Steiner2016-05-12
* | Diasbled cost model by accident. Revert.Gravatar Rasmus Munk Larsen2016-05-12
* | Address comments by bsteiner.Gravatar Rasmus Munk Larsen2016-05-12
* | Improvements to parallelFor.Gravatar Rasmus Munk Larsen2016-05-12
|/
* Worked around a compilation error triggered by nvcc when compiling a tensor c...Gravatar Benoit Steiner2016-05-12
* Fixed potential race condition in the non blocking thread poolGravatar Benoit Steiner2016-05-12
* Replace implicit cast with an explicit oneGravatar Benoit Steiner2016-05-12
* Worked around compilation errors with older versions of gccGravatar Benoit Steiner2016-05-11
* Improved the portability of the tensor codeGravatar Benoit Steiner2016-05-11
* Fixed a couple of bugs related to the Pascalfamily of GPUsGravatar Benoit Steiner2016-05-11
* Avoid unnecessary conversions between floats and doublesGravatar Benoit Steiner2016-05-11
* Added more tests for half floatsGravatar Benoit Steiner2016-05-11
* Added the ability to load fp16 using the texture path.Gravatar Benoit Steiner2016-05-11
* Misc fixes for fp16Gravatar Benoit Steiner2016-05-11
* Made predux_min and predux_max on fp16 less noisyGravatar Benoit Steiner2016-05-11
* __ldg is only available with cuda architectures >= 3.5Gravatar Benoit Steiner2016-05-11
* Fixed a typoGravatar Benoit Steiner2016-05-11
* Do not copy for trivial 1x1 case. This also avoids a "maybe-uninitialized" wa...Gravatar Christoph Hertzberg2016-05-11
* Added missing EIGEN_DEVICE_FUNCGravatar Benoit Steiner2016-05-11
|\
| * Added missing EIGEN_DEVICE_FUNC qualifiersGravatar Benoit Steiner2016-05-11
* | Removed deprecated flag (which apparently was ignored anyway)Gravatar Christoph Hertzberg2016-05-11
* | fixed some double-promotion and sign-compare warningsGravatar Christoph Hertzberg2016-05-11
* | Split unit testGravatar Christoph Hertzberg2016-05-11
* | Don't flood test output with successful VERIFY_IS_NOT_EQUAL tests.Gravatar Christoph Hertzberg2016-05-11
* | Fix help output of buildtests and check scriptsGravatar Christoph Hertzberg2016-05-11
* | bug #1207: Add and fix logical-op warningsGravatar Christoph Hertzberg2016-05-11
* | bug #1213: Give names to anonymous enumsGravatar Christoph Hertzberg2016-05-06
|/
* Fixed a typo in my previous commitGravatar Benoit Steiner2016-05-11
* Fix potential race condition in the CUDA reduction code.Gravatar Benoit Steiner2016-05-11
* Added a few tests to validate the generation of random tensors on GPU.Gravatar Benoit Steiner2016-05-11
* Explicitely initialize all the atomic variables.Gravatar Benoit Steiner2016-05-11
* Workaround maybe-uninitialized warningGravatar Christoph Hertzberg2016-05-11