Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Use signed integers more consistently to encode the number of threads to use ↵ | 2016-06-09 | |
| | | | | to evaluate a tensor expression. | ||
* | Fixed some compilation warnings | 2016-05-26 | |
| | |||
* | Merged in rmlarsen/eigen (pull request PR-188) | 2016-05-23 | |
|\ | | | | | | | Minor cleanups: 1. Get rid of a few unused variables. 2. Get rid of last uses of EIGEN_USE_COST_MODEL. | ||
* | | Fix some sign-compare warnings | 2016-05-22 | |
| | | |||
| * | Minor cleanups: 1. Get rid of unused variables. 2. Get rid of last uses of ↵ | 2016-05-18 | |
|/ | | | | EIGEN_USE_COST_MODEL. | ||
* | Turnon the new thread pool by default since it scales much better over ↵ | 2016-05-13 | |
| | | | | multiple cores. It is still possible to revert to the old thread pool by compiling with the EIGEN_USE_SIMPLE_THREAD_POOL define. | ||
* | New multithreaded contraction that doesn't rely on the thread pool to run ↵ | 2016-05-13 | |
| | | | | the closure in the order in which they are enqueued. This is needed in order to switch to the new non blocking thread pool since this new thread pool can execute the closure in any order. | ||
* | Replace implicit cast with an explicit one | 2016-05-12 | |
| | |||
* | Added tests for full contractions using thread pools and gpu devices. | 2016-05-05 | |
| | | | | Fixed a couple of issues in the corresponding code. | ||
* | Replace std::vector with our own implementation, as using the stl when ↵ | 2016-03-08 | |
| | | | | compiling with nvcc and avx enabled leads to many issues. | ||
* | Fixed the tensor chipping code. | 2016-03-08 | |
| | |||
* | Decoupled the packet type definition from the definition of the tensor ops. ↵ | 2016-03-08 | |
| | | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit. | ||
* | Leverage the new blocking code in the tensor contraction code. | 2016-01-22 | |
| | |||
* | Use numext::mini/numext::maxi instead of std::min/std::max in the tensor code | 2015-08-28 | |
| | |||
* | Moved some utilities to TensorMeta.h to make it easier to reuse them accross ↵ | 2015-06-29 | |
| | | | | | | several tensor operations. Created the TensorDimensionList class to encode the list of all the dimensions of a tensor of rank n. This could be done using TensorIndexList, however TensorIndexList require cxx11 which isn't yet supported as widely as we'd like. | ||
* | Moved away from std::async and std::future as the underlying mechnism for ↵ | 2015-05-20 | |
| | | | | | | the thread pool device. On several platforms, the functions passed to std::async are not scheduled in the order in which they are given to std::async, which leads to massive performance issues in the contraction code. Instead we now have a custom thread pool that ensures that the functions are picked up by the threads in the pool in the order in which they are enqueued in the pool. | ||
* | Fixed another batch of compilation warnings | 2015-02-28 | |
| | |||
* | Fixed several compilation warnings reported by clang | 2015-02-25 | |
| | |||
* | Silenced several compilation warnings | 2015-02-10 | |
| | |||
* | Improved support for RowMajor tensors | 2015-01-14 | |
| | | | | Misc fixes and API cleanups. | ||
* | Added support for RowMajor inputs to the contraction code. | 2015-01-14 | |
| | |||
* | Improved handling of 1d tensors | 2014-11-03 | |
| | |||
* | Avoid calling get_future() more than once on a given promise. | 2014-10-16 | |
| | |||
* | Made the blocking computation aware of the l3 cache | 2014-10-15 | |
| | | | | Also optimized the blocking parameters to take into account the number of threads used for a computation | ||
* | Added support for promises | 2014-10-15 | |
Started to improve multithreaded contractions |