Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Ensured that each thread has it's own copy of the TensorEvaluator: this ↵ | 2015-01-14 | |
| | | | | avoid race conditions when the evaluator calls a non thread safe functor, eg when generating random numbers. | ||
* | Fixed the evaluation of expressions involving tensors of 2 or 3 elements on ↵ | 2014-11-18 | |
| | | | | CUDA devices. | ||
* | Use the proper index type | 2014-10-30 | |
| | |||
* | Misc improvements and cleanups | 2014-10-13 | |
| | |||
* | Fixed the tensor shuffling test | 2014-10-10 | |
| | |||
* | Fixed the thread pool test | 2014-10-10 | |
| | |||
* | Fixed a typo. | 2014-08-13 | |
| | |||
* | Reworked the TensorExecutor code to support in place evaluation. | 2014-08-13 | |
| | |||
* | Improved the efficiency of the tensor evaluation code on thread pools and gpus. | 2014-07-08 | |
| | |||
* | Reworked the expression evaluation mechanism in order to make it possible to ↵ | 2014-06-13 | |
efficiently compute convolutions and contractions in the future: * The scheduling of computation is moved out the the assignment code and into a new TensorExecutor class * The assignment itself is now a regular node on the expression tree * The expression evaluators start by recursively evaluating all their subexpressions if needed |