Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Fix more enum arithmetic. | Rasmus Munk Larsen | 2021-06-15 |
| | |||
* | Fix calls to device functions from host code | Nathan Luehr | 2021-05-11 |
| | |||
* | Remove V2 suffix from TensorBlock | Eugene Zhulenev | 2019-12-10 |
| | |||
* | Remove TensorBlock.h and old TensorBlock/BlockMapper | Eugene Zhulenev | 2019-12-10 |
| | |||
* | Do not use std::vector in getResourceRequirements | Eugene Zhulenev | 2019-12-09 |
| | |||
* | Remove legacy block evaluation support | Eugene Zhulenev | 2019-11-12 |
| | |||
* | Cleanup Tensor block destination and materialized block storage allocation | Eugene Zhulenev | 2019-10-16 |
| | |||
* | Block evaluation for TensorGenerator/TensorReverse/TensorShuffling | Eugene Zhulenev | 2019-10-14 |
| | |||
* | Tensor block evaluation V2 support for unary/binary/broadcsting | Eugene Zhulenev | 2019-09-24 |
| | |||
* | evalSubExprsIfNeededAsync + async TensorContractionThreadPool | Eugene Zhulenev | 2019-08-30 |
| | |||
* | [SYCL] This PR adds the minimum modifications to the Eigen unsupported ↵ | Mehdi Goli | 2019-06-28 |
| | | | | | | | | | | module required to run it on devices supporting SYCL. * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes. | ||
* | Merge with upstream eigen/default | Eugene Zhulenev | 2018-08-27 |
|\ | |||
* | | Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵ | Eugene Zhulenev | 2018-08-10 |
| | | | | | | | | evaluators | ||
| * | Use NULL instead of nullptr to avoid adding a cxx11 requirement. | Benoit Steiner | 2018-08-13 |
|/ | |||
* | Enabling per device specialisation of packetsize. | Mehdi Goli | 2018-08-01 |
| | |||
* | Add tiled evaluation support to TensorExecutor | Eugene Zhulenev | 2018-07-25 |
| | |||
* | Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10) | Benoit Steiner | 2017-06-28 |
| | | | | | | | | | | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments | ||
* | Partial OpenCL support via SYCL compatible with ComputeCpp CE. | Luke Iwanski | 2016-09-19 |
| | |||
* | Deleted useless trailing commas | Benoit Steiner | 2016-04-29 |
| | |||
* | Eigen cost model part 1. This implements a basic recursive framework to ↵ | Rasmus Munk Larsen | 2016-04-14 |
| | | | | estimate the cost of evaluating tensor expressions. | ||
* | Decoupled the packet type definition from the definition of the tensor ops. ↵ | Benoit Steiner | 2016-03-08 |
| | | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit. | ||
* | Record whether the underlying tensor storage can be accessed directly during ↵ | Benoit Steiner | 2016-01-19 |
| | | | | the evaluation of an expression. | ||
* | Removed an incorrect assertion. | Benoit Steiner | 2015-07-27 |
| | |||
* | Fixed a typo in an assertion. | Benoit Steiner | 2015-07-24 |
| | |||
* | Silenced several compilation warnings | Benoit Steiner | 2015-02-10 |
| | |||
* | Improved support for RowMajor tensors | Benoit Steiner | 2015-01-14 |
| | | | | Misc fixes and API cleanups. | ||
* | Misc improvements and cleanups | Benoit Steiner | 2014-10-13 |
| | |||
* | Added suppor for in place evaluation to simple tensor expressions. | Benoit Steiner | 2014-08-13 |
| | | | | Use mempy to speedup tensor copies whenever possible. | ||
* | Reworked the expression evaluation mechanism in order to make it possible to ↵ | Benoit Steiner | 2014-06-13 |
| | | | | | | | | efficiently compute convolutions and contractions in the future: * The scheduling of computation is moved out the the assignment code and into a new TensorExecutor class * The assignment itself is now a regular node on the expression tree * The expression evaluators start by recursively evaluating all their subexpressions if needed | ||
* | TensorEval are now typed on the device: this will make it possible to use ↵ | Benoit Steiner | 2014-06-10 |
| | | | | | | partial template specialization to optimize the strategy of each evaluator for each device type. Started work on partial evaluations. | ||
* | Added support for tensor contractions | Benoit Steiner | 2014-06-04 |
| | | | | | Updated expression evaluation mechanism to also compute the size of the tensor result Misc fixes and improvements. | ||
* | Vectorized the evaluation of tensor expression (using SSE, AVX, NEON, ...) | Benoit Steiner | 2014-05-16 |
| | | | | | Added the ability to parallelize the evaluation of a tensor expression over multiple cpu cores. Added the ability to offload the evaluation of a tensor expression to a GPU. | ||
* | Extended support for Tensors: | Benoit Steiner | 2014-04-28 |
* Added ability to map a region of the memory to a tensor * Added basic support for unary and binary coefficient wise expressions, such as addition or square root * Provided an emulation layer to make it possible to compile the code with compilers (such as nvcc) that don't support cxx11. |