Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Fix calls to device functions from host code | 2021-05-11 | |
| | |||
* | Remove V2 suffix from TensorBlock | 2019-12-10 | |
| | |||
* | Do not use std::vector in getResourceRequirements | 2019-12-09 | |
| | |||
* | Add async evaluation support to TensorPadding/TensorImagePatch/TensorShuffling | 2019-11-26 | |
| | |||
* | Remove legacy block evaluation support | 2019-11-12 | |
| | |||
* | Add block evaluation V2 to TensorAsyncExecutor. | 2019-10-22 | |
| | | | | Add async evaluation to a number of ops. | ||
* | Cleanup Tensor block destination and materialized block storage allocation | 2019-10-16 | |
| | |||
* | Block evaluation for TensorGenerator + TensorReverse + fixed bug in tensor ↵ | 2019-10-10 | |
| | | | | reverse op | ||
* | Add block evaluation to TensorEvalTo and fix few small bugs | 2019-10-07 | |
| | |||
* | Tensor block evaluation V2 support for unary/binary/broadcsting | 2019-09-24 | |
| | |||
* | Merge with Eigen head | 2019-06-28 | |
|\ | |||
* | | Add block access to TensorReverseOp and make sure that TensorForcedEval uses ↵ | 2019-06-28 | |
| | | | | | | | | block access when preferred | ||
| * | [SYCL] This PR adds the minimum modifications to the Eigen unsupported ↵ | 2019-06-28 | |
|/ | | | | | | | | | | module required to run it on devices supporting SYCL. * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes. | ||
* | Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵ | 2018-08-10 | |
| | | | | evaluators | ||
* | Enabling per device specialisation of packetsize. | 2018-08-01 | |
| | |||
* | Add tiled evaluation support to TensorExecutor | 2018-07-25 | |
| | |||
* | Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10) | 2017-06-28 | |
| | | | | | | | | | | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments | ||
* | Converting all parallel for lambda to functor in order to prevent kernel ↵ | 2016-12-16 | |
| | | | | duplication name error; adding tensorConcatinationOp backend for sycl. | ||
* | Removed the sycl include from Eigen/Core and moved it to ↵ | 2016-11-04 | |
| | | | | Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size; | ||
* | Worked around Visual Studio compilation errors | 2016-10-28 | |
| | |||
* | Made TensorEvalTo compatible with c++0x again. | 2016-09-23 | |
| | |||
* | Partial OpenCL support via SYCL compatible with ComputeCpp CE. | 2016-09-19 | |
| | |||
* | An evalTo expression is only aligned iff both the lhs and the rhs are aligned. | 2016-07-12 | |
| | |||
* | Marked a few tensor operations as read only | 2016-05-05 | |
| | |||
* | Deleted trailing commas | 2016-04-29 | |
| | |||
* | Fixed the partial evaluation of non vectorizable tensor subexpressions | 2016-04-25 | |
| | |||
* | Eigen cost model part 1. This implements a basic recursive framework to ↵ | 2016-04-14 | |
| | | | | estimate the cost of evaluating tensor expressions. | ||
* | Marked variables that's only used in debug mode as such | 2016-03-21 | |
| | |||
* | Decoupled the packet type definition from the definition of the tensor ops. ↵ | 2016-03-08 | |
| | | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit. | ||
* | Added missing EIGEN_DEVICE_FUNC qualifier | 2016-01-24 | |
| | |||
* | Record whether the underlying tensor storage can be accessed directly during ↵ | 2016-01-19 | |
| | | | | the evaluation of an expression. | ||
* | Misc improvements and optimizations | 2015-07-01 | |
| | |||
* | Silenced a few compilation warnings generated by nvcc | 2015-02-10 | |
| | |||
* | Improved support for RowMajor tensors | 2015-01-14 | |
| | | | | Misc fixes and API cleanups. | ||
* | Misc improvements and cleanups | 2014-10-13 | |
| | |||
* | Improved the performance of the tensor convolution code by a factor of about 4. | 2014-09-03 | |
| | |||
* | Reworked the expression evaluation mechanism in order to make it possible to ↵ | 2014-06-13 | |
efficiently compute convolutions and contractions in the future: * The scheduling of computation is moved out the the assignment code and into a new TensorExecutor class * The assignment itself is now a regular node on the expression tree * The expression evaluators start by recursively evaluating all their subexpressions if needed |