Commit message | Author | Age | |
---|---|---|---|
* | Fix calls to device functions from host code | Nathan Luehr | 2021-05-11 |
| | |||
* | Fix TensorPadding bug in squeezed reads from inner dimension | Eugene Zhulenev | 2019-12-19 |
| | |||
* | Tensor block evaluation cost model | Eugene Zhulenev | 2019-12-18 |
| | |||
* | Squeeze reads from two inner dimensions in TensorPadding | Eugene Zhulenev | 2019-12-11 |
| | |||
* | Remove V2 suffix from TensorBlock | Eugene Zhulenev | 2019-12-10 |
| | |||
* | Do not use std::vector in getResourceRequirements | Eugene Zhulenev | 2019-12-09 |
| | |||
* | Add async evaluation support to TensorPadding/TensorImagePatch/TensorShuffling | Eugene Zhulenev | 2019-11-26 |
| | |||
* | Remove legacy block evaluation support | Eugene Zhulenev | 2019-11-12 |
| | |||
* | Cleanup Tensor block destination and materialized block storage allocation | Eugene Zhulenev | 2019-10-16 |
| | |||
* | Block evaluation for TensorGenerator/TensorReverse/TensorShuffling | Eugene Zhulenev | 2019-10-14 |
| | |||
* | Block evaluation for TensorChipping + fixed bugs in TensorPadding and TensorSlicing | Eugene Zhulenev | 2019-10-09 |
| | |||
* | Add block evaluation to TensorEvalTo and fix few small bugs | Eugene Zhulenev | 2019-10-07 |
| | |||
* | Fix compilation warnings and errors with clang in TensorBlockV2 code and tests | Eugene Zhulenev | 2019-10-04 |
| | |||
* | Add block evaluation to TensorReshaping/TensorCasting/TensorPadding/TensorSelect | Eugene Zhulenev | 2019-10-02 |
| | |||
* | Tensor block evaluation V2 support for unary/binary/broadcasting | Eugene Zhulenev | 2019-09-24 |
| | |||
* | [SYCL] This PR adds the minimum modifications to the Eigen unsupported module required to run it on devices supporting SYCL. | Mehdi Goli | 2019-06-28 |
| | * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes. | ||
* | Fix shadowing of last and all | Gael Guennebaud | 2018-09-21 |
| | |||
* | Add block evaluation to CwiseUnaryOp and add PreferBlockAccess enum to all evaluators | Eugene Zhulenev | 2018-08-10 |
| | |||
* | Enabling per device specialisation of packetsize. | Mehdi Goli | 2018-08-01 |
| | |||
* | Add tiled evaluation support to TensorExecutor | Eugene Zhulenev | 2018-07-25 |
| | |||
* | Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10) | Benoit Steiner | 2017-06-28 |
| | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments | ||
* | Adding sycl backend for TensorPadding.h; disabling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code. | Mehdi Goli | 2016-12-01 |
| | |||
* | Marked unused variables as such | Benoit Steiner | 2016-05-25 |
| | |||
* | Improved the performance of tensor padding | Benoit Steiner | 2016-05-25 |
| | |||
* | Allow vectorized padding on GPU. This helps speed things up a little | Benoit Steiner | 2016-05-17 |
| | Before: BM_padding/10 5000000 460 217.03 MFlops/s; BM_padding/80 5000000 460 13899.40 MFlops/s; BM_padding/640 5000000 461 888421.17 MFlops/s; BM_padding/4K 5000000 460 54316322.55 MFlops/s. After: BM_padding/10 5000000 454 220.20 MFlops/s; BM_padding/80 5000000 455 14039.86 MFlops/s; BM_padding/640 5000000 452 904968.83 MFlops/s; BM_padding/4K 5000000 411 60750049.21 MFlops/s. | ||
* | Fixed compilation errors triggered by old versions of gcc | Benoit Steiner | 2016-05-12 |
| | |||
* | Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions. | Rasmus Munk Larsen | 2016-04-14 |
| | |||
* | Decoupled the packet type definition from the definition of the tensor ops. All the vectorization is now defined in the tensor evaluators. This will make it possible to reliably support devices with different packet types in the same compilation unit. | Benoit Steiner | 2016-03-08 |
| | |||
* | Added the ability to pad a tensor using a non-zero value | Benoit Steiner | 2016-03-07 |
| | |||
* | Updated the padding code to work with half floats | Benoit Steiner | 2016-02-23 |
| | |||
* | Record whether the underlying tensor storage can be accessed directly during the evaluation of an expression. | Benoit Steiner | 2016-01-19 |
| | |||
* | Added static assertions to avoid misuses of padding, broadcasting and concatenation ops. | Benoit Steiner | 2015-11-06 |
| | |||
* | Many files were missing in previous changeset. | Gael Guennebaud | 2015-07-29 |
| | |||
* | Fix shadow warnings in Tensor module | Gael Guennebaud | 2015-06-16 |
| | |||
* | Silenced the last batch of compilation warnings triggered by gcc 4.8 | Benoit Steiner | 2015-02-10 |
| | |||
* | Improved support for RowMajor tensors | Benoit Steiner | 2015-01-14 |
| | Misc fixes and API cleanups. | ||
* | Use the proper index type in the padding code | Benoit Steiner | 2014-10-30 |
| | |||
* | Silenced a few compilation warnings | Benoit Steiner | 2014-10-16 |
| | Generalized a TensorMap constructor | ||
* | Misc improvements and cleanups | Benoit Steiner | 2014-10-13 |
| | |||
* | Misc fixes. | Benoit Steiner | 2014-09-05 |
| | |||
* | Optimized the tensor padding code. | Benoit Steiner | 2014-08-26 |
| | |||
* | Added support for padding, striding, and shuffling | Benoit Steiner | 2014-08-14 |