aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorPadding.h
Commit message (Collapse)AuthorAge
* Fix calls to device functions from host codeGravatar Nathan Luehr2021-05-11
|
* Fix TensorPadding bug in squeezed reads from inner dimension Gravatar Eugene Zhulenev2019-12-19
|
* Tensor block evaluation cost modelGravatar Eugene Zhulenev2019-12-18
|
* Squeeze reads from two inner dimensions in TensorPaddingGravatar Eugene Zhulenev2019-12-11
|
* Remove V2 suffix from TensorBlockGravatar Eugene Zhulenev2019-12-10
|
* Do not use std::vector in getResourceRequirementsGravatar Eugene Zhulenev2019-12-09
|
* Add async evaluation support to TensorPadding/TensorImagePatch/TensorShufflingGravatar Eugene Zhulenev2019-11-26
|
* Remove legacy block evaluation supportGravatar Eugene Zhulenev2019-11-12
|
* Cleanup Tensor block destination and materialized block storage allocationGravatar Eugene Zhulenev2019-10-16
|
* Block evaluation for TensorGenerator/TensorReverse/TensorShufflingGravatar Eugene Zhulenev2019-10-14
|
* Block evaluation for TensorChipping + fixed bugs in TensorPadding and ↵Gravatar Eugene Zhulenev2019-10-09
| | | | TensorSlicing
* Add block evaluation to TensorEvalTo and fix few small bugsGravatar Eugene Zhulenev2019-10-07
|
* Fix compilation warnings and errors with clang in TensorBlockV2 code and testsGravatar Eugene Zhulenev2019-10-04
|
* Add block evaluation to TensorReshaping/TensorCasting/TensorPadding/TensorSelectGravatar Eugene Zhulenev2019-10-02
|
* Tensor block evaluation V2 support for unary/binary/broadcstingGravatar Eugene Zhulenev2019-09-24
|
* [SYCL] This PR adds the minimum modifications to the Eigen unsupported ↵Gravatar Mehdi Goli2019-06-28
| | | | | | | | | | module required to run it on devices supporting SYCL. * Abstracting the pointer type so that both SYCL memory and pointer can be captured. * Converting SYCL virtual pointer to SYCL device memory in Eigen evaluator class. * Binding SYCL placeholder accessor to command group handler by using bind method in Eigen evaluator node. * Adding SYCL macro for controlling loop unrolling. * Modifying the TensorDeviceSycl.h and SYCL executor method to adopt the above changes.
* Fiw shadowing of last and allGravatar Gael Guennebaud2018-09-21
|
* Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵Gravatar Eugene Zhulenev2018-08-10
| | | | evaluators
* Enabling per device specialisation of packetsize.Gravatar Mehdi Goli2018-08-01
|
* Add tiled evaluation support to TensorExecutorGravatar Eugene Zhulenev2018-07-25
|
* Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10)Gravatar Benoit Steiner2017-06-28
| | | | | | | | | | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments
* Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in ↵Gravatar Mehdi Goli2016-12-01
| | | | TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.
* Marked unused variables as suchGravatar Benoit Steiner2016-05-25
|
* Improved the performance of tensor paddingGravatar Benoit Steiner2016-05-25
|
* Allow vectorized padding on GPU. This helps speed things up a littleGravatar Benoit Steiner2016-05-17
| | | | | | | | | | | | | Before: BM_padding/10 5000000 460 217.03 MFlops/s BM_padding/80 5000000 460 13899.40 MFlops/s BM_padding/640 5000000 461 888421.17 MFlops/s BM_padding/4K 5000000 460 54316322.55 MFlops/s After: BM_padding/10 5000000 454 220.20 MFlops/s BM_padding/80 5000000 455 14039.86 MFlops/s BM_padding/640 5000000 452 904968.83 MFlops/s BM_padding/4K 5000000 411 60750049.21 MFlops/s
* Fixed compilation errors triggered by old versions of gccGravatar Benoit Steiner2016-05-12
|
* Eigen cost model part 1. This implements a basic recursive framework to ↵Gravatar Rasmus Munk Larsen2016-04-14
| | | | estimate the cost of evaluating tensor expressions.
* Decoupled the packet type definition from the definition of the tensor ops. ↵Gravatar Benoit Steiner2016-03-08
| | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit.
* Added the ability to pad a tensor using a non-zero valueGravatar Benoit Steiner2016-03-07
|
* Updated the padding code to work with half floatsGravatar Benoit Steiner2016-02-23
|
* Record whether the underlying tensor storage can be accessed directly during ↵Gravatar Benoit Steiner2016-01-19
| | | | the evaluation of an expression.
* Added static assertions to avoid misuses of padding, broadcasting and ↵Gravatar Benoit Steiner2015-11-06
| | | | concatenation ops.
* Many files were missing in previous changeset.Gravatar Gael Guennebaud2015-07-29
|
* Fix shadow warnings in Tensor moduleGravatar Gael Guennebaud2015-06-16
|
* Silenced the last batch of compilation warnings triggered by gcc 4.8Gravatar Benoit Steiner2015-02-10
|
* Improved support for RowMajor tensorsGravatar Benoit Steiner2015-01-14
| | | | Misc fixes and API cleanups.
* Use the proper index type in the padding codeGravatar Benoit Steiner2014-10-30
|
* Silenced a few compilation warningsGravatar Benoit Steiner2014-10-16
| | | | Generalized a TensorMap constructor
* Misc improvements and cleanupsGravatar Benoit Steiner2014-10-13
|
* Misc fixes.Gravatar Benoit Steiner2014-09-05
|
* Optimized the tensor padding code.Gravatar Benoit Steiner2014-08-26
|
* Added support for padding, stridding, and shufflingGravatar Benoit Steiner2014-08-14