aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorMorphing.h
Commit message (Collapse)AuthorAge
* Move struct outside of method for C++03 compatibility.Gravatar Christoph Hertzberg2018-10-02
|
* Fix bug in copy optimization in Tensor slicing.Gravatar Eugene Zhulenev2018-09-28
|
* Const cast scalar pointer in TensorSlicingOp evaluatorGravatar Eugene Zhulenev2018-09-14
|
* Fix compilation of tiled evaluation code with c++03Gravatar Eugene Zhulenev2018-09-11
|
* Merge with upstream eigen/defaultGravatar Eugene Zhulenev2018-08-27
|\
| * Fixed more sign-compare and type-limits warningsGravatar Christoph Hertzberg2018-08-24
| |
* | Merge with eigen/defaultGravatar Eugene Zhulenev2018-08-10
|\|
* | Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵Gravatar Eugene Zhulenev2018-08-10
| | | | | | | | evaluators
* | Fix bug in a test + compilation errorsGravatar Eugene Zhulenev2018-08-09
| |
* | Replace all using declarations with typedefs in Tensor opsGravatar Eugene Zhulenev2018-08-01
| |
* | Fix typo + get rid of redundant member variables for block sizesGravatar Eugene Zhulenev2018-08-01
| |
* | Merged latest changes from upstream/eigenGravatar Eugene Zhulenev2018-08-01
|\|
| * Enabling per device specialisation of packetsize.Gravatar Mehdi Goli2018-08-01
| |
* | Add block evaluation support to TensorOpsGravatar Eugene Zhulenev2018-07-31
|/
* Add tiled evaluation support to TensorExecutorGravatar Eugene Zhulenev2018-07-25
|
* Updates corresponding to the latest round of PR feedbackGravatar Deven Desai2018-07-11
| | | | | | | | | | | | | | The major changes are 1. Moving CUDA/PacketMath.h to GPU/PacketMath.h 2. Moving CUDA/MathFunctions.h to GPU/MathFunction.h 3. Moving CUDA/CudaSpecialFunctions.h to GPU/GpuSpecialFunctions.h The above three changes effectively enable the Eigen "Packet" layer for the HIP platform 4. Merging the "hip_basic" and "cuda_basic" unit tests into one ("gpu_basic") 5. Updating the "EIGEN_DEVICE_FUNC" marking in some places The change has been tested on the HIP and CUDA platforms.
* Adding support for using Eigen in HIP kernels.Gravatar Deven Desai2018-06-06
| | | | | | | | | This commit enables the use of Eigen on HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compile (irrespective of whether or not the underlying compiler is CUDACC/NVCC, for e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to using the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor) Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP specific unit tests.
* Enable RawAccess to tensor slices whenever possinle.Gravatar Benoit Steiner2018-04-30
| | | | Avoid 32-bit integer overflow in TensorSlicingOp
* Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10)Gravatar Benoit Steiner2017-06-28
| | | | | | | | | | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments
* Adding non-deferrenciable pointer track for ComputeCpp backend; Adding ↵Gravatar Mehdi Goli2017-01-19
| | | | TensorConvolutionOp for ComputeCpp; fixing typos. modifying TensorDeviceSycl to use the LegacyPointer class.
* Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying ↵Gravatar Mehdi Goli2017-01-16
| | | | Tensor Contractsycl to be located in any place in the expression tree.
* Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in ↵Gravatar Mehdi Goli2016-12-01
| | | | TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.
* Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl.Gravatar Mehdi Goli2016-11-14
|
* Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing.Gravatar Mehdi Goli2016-11-14
|
* Added missing EIGEN_DEVICE_FUNCGravatar Benoit Steiner2016-06-07
|
* Fixed compilation warningGravatar Benoit Steiner2016-06-01
|
* Reimplement clamp as a static function.Gravatar Benoit Steiner2016-05-27
|
* Use NULL instead of nullptr to preserve the compatibility with cxx03Gravatar Benoit Steiner2016-05-27
|
* Added a new operation to enable more powerful tensorindexing.Gravatar Benoit Steiner2016-05-27
|
* Fixed compilation errors triggered by old versions of gccGravatar Benoit Steiner2016-05-12
|
* Eigen cost model part 1. This implements a basic recursive framework to ↵Gravatar Rasmus Munk Larsen2016-04-14
| | | | estimate the cost of evaluating tensor expressions.
* Worked around a constness related issueGravatar Benoit Steiner2016-03-21
|
* Small cleanupGravatar Benoit Steiner2016-03-21
|
* Decoupled the packet type definition from the definition of the tensor ops. ↵Gravatar Benoit Steiner2016-03-08
| | | | All the vectorization is now defined in the tensor evaluators. This will make it possible to relialably support devices with different packet types in the same compilation unit.
* Deleted the coordinate based evaluation of tensor expressions, since it's ↵Gravatar Benoit Steiner2016-02-22
| | | | hardly ever used and started to cause some issues with some versions of xcode.
* Fixed a constness bugGravatar Benoit Steiner2016-01-21
|
* Record whether the underlying tensor storage can be accessed directly during ↵Gravatar Benoit Steiner2016-01-19
| | | | the evaluation of an expression.
* Add alignment requirement for local buffer used by the slicing op.Gravatar Benoit Steiner2015-12-18
|
* Fix return type for TensorEvaluator<TensorSlicingOp>::dataGravatar Benoit Steiner2015-09-17
|
* Many files were missing in previous changeset.Gravatar Gael Guennebaud2015-07-29
|
* Use NumTraits<T>::RequireInitialization instead of ↵Gravatar Benoit Steiner2015-07-07
| | | | internal::is_arithmetic<T>::value to check whether it's possible to bypass the type constructor in the tensor code.
* Misc small fixes to the tensor slicing code.Gravatar Benoit Steiner2015-07-06
|
* Fixed a couple of mistakes in the previous commit.Gravatar Benoit Steiner2015-07-01
|
* Enabled the vectorized evaluation of several tensor expressions that was ↵Gravatar Benoit Steiner2015-07-01
| | | | previously disabled by mistake
* Avoid using the cuda memcpy for small tensor slices since the memcpy kernel ↵Gravatar Benoit Steiner2015-05-19
| | | | is very expensive to launch
* Sped up the assignment of a tensor to a tensor slice, as well as the ↵Gravatar Benoit Steiner2015-04-20
| | | | assigment of a constant slice to a tensor
* Fixed another batch of compilation warningsGravatar Benoit Steiner2015-02-28
|
* Fixed several compilation warnings reported by clangGravatar Benoit Steiner2015-02-25
|
* Silenced a few compilation warnings generated by nvccGravatar Benoit Steiner2015-02-10
|
* Silenced more compilation warningsGravatar Benoit Steiner2015-02-10
|