aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor/TensorScan.h
Commit message (Collapse)AuthorAge
* Workaround stupid warningGravatar Gael Guennebaud2018-10-08
|
* Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵Gravatar Eugene Zhulenev2018-08-10
| | | | evaluators
* Enabling per device specialisation of packetsize.Gravatar Mehdi Goli2018-08-01
|
* merging the CUDA and HIP implementation for the Tensor directory and the ↵Gravatar Deven Desai2018-06-20
| | | | unit tests
* updates based on PR feedbackGravatar Deven Desai2018-06-14
| | | | | | | | | | | | | | | | | There are two major changes (and a few minor ones which are not listed here...see PR discussion for details) 1. Eigen::half implementations for HIP and CUDA have been merged. This means that - `CUDA/Half.h` and `HIP/hcc/Half.h` got merged to a new file `GPU/Half.h` - `CUDA/PacketMathHalf.h` and `HIP/hcc/PacketMathHalf.h` got merged to a new file `GPU/PacketMathHalf.h` - `CUDA/TypeCasting.h` and `HIP/hcc/TypeCasting.h` got merged to a new file `GPU/TypeCasting.h` After this change the `HIP/hcc` directory only contains one file `math_constants.h`. That will go away too once that file becomes a part of the HIP install. 2. new macros EIGEN_GPUCC, EIGEN_GPU_COMPILE_PHASE and EIGEN_HAS_GPU_FP16 have been added and the code has been updated to use them where appropriate. - `EIGEN_GPUCC` is the same as `(EIGEN_CUDACC || EIGEN_HIPCC)` - `EIGEN_GPU_DEVICE_COMPILE` is the same as `(EIGEN_CUDA_ARCH || EIGEN_HIP_DEVICE_COMPILE)` - `EIGEN_HAS_GPU_FP16` is the same as `(EIGEN_HAS_CUDA_FP16 or EIGEN_HAS_HIP_FP16)`
* Adding support for using Eigen in HIP kernels.Gravatar Deven Desai2018-06-06
| | | | | | | | | This commit enables the use of Eigen on HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compile (irrespective of whether or not the underlying compiler is CUDACC/NVCC, for e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to using the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor) Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP specific unit tests.
* Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH ↵Gravatar Gael Guennebaud2017-07-17
| | | | aliases
* Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10)Gravatar Benoit Steiner2017-06-28
| | | | | | | | | | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments
* Improved the detection of packet size in the tensor scan evaluator.Gravatar Benoit Steiner2016-07-11
|
* Add missing CUDA kernel to tensor scan opGravatar Igor Babuschkin2016-06-29
| | | | | The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation.
* Don't store the scan axis in the evaluator of the tensor scan operation ↵Gravatar Benoit Steiner2016-06-27
| | | | | | since it's only used in the constructor. Also avoid taking references to values that may becomes stale after a copy construction.
* Implement exclusive scan optionGravatar Igor Babuschkin2016-06-14
|
* Merged in ibab/eigen (pull request PR-195)Gravatar Benoit Steiner2016-06-10
|\ | | | | | | Add small fixes to TensorScanOp
| * Add small fixes to TensorScanOpGravatar Igor Babuschkin2016-06-07
| |
* | Improved code formattingGravatar Benoit Steiner2016-06-09
|/
* Fixed compilation error with gcc 4.4Gravatar Benoit Steiner2016-06-06
|
* Add tensor scan opGravatar Igor Babuschkin2016-06-02
This is the initial implementation a generic scan operation. Based on this, cumsum and cumprod method have been added to TensorBase.