eigen - C++ library for linear algebra

	Commit message (Collapse)	Author	Age
*	Workaround stupid warning	Gael Guennebaud	2018-10-08
\|
*	Add block evaluationto CwiseUnaryOp and add PreferBlockAccess enum to all ↵	Eugene Zhulenev	2018-08-10
\| \| \| \|	evaluators
*	Enabling per device specialisation of packetsize.	Mehdi Goli	2018-08-01
\|
*	merging the CUDA and HIP implementation for the Tensor directory and the ↵	Deven Desai	2018-06-20
\| \| \| \|	unit tests
*	updates based on PR feedback	Deven Desai	2018-06-14
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	There are two major changes (and a few minor ones which are not listed here...see PR discussion for details) 1. Eigen::half implementations for HIP and CUDA have been merged. This means that - `CUDA/Half.h` and `HIP/hcc/Half.h` got merged to a new file `GPU/Half.h` - `CUDA/PacketMathHalf.h` and `HIP/hcc/PacketMathHalf.h` got merged to a new file `GPU/PacketMathHalf.h` - `CUDA/TypeCasting.h` and `HIP/hcc/TypeCasting.h` got merged to a new file `GPU/TypeCasting.h` After this change the `HIP/hcc` directory only contains one file `math_constants.h`. That will go away too once that file becomes a part of the HIP install. 2. new macros EIGEN_GPUCC, EIGEN_GPU_COMPILE_PHASE and EIGEN_HAS_GPU_FP16 have been added and the code has been updated to use them where appropriate. - `EIGEN_GPUCC` is the same as `(EIGEN_CUDACC \|\| EIGEN_HIPCC)` - `EIGEN_GPU_DEVICE_COMPILE` is the same as `(EIGEN_CUDA_ARCH \|\| EIGEN_HIP_DEVICE_COMPILE)` - `EIGEN_HAS_GPU_FP16` is the same as `(EIGEN_HAS_CUDA_FP16 or EIGEN_HAS_HIP_FP16)`
*	Adding support for using Eigen in HIP kernels.	Deven Desai	2018-06-06
\| \| \| \| \| \| \| \| \|	This commit enables the use of Eigen on HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compile (irrespective of whether or not the underlying compiler is CUDACC/NVCC, for e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to using the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor) Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP specific unit tests.
*	Add a EIGEN_NO_CUDA option, and introduce EIGEN_CUDACC and EIGEN_CUDA_ARCH ↵	Gael Guennebaud	2017-07-17
\| \| \| \|	aliases
*	Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10)	Benoit Steiner	2017-06-28
\| \| \| \| \| \| \| \| \| \|	DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments
*	Improved the detection of packet size in the tensor scan evaluator.	Benoit Steiner	2016-07-11
\|
*	Add missing CUDA kernel to tensor scan op	Igor Babuschkin	2016-06-29
\| \| \| \| \|	The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation.
*	Don't store the scan axis in the evaluator of the tensor scan operation ↵	Benoit Steiner	2016-06-27
\| \| \| \| \| \|	since it's only used in the constructor. Also avoid taking references to values that may becomes stale after a copy construction.
*	Implement exclusive scan option	Igor Babuschkin	2016-06-14
\|
*	Merged in ibab/eigen (pull request PR-195)	Benoit Steiner	2016-06-10
\|\ \| \| \| \| \| \|	Add small fixes to TensorScanOp
\| *	Add small fixes to TensorScanOp	Igor Babuschkin	2016-06-07
\| \|
* \|	Improved code formatting	Benoit Steiner	2016-06-09
\|/
*	Fixed compilation error with gcc 4.4	Benoit Steiner	2016-06-06
\|
*	Add tensor scan op	Igor Babuschkin	2016-06-02
	This is the initial implementation a generic scan operation. Based on this, cumsum and cumprod method have been added to TensorBase.