Commit message | Author | Age | |
---|---|---|---|
* | Move struct outside of method for C++03 compatibility. | 2018-10-02 | |
| | |||
* | Fix bug in copy optimization in Tensor slicing. | 2018-09-28 | |
| | |||
* | Const cast scalar pointer in TensorSlicingOp evaluator | 2018-09-14 | |
| | |||
* | Fix compilation of tiled evaluation code with c++03 | 2018-09-11 | |
| | |||
* | Merge with upstream eigen/default | 2018-08-27 | |
|\ | |||
| * | Fixed more sign-compare and type-limits warnings | 2018-08-24 | |
| | | |||
* | | Merge with eigen/default | 2018-08-10 | |
|\| | |||
* | | Add block evaluation to CwiseUnaryOp and add PreferBlockAccess enum to all evaluators | 2018-08-10 | |
| | | |||
* | | Fix bug in a test + compilation errors | 2018-08-09 | |
| | | |||
* | | Replace all using declarations with typedefs in Tensor ops | 2018-08-01 | |
| | | |||
* | | Fix typo + get rid of redundant member variables for block sizes | 2018-08-01 | |
| | | |||
* | | Merged latest changes from upstream/eigen | 2018-08-01 | |
|\| | |||
| * | Enabling per device specialisation of packetsize. | 2018-08-01 | |
| | | |||
* | | Add block evaluation support to TensorOps | 2018-07-31 | |
|/ | |||
* | Add tiled evaluation support to TensorExecutor | 2018-07-25 | |
| | |||
* | Updates corresponding to the latest round of PR feedback | 2018-07-11 | |
| | | | | | | | | | | | | | | The major changes are: 1. Moving CUDA/PacketMath.h to GPU/PacketMath.h. 2. Moving CUDA/MathFunctions.h to GPU/MathFunction.h. 3. Moving CUDA/CudaSpecialFunctions.h to GPU/GpuSpecialFunctions.h. The above three changes effectively enable the Eigen "Packet" layer for the HIP platform. 4. Merging the "hip_basic" and "cuda_basic" unit tests into one ("gpu_basic"). 5. Updating the "EIGEN_DEVICE_FUNC" marking in some places. The change has been tested on the HIP and CUDA platforms. | ||
* | Adding support for using Eigen in HIP kernels. | 2018-06-06 | |
| | | | | | | | | | This commit enables the use of Eigen in HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compilation, irrespective of whether or not the underlying compiler is CUDACC/NVCC (e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor). Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP-specific unit tests. | ||
* | Enable RawAccess to tensor slices whenever possible. | 2018-04-30 | |
| | | | | Avoid 32-bit integer overflow in TensorSlicingOp | ||
* | Merged in mehdi_goli/opencl/DataDependancy (pull request PR-10) | 2017-06-28 | |
| | | | | | | | | | | DataDependancy * Wrapping data type to the pointer class for sycl in non-terminal nodes; not having that breaks Tensorflow Conv2d code. * Applying Ronnan's Comments. * Applying benoit's comments | ||
* | Adding non-dereferenceable pointer track for ComputeCpp backend; adding TensorConvolutionOp for ComputeCpp; fixing typos; modifying TensorDeviceSycl to use the LegacyPointer class. | 2017-01-19 | |
| | |||
* | Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying Tensor Contractsycl to be located in any place in the expression tree. | 2017-01-16 | |
| | |||
* | Adding sycl backend for TensorPadding.h; disabling __uint128 for sycl in TensorIntDiv.h; disabling cache size for sycl in TensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code. | 2016-12-01 | |
| | |||
* | Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl. | 2016-11-14 | |
| | |||
* | Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing. | 2016-11-14 | |
| | |||
* | Added missing EIGEN_DEVICE_FUNC | 2016-06-07 | |
| | |||
* | Fixed compilation warning | 2016-06-01 | |
| | |||
* | Reimplement clamp as a static function. | 2016-05-27 | |
| | |||
* | Use NULL instead of nullptr to preserve the compatibility with cxx03 | 2016-05-27 | |
| | |||
* | Added a new operation to enable more powerful tensorindexing. | 2016-05-27 | |
| | |||
* | Fixed compilation errors triggered by old versions of gcc | 2016-05-12 | |
| | |||
* | Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions. | 2016-04-14 | |
| | |||
* | Worked around a constness related issue | 2016-03-21 | |
| | |||
* | Small cleanup | 2016-03-21 | |
| | |||
* | Decoupled the packet type definition from the definition of the tensor ops. All the vectorization is now defined in the tensor evaluators. This will make it possible to reliably support devices with different packet types in the same compilation unit. | 2016-03-08 | |
| | |||
* | Deleted the coordinate based evaluation of tensor expressions, since it's hardly ever used and started to cause some issues with some versions of xcode. | 2016-02-22 | |
| | |||
* | Fixed a constness bug | 2016-01-21 | |
| | |||
* | Record whether the underlying tensor storage can be accessed directly during the evaluation of an expression. | 2016-01-19 | |
| | |||
* | Add alignment requirement for local buffer used by the slicing op. | 2015-12-18 | |
| | |||
* | Fix return type for TensorEvaluator<TensorSlicingOp>::data | 2015-09-17 | |
| | |||
* | Many files were missing in previous changeset. | 2015-07-29 | |
| | |||
* | Use NumTraits<T>::RequireInitialization instead of internal::is_arithmetic<T>::value to check whether it's possible to bypass the type constructor in the tensor code. | 2015-07-07 | |
| | |||
* | Misc small fixes to the tensor slicing code. | 2015-07-06 | |
| | |||
* | Fixed a couple of mistakes in the previous commit. | 2015-07-01 | |
| | |||
* | Enabled the vectorized evaluation of several tensor expressions that had previously been disabled by mistake | 2015-07-01 | |
| | |||
* | Avoid using the cuda memcpy for small tensor slices since the memcpy kernel is very expensive to launch | 2015-05-19 | |
| | |||
* | Sped up the assignment of a tensor to a tensor slice, as well as the assignment of a constant slice to a tensor | 2015-04-20 | |
| | |||
* | Fixed another batch of compilation warnings | 2015-02-28 | |
| | |||
* | Fixed several compilation warnings reported by clang | 2015-02-25 | |
| | |||
* | Silenced a few compilation warnings generated by nvcc | 2015-02-10 | |
| | |||
* | Silenced more compilation warnings | 2015-02-10 | |
| |