Commit message | Author | Age | ||
---|---|---|---|---|
... | ||||
* | | Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying Tensor Contractsycl to be located in any place in the expression tree. | Mehdi Goli | 2017-01-16 | |
| | | ||||
| * | Simplified the way we link libxsmm | Benoit Steiner | 2016-12-21 | |
| | | ||||
| * | Added support for libxsmm in the eigen makefiles | Benoit Steiner | 2016-12-21 | |
|/ | ||||
* | Converting all parallel for lambda to functor in order to prevent kernel duplication name error; adding tensorConcatinationOp backend for sycl. | Mehdi Goli | 2016-12-16 | |
| ||||
* | Adding tensor contraction operation backend for Sycl; adding test for contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h | Mehdi Goli | 2016-12-14 | |
| ||||
* | Adding sycl backend for TensorPadding.h; disabling __unit128 for sycl in TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code. | Mehdi Goli | 2016-12-01 | |
| ||||
* | Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend. | Mehdi Goli | 2016-11-29 | |
| ||||
* | Specialised basic math functions for SYCL device. | Luke Iwanski | 2016-11-17 | |
| | ||||
* | Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing. | Mehdi Goli | 2016-11-14 | |
| | ||||
* | Removed the sycl include from Eigen/Core and moved it to Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size. | Mehdi Goli | 2016-11-04 | |
| ||||
* | Reducing the code by generalising sycl backend functions/structs. | Mehdi Goli | 2016-10-14 | |
| | ||||
* | Pull the latest updates from trunk | Benoit Steiner | 2016-10-05 | |
|\ | ||||
| * | Improved support for compiling CUDA code with clang as the host compiler | Benoit Steiner | 2016-10-03 | |
| | | ||||
* | | Renamed the SYCL tests to follow the standard naming convention. | Benoit Steiner | 2016-09-30 | |
| | | ||||
| * | Add CUDA-specific std::complex<T> specializations for scalar_sum_op, scalar_difference_op, scalar_product_op, and scalar_quotient_op. | RJ Ryan | 2016-09-20 | |
| | | ||||
* | | Merged with default. | Luke Iwanski | 2016-09-19 | |
|\| | ||||
* | | Partial OpenCL support via SYCL compatible with ComputeCpp CE. | Luke Iwanski | 2016-09-19 | |
| | | ||||
| * | Register the cxx11_tensor_device only for recent cuda architectures (i.e. >= 3.0) since the test instantiates contractions that require a modern gpu. | Benoit Steiner | 2016-09-12 | |
| | | ||||
| * | CUDA contractions require arch >= 3.0: don't compile the cuda contraction tests on older architectures. | Benoit Steiner | 2016-09-12 | |
| | | ||||
| * | Added the ability to compute the absolute value of a complex number on GPU, as well as a test to catch the problem. | Benoit Steiner | 2016-09-12 | |
| | | ||||
| * | merge EulerAngles module | Gael Guennebaud | 2016-08-30 | |
|/| | ||||
* | | Deleted redundant regression test. | Benoit Steiner | 2016-08-03 | |
| | | ||||
* | | Added a test for fp16 | Benoit Steiner | 2016-08-03 | |
| | | ||||
* | | Move half unit test from unsupported to main tests | Gael Guennebaud | 2016-07-22 | |
| | | ||||
* | | Manually add -std=c++11 to nvcc for old cmake versions | Gael Guennebaud | 2016-07-12 | |
| | | ||||
* | | reverted unintended change. | Benoit Steiner | 2016-07-11 | |
| | | ||||
* | | Made the packetmath test compile again. A better fix would be to move the special function tests to the unsupported directory where the code now resides. | Benoit Steiner | 2016-07-11 | |
| | | ||||
* | | merge | Gael Guennebaud | 2016-07-11 | |
|\ \ | ||||
| * | | Fix configuration of CUDA: | Gael Guennebaud | 2016-07-11 | |
| | | | preserve user defined CUDA_NVCC_FLAGS; remove the -ansi flag that conflicts with -std=c++11; do not add -std=c++11 if already there | |||
* | | | bug #1232: refactor special functions as a new SpecialFunctions module, currently in unsupported/. | Gael Guennebaud | 2016-07-08 | |
|/ / | ||||
* | | Add missing CUDA kernel to tensor scan op | Igor Babuschkin | 2016-06-29 | |
| | | The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation. | |||
* | | Added a test to validate the tensor scan evaluation on GPU. The test is currently disabled since the code segfaults. | Benoit Steiner | 2016-06-27 | |
| | | ||||
* | | Disable the tensor tests when using msvc since older versions of the compiler fail to handle this code | Benoit Steiner | 2016-06-04 | |
| | | ||||
| * | Merged eigen/eigen into default | Tal Hadad | 2016-06-02 | |
| |\ | |/ |/| | ||||
* | | Add tensor scan op | Igor Babuschkin | 2016-06-02 | |
| | | This is the initial implementation of a generic scan operation. Based on this, cumsum and cumprod methods have been added to TensorBase. | |||
* | | Fixed the "option '--relaxed-constexpr' has been deprecated and replaced by option '--expt-relaxed-constexpr'" warning generated by nvcc 7.5 | Benoit Steiner | 2016-05-27 | |
| | | ||||
* | | Added a test to validate the new non blocking thread pool | Benoit Steiner | 2016-05-10 | |
| | | ||||
* | | Made several tensor tests compatible with cxx03 | Benoit Steiner | 2016-04-29 | |
| | | ||||
* | | Moved a number of tensor tests that don't require cxx11 to work properly outside the EIGEN_TEST_CXX11 test section | Benoit Steiner | 2016-04-29 | |
| | | ||||
* | | Made a couple of tensor tests compile without requiring c++11 support. | Benoit Steiner | 2016-04-29 | |
| | | ||||
* | | Enabled the new threadpool tests | Benoit Steiner | 2016-04-14 | |
| | | ||||
* | | Disabled the use of half2 on cuda devices of compute capability < 5.3 | Benoit Steiner | 2016-04-08 | |
| | | ||||
* | | Created the new EIGEN_TEST_CUDA_CLANG option to compile the CUDA tests using clang instead of nvcc | Benoit Steiner | 2016-04-08 | |
| | | ||||
* | | Renamed float16 into cxx11_float16 since the test relies on c++11 features | Benoit Steiner | 2016-04-07 | |
| | | ||||
* | | Added tests for float16 | Benoit Steiner | 2016-04-07 | |
| | | ||||
* | | Renamed the EIGEN_TEST_NVCC cmake option into EIGEN_TEST_CUDA per the discussion in bug #1173. | Benoit Steiner | 2016-04-06 | |
| | | ||||
* | | Added a test to verify that notifications are working properly | Benoit Steiner | 2016-03-23 | |
| | | ||||
* | | Filter some compilation flags that nvcc warns about. | Benoit Steiner | 2016-03-22 | |
| | | ||||
* | | Gate all the CUDA tests under the EIGEN_TEST_NVCC option | Benoit Steiner | 2016-03-18 | |
| | | ||||
* | | Fixed compilation warnings in the cuda tests | Benoit Steiner | 2016-03-18 | |
| | |