Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for ↵ | 2016-11-25 | |
| | | | | tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h. | ||
* | Removing unsupported device from test case; cleaning the tensor device sycl. | 2016-11-23 | |
| | |||
* | Added test for cwiseMin, cwiseMax and operator%. | 2016-11-19 | |
| | |||
* | Code cleanup | 2016-11-18 | |
| | |||
* | Updated cxx11_tensor_device_sycl to run only on the OpenCL devices available ↵ | 2016-11-18 | |
| | | | | on the host | ||
* | Only runs the cxx11_tensor_reduction_sycl on devices that are available. | 2016-11-18 | |
| | |||
* | Added the ability to run test exclusively OpenCL devices that are listed by ↵ | 2016-11-18 | |
| | | | | sycl::device::get_devices(). | ||
* | Test broadcasting on OpenCL devices with 64 bit indexing | 2016-11-18 | |
| | |||
* | Fixed the cxx11_tensor_device_sycl test | 2016-11-18 | |
| | |||
* | adding Benoit changes on the TensorDeviceSycl.h | 2016-11-18 | |
| | |||
* | Modifying TensorDeviceSycl.h to always create buffer of type uint8_t and ↵ | 2016-11-18 | |
| | | | | convert them to the actual type at the execution on the device; adding the queue interface class to separate the lifespan of sycl queue and buffers,created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all different devices that are supported by Sycl{CPU; GPU; and Host}. | ||
* | Added isnan, isfinite and isinf for SYCL device. Plus test for that. | 2016-11-18 | |
| | |||
* | Now testing out (+=, =) in.FUNC() and out (+=, =) out.FUNC() | 2016-11-18 | |
| | |||
* | Added a way to detect errors generated by the opencl device from the host | 2016-11-17 | |
| | |||
* | Created a test to check that the sycl runtime can successfully report errors ↵ | 2016-11-17 | |
| | | | | | | (like ivision by 0). Small cleanup | ||
* | Avoid calling log(0) or 1/0 | 2016-11-17 | |
| | |||
* | Fixed existing test. | 2016-11-17 | |
| | |||
* | Specialised basic math functions for SYCL device. | 2016-11-17 | |
| | |||
* | Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl. | 2016-11-14 | |
| | |||
* | Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing. | 2016-11-14 | |
| | |||
* | Adding EIGEN_STRONG_INLINE back; using size() instead of ↵ | 2016-11-10 | |
| | | | | dimensions.TotalSize() on Tensor. | ||
* | Adding Memset; optimising MecopyDeviceToHost by removing double copying; | 2016-11-10 | |
| | |||
* | Added a test to validate memory transfers between host and sycl device | 2016-11-09 | |
| | |||
* | Merged in benoitsteiner/opencl (pull request PR-246) | 2016-11-08 | |
|\ | | | | | | | Improved support for OpenCL | ||
| * | Converting all sycl buffers to uninitialised device only buffers; adding ↵ | 2016-11-08 | |
| | | | | | | | | memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creating to the device based on Benoit suggestion; removing the sycl specefic condition for returning m_result in TensorReduction.h according to Benoit suggestion. | ||
* | | Removed unnecessary statement | 2016-11-05 | |
| | | |||
* | | Extend mpreal unit test to check LLT with complexes. | 2016-11-05 | |
| | | |||
| * | Removed the sycl include from Eigen/Core and moved it to ↵ | 2016-11-04 | |
|/ | | | | Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size; | ||
* | Sharded the tensor_image_patch test to help it run on low power devices | 2016-10-27 | |
| | |||
* | Fixed a few more compilation warnings | 2016-10-28 | |
| | |||
* | Fixed a compilation warning | 2016-10-28 | |
| | |||
* | Fixed compilation warning | 2016-10-28 | |
| | |||
* | Merge latest updates from trunk | 2016-10-20 | |
|\ | |||
* \ | Merged ComputeCpp into default. | 2016-10-14 | |
|\ \ | |||
| * | | Reducing the code by generalising sycl backend functions/structs. | 2016-10-14 | |
| | | | |||
| | * | Relaxed the resizing checks so that they don't fail with gcc >= 5.3 | 2016-10-13 | |
| | | | |||
* | | | Merged eigen/eigen into default | 2016-10-12 | |
|\ \ \ | | |/ | |/| | |||
| * | | Cleaned up a regression test | 2016-10-08 | |
| | | | |||
* | | | Merge the content of the ComputeCpp branch into the default branch | 2016-10-07 | |
|\ \ \ | | |/ | |/| | |||
| | * | Add a test that GPU complex product reductions match CPU reductions. | 2016-10-06 | |
| |/ |/| | |||
| * | Fixed a couple of compilation warnings | 2016-10-05 | |
| | | |||
| * | Pull the latest updates from trunk | 2016-10-05 | |
| |\ | |||
| * | | Fixed compilation warnings | 2016-10-05 | |
| | | | |||
* | | | Increased the robustness of the reduction tests on fp16 | 2016-10-05 | |
| | | | |||
* | | | Increase the tolerance to numerical noise. | 2016-10-05 | |
| |/ |/| | |||
* | | Improved support for compiling CUDA code with clang as the host compiler | 2016-10-03 | |
| | | |||
| * | Renamed the SYCL tests to follow the standard naming convention. | 2016-09-30 | |
| | | |||
* | | Updated the tensor sum and mean reducer to enable them to process complex ↵ | 2016-09-28 | |
| | | | | | | | | numbers on cuda gpus. | ||
* | | Don't use c++11 features and fix include. | 2016-09-20 | |
| | | |||
* | | Add CUDA-specific std::complex<T> specializations for scalar_sum_op, ↵ | 2016-09-20 | |
| | | | | | | | | scalar_difference_op, scalar_product_op, and scalar_quotient_op. |