Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for ↵ | 2016-11-25 | |
| | | | | tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h. | ||
* | Removing unsupported device from test case; cleaning the tensor device sycl. | 2016-11-23 | |
| | |||
* | Made the QueueInterface thread safe | 2016-11-20 | |
| | |||
* | Merged eigen/eigen into default | 2016-11-20 | |
|\ | |||
| * | Fixed a typo | 2016-11-20 | |
| | | |||
| * | Fixed merge conflicts | 2016-11-19 | |
| | | |||
* | | Made it possible to check the state of a sycl device without synchronization | 2016-11-19 | |
| | | |||
| * | Added missing include | 2016-11-19 | |
|/ | |||
* | Merged in benoitsteiner/opencl (pull request PR-253) | 2016-11-19 | |
|\ | | | | | | | OpenCL improvements | ||
| * | Added the ability to run test exclusively OpenCL devices that are listed by ↵ | 2016-11-18 | |
| | | | | | | | | sycl::device::get_devices(). | ||
* | | Deleted unnecessary semicolons | 2016-11-18 | |
| | | |||
| * | Cleaned up the sycl device code | 2016-11-18 | |
| | | |||
| * | adding Benoit changes on the TensorDeviceSycl.h | 2016-11-18 | |
| | | |||
| * | Modifying TensorDeviceSycl.h to always create buffer of type uint8_t and ↵ | 2016-11-18 | |
| | | | | | | | | convert them to the actual type at the execution on the device; adding the queue interface class to separate the lifespan of sycl queue and buffers,created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all different devices that are supported by Sycl{CPU; GPU; and Host}. | ||
| * | Merged eigen/eigen into default | 2016-11-17 | |
| |\ | |/ |/| | |||
| * | Added a way to detect errors generated by the opencl device from the host | 2016-11-17 | |
| | | |||
| * | Cleanup | 2016-11-17 | |
| | | |||
| * | Created a test to check that the sycl runtime can successfully report errors ↵ | 2016-11-17 | |
| | | | | | | | | | | | | (like ivision by 0). Small cleanup | ||
* | | Made TensorDeviceCuda.h compile on windows | 2016-11-17 | |
| | | |||
| * | Specialised basic math functions for SYCL device. | 2016-11-17 | |
|/ | |||
* | Merged eigen/eigen into default | 2016-11-14 | |
|\ | |||
| * | Reduce dispatch overhead in parallelFor by only calling ↵ | 2016-11-14 | |
| | | | | | | | | thread_pool.Schedule() for one of the two recursive calls in handleRange. This avoids going through the scedule path to push both recursive calls onto another thread-queue in the binary tree, but instead executes one of them on the main thread. At the leaf level this will still activate a full complement of threads, but will save up to 50% of the overhead in Schedule (random number generation, insertion in queue which includes signaling via atomics). | ||
* | | Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl. | 2016-11-14 | |
| | | |||
* | | Adding TensorFixsize; adding sycl device memcpy; adding insial stage of slicing. | 2016-11-14 | |
| | | |||
* | | Adding comment to TensorDeviceSycl.h and cleaning the code. | 2016-11-11 | |
|/ | |||
* | Adding EIGEN_STRONG_INLINE back; using size() instead of ↵ | 2016-11-10 | |
| | | | | dimensions.TotalSize() on Tensor. | ||
* | adding the missing in eigen_assert! | 2016-11-10 | |
| | |||
* | Adding Memset; optimising MecopyDeviceToHost by removing double copying; | 2016-11-10 | |
| | |||
* | Fixed the formatting of the code | 2016-11-08 | |
| | |||
* | #if EIGEN_EXCEPTION -> #ifdef EIGEN_EXCEPTIONS. | 2016-11-08 | |
| | |||
* | Fix for SYCL queue initialisation. | 2016-11-08 | |
| | |||
* | Use try/catch only when exceptions are enabled. | 2016-11-08 | |
| | |||
* | Converting all sycl buffers to uninitialised device only buffers; adding ↵ | 2016-11-08 | |
| | | | | memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creating to the device based on Benoit suggestion; removing the sycl specefic condition for returning m_result in TensorReduction.h according to Benoit suggestion. | ||
* | Added missing includes | 2016-11-05 | |
| | |||
* | Removed the sycl include from Eigen/Core and moved it to ↵ | 2016-11-04 | |
| | | | | Unsupported/Eigen/CXX11/Tensor; added TensorReduction for sycl (full reduction and partial reduction); added TensorReduction test case for sycl (full reduction and partial reduction); fixed the tile size on TensorSyclRun.h based on the device max work group size; | ||
* | Disable vectorization on device only when compiling for sycl | 2016-11-02 | |
| | |||
* | Fixed the ambiguity in callig make_tuple for sycl backend. | 2016-10-31 | |
| | |||
* | Worked around Visual Studio compilation errors | 2016-10-28 | |
| | |||
* | Added missing template parameters | 2016-10-28 | |
| | |||
* | Workaround MSVC issue. | 2016-10-27 | |
| | |||
* | Removed a template parameter for fixed sized tensors | 2016-10-26 | |
| | |||
* | Replaced tabs with spaces | 2016-10-25 | |
| | |||
* | Code cleanup | 2016-10-25 | |
| | |||
* | Merge latest updates from trunk | 2016-10-20 | |
|\ | |||
* | | Added SYCL include in Tensor. | 2016-10-20 | |
| | | |||
| * | Fixed a few typos in the ternary tensor expressions types | 2016-10-19 | |
| | | |||
* | | Fixing the code indentation in the TensorReduction.h file. | 2016-10-14 | |
| | | |||
* | | Merged ComputeCpp into default. | 2016-10-14 | |
|\ \ | |||
| * | | Reducing the code by generalising sycl backend functions/structs. | 2016-10-14 | |
| | | | |||
* | | | Merged eigen/eigen into default | 2016-10-12 | |
|\ \ \ | | |/ | |/| |