Commit message | Author | Age | |
---|---|---|---|
* | Making default numeric_list work with sycl. | 2016-12-02 | |
| | |||
* | Adding sycl backend for TensorPadding.h; disabling __uint128 for sycl in TensorIntDiv.h; disabling cache size for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code. | 2016-12-01 | |
* | Cleaned up forward declarations | 2016-11-30 | |
| | |||
* | Moved the MemCopyFunctor back to TensorSyclDevice since it's the only caller and it makes TensorFlow compile again | 2016-11-30 | |
* | Added the deallocate_all() method back | 2016-11-30 | |
| | |||
* | Simplified includes | 2016-11-30 | |
| | |||
* | Improved formatting | 2016-11-30 | |
| | |||
* | Updated the Sizes class to work on AMD gpus without requiring a separate implementation | 2016-11-30 | |
* | Added an implementation of numeric_list that works with sycl | 2016-11-30 | |
| | |||
* | Added EIGEN_STRONG_INLINE to get_sycl_supported_device(). | 2016-11-30 | |
| | |||
* | Adding TensorShuffling backend for sycl; adding TensorReshaping backend for sycl; cleaning up the sycl backend. | 2016-11-29 | |
* | Merged eigen/eigen into default | 2016-11-27 | |
|\ | |||
| * | Fixed compilation warnings | 2016-11-26 | |
| | | |||
* | | Merged eigen/eigen into default | 2016-11-26 | |
|\| | |||
| * | Avoided unnecessary type conversion | 2016-11-26 | |
| | | |||
| * | Added missing array_get method for numeric_list | 2016-11-26 | |
| | | |||
* | | Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h. | 2016-11-25 | |
| * | Fixed the documentation of Scalar Tensors | 2016-11-24 | |
| | | |||
* | | Removing unsupported device from test case; cleaning the tensor device sycl. | 2016-11-23 | |
|/ | |||
* | Made the QueueInterface thread safe | 2016-11-20 | |
| | |||
* | Merged eigen/eigen into default | 2016-11-20 | |
|\ | |||
| * | Fixed a typo | 2016-11-20 | |
| | | |||
| * | Fixed merge conflicts | 2016-11-19 | |
| | | |||
* | | Made it possible to check the state of a sycl device without synchronization | 2016-11-19 | |
| | | |||
| * | Added missing include | 2016-11-19 | |
|/ | |||
* | Merged in benoitsteiner/opencl (pull request PR-253) | 2016-11-19 | |
|\ | | | | | | | OpenCL improvements | ||
| * | Added the ability to run tests exclusively on OpenCL devices that are listed by sycl::device::get_devices(). | 2016-11-18 | |
* | | Deleted unnecessary semicolons | 2016-11-18 | |
| | | |||
| * | Cleaned up the sycl device code | 2016-11-18 | |
| | | |||
| * | Adding Benoit's changes to TensorDeviceSycl.h | 2016-11-18 | |
| | | |||
| * | Modifying TensorDeviceSycl.h to always create buffers of type uint8_t and convert them to the actual type at execution on the device; adding the queue interface class to separate the lifespan of the sycl queue, and the buffers created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all different devices that are supported by Sycl{CPU; GPU; and Host}. | 2016-11-18 | |
| * | Merged eigen/eigen into default | 2016-11-17 | |
| |\ | |/ |/| | |||
| * | Added a way to detect errors generated by the opencl device from the host | 2016-11-17 | |
| | | |||
| * | Cleanup | 2016-11-17 | |
| | | |||
| * | Created a test to check that the sycl runtime can successfully report errors (like division by 0). Small cleanup | 2016-11-17 | |
* | | Made TensorDeviceCuda.h compile on windows | 2016-11-17 | |
| | | |||
| * | Specialised basic math functions for SYCL device. | 2016-11-17 | |
|/ | |||
* | Merged eigen/eigen into default | 2016-11-14 | |
|\ | |||
| * | Reduce dispatch overhead in parallelFor by only calling thread_pool.Schedule() for one of the two recursive calls in handleRange. This avoids going through the schedule path to push both recursive calls onto another thread-queue in the binary tree, but instead executes one of them on the main thread. At the leaf level this will still activate a full complement of threads, but will save up to 50% of the overhead in Schedule (random number generation, insertion in queue which includes signaling via atomics). | 2016-11-14 | |
* | | Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl. | 2016-11-14 | |
| | | |||
* | | Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing. | 2016-11-14 | |
| | | |||
* | | Adding comment to TensorDeviceSycl.h and cleaning the code. | 2016-11-11 | |
|/ | |||
* | Adding EIGEN_STRONG_INLINE back; using size() instead of dimensions.TotalSize() on Tensor. | 2016-11-10 | |
* | adding the missing in eigen_assert! | 2016-11-10 | |
| | |||
* | Adding Memset; optimising MemcpyDeviceToHost by removing double copying; | 2016-11-10 | |
| | |||
* | Fixed the formatting of the code | 2016-11-08 | |
| | |||
* | #if EIGEN_EXCEPTION -> #ifdef EIGEN_EXCEPTIONS. | 2016-11-08 | |
| | |||
* | Fix for SYCL queue initialisation. | 2016-11-08 | |
| | |||
* | Use try/catch only when exceptions are enabled. | 2016-11-08 | |
| | |||
* | Converting all sycl buffers to uninitialised device only buffers; adding memcpyHostToDevice and memcpyDeviceToHost on syclDevice; modifying all examples to obey the new rules; moving sycl queue creation to the device based on Benoit's suggestion; removing the sycl specific condition for returning m_result in TensorReduction.h according to Benoit's suggestion. | 2016-11-08 | |