Commit message | Author | Age | |
---|---|---|---|
* | Merged in benoitsteiner/opencl (pull request PR-253) | Benoit Steiner | 2016-11-19 |
|\ | | | | | | | OpenCL improvements | ||
| * | Code cleanup | Benoit Steiner | 2016-11-18 |
| | | |||
| * | Updated cxx11_tensor_device_sycl to run only on the OpenCL devices available ↵ | Benoit Steiner | 2016-11-18 |
| | | | | | | | | on the host | ||
| * | Only runs the cxx11_tensor_reduction_sycl on devices that are available. | Benoit Steiner | 2016-11-18 |
| | | |||
| * | Added the ability to run tests exclusively on OpenCL devices that are listed by ↵ | Benoit Steiner | 2016-11-18 |
| | | | | | | | | sycl::device::get_devices(). | ||
* | | Enable EIGEN_HAS_C99_MATH when building with the latest version of Visual Studio | Benoit Steiner | 2016-11-18 |
| | | |||
* | | Deleted unnecessary semicolons | Benoit Steiner | 2016-11-18 |
| | | |||
| * | Test broadcasting on OpenCL devices with 64 bit indexing | Benoit Steiner | 2016-11-18 |
| | | |||
* | | Merged in ChunW/eigen (pull request PR-252) | Gael Guennebaud | 2016-11-18 |
|\ \ | | | | | | | | | | Workaround for error in VS2012 with /clr | ||
| | * | Cleaned up the sycl device code | Benoit Steiner | 2016-11-18 |
| | | | |||
| | * | Fixed the cxx11_tensor_device_sycl test | Benoit Steiner | 2016-11-18 |
| | | | |||
| | * | Adding Benoit's changes to TensorDeviceSycl.h | Mehdi Goli | 2016-11-18 |
| | | | |||
| | * | Modifying TensorDeviceSycl.h to always create buffer of type uint8_t and ↵ | Mehdi Goli | 2016-11-18 |
| | | | | | | | | | | | | convert them to the actual type at execution on the device; adding the queue interface class to separate the lifespan of the sycl queue, and of the buffers created for that queue, from Eigen::SyclDevice; modifying sycl tests to support the evaluation of the results for both row major and column major data layout on all devices supported by SYCL (CPU, GPU, and host). | ||
| | * | Added isnan, isfinite and isinf for SYCL device. Plus test for that. | Luke Iwanski | 2016-11-18 |
| | | | |||
| | * | Now testing out (+=, =) in.FUNC() and out (+=, =) out.FUNC() | Luke Iwanski | 2016-11-18 |
| | | | |||
* | | | bug #1343: fix compilation regression in mat+=selfadjoint_view. | Gael Guennebaud | 2016-11-18 |
| | | | | | | | | | | | | Generic EigenBase2EigenBase assignment was incomplete. | ||
* | | | bug #1343: fix compilation regression in array = matrix_product | Gael Guennebaud | 2016-11-18 |
| | | | |||
| | * | Merged eigen/eigen into default | Benoit Steiner | 2016-11-17 |
| | |\ | |_|/ |/| | | |||
| | * | Added a way to detect errors generated by the opencl device from the host | Benoit Steiner | 2016-11-17 |
| | | | |||
| | * | Cleanup | Benoit Steiner | 2016-11-17 |
| | | | |||
| | * | Created a test to check that the sycl runtime can successfully report errors ↵ | Benoit Steiner | 2016-11-17 |
| | | | | | | | | | | | | | | | | | | (like division by 0). Small cleanup | ||
* | | | Made TensorDeviceCuda.h compile on windows | Benoit Steiner | 2016-11-17 |
| | | | |||
| * | | Workaround for error in VS2012 with /clr | Chun Wang | 2016-11-17 |
|/ / | |||
| * | Avoid calling log(0) or 1/0 | Benoit Steiner | 2016-11-17 |
| | | |||
* | | replace sizeof(Packet) with PacketSize else it breaks for ZVector.Packet4f | Konstantinos Margaritis | 2016-11-17 |
| | | |||
* | | implement float/std::complex<float> for ZVector as well, minor fixes to ZVector | Konstantinos Margaritis | 2016-11-17 |
| | | |||
| * | Fixed existing test. | Luke Iwanski | 2016-11-17 |
| | | |||
| * | Specialised basic math functions for SYCL device. | Luke Iwanski | 2016-11-17 |
| | | |||
* | | Enable the use of AVX512 instructions by default | Benoit Steiner | 2016-11-16 |
| | | |||
* | | bump default branch to 3.3.90 | Gael Guennebaud | 2016-11-16 |
| | | |||
* | | Optimized the computation of exp, sqrt, ceil and floor for fp16 on Pascal GPUs | Benoit Steiner | 2016-11-16 |
|/ | |||
* | Merged eigen/eigen into default | Benoit Steiner | 2016-11-14 |
|\ | |||
| * | Reduce dispatch overhead in parallelFor by only calling ↵ | Rasmus Munk Larsen | 2016-11-14 |
| | | | | | | | | thread_pool.Schedule() for one of the two recursive calls in handleRange. This avoids going through the schedule path to push both recursive calls onto another thread-queue in the binary tree, but instead executes one of them on the main thread. At the leaf level this will still activate a full complement of threads, but will save up to 50% of the overhead in Schedule (random number generation, insertion in queue which includes signaling via atomics). | ||
* | | Adding extra test for non-fixed size to broadcast; Replacing stcl with sycl. | Mehdi Goli | 2016-11-14 |
| | | |||
* | | Adding TensorFixsize; adding sycl device memcpy; adding initial stage of slicing. | Mehdi Goli | 2016-11-14 |
| | | |||
| * | Optimize sparse<bool> && sparse<bool> to use the same path as for coeff-wise ↵ | Gael Guennebaud | 2016-11-14 |
| | | | | | | | | products. | ||
| * | bug #426: move operator && and || to MatrixBase and SparseMatrixBase. | Gael Guennebaud | 2016-11-14 |
| | | |||
| * | Merged in olesalscheider/eigen (pull request PR-248) | Gael Guennebaud | 2016-11-14 |
| |\ | | | | | | | | | | Make sure not to call numext::maxi on expression templates | ||
| * | | Fix regression in SparseMatrix::ReverseInnerIterator | Gael Guennebaud | 2016-11-14 |
| | | | |||
| | * | Make sure not to call numext::maxi on expression templates | Niels Ole Salscheider | 2016-11-12 |
| |/ | |||
* | | Adding comment to TensorDeviceSycl.h and cleaning the code. | Mehdi Goli | 2016-11-11 |
| | | |||
| * | Merged in benoitsteiner/opencl (pull request PR-247) | Benoit Steiner | 2016-11-11 |
| |\ | |/ |/| | |||
* | | Adding EIGEN_STRONG_INLINE back; using size() instead of ↵ | Mehdi Goli | 2016-11-10 |
| | | | | | | | | dimensions.TotalSize() on Tensor. | ||
* | | adding the missing in eigen_assert! | Mehdi Goli | 2016-11-10 |
| | | |||
* | | Adding Memset; optimising MemcpyDeviceToHost by removing double copying | Mehdi Goli | 2016-11-10 |
| | | |||
| * | bump to 3.3.0 | Gael Guennebaud | 2016-11-10 |
| | | |||
| * | Fix printing of sparse expressions | Gael Guennebaud | 2016-11-10 |
| | | |||
* | | Added a test to validate memory transfers between host and sycl device | Benoit Steiner | 2016-11-09 |
|/ | |||
* | Merged in benoitsteiner/opencl (pull request PR-246) | Benoit Steiner | 2016-11-08 |
|\ | | | | | | | Improved support for OpenCL | ||
| * | Fixed the formatting of the code | Benoit Steiner | 2016-11-08 |
| | |