Commit message (Collapse) | Author | Age | ||
---|---|---|---|---|
... | ||||
* | Removed unecesasry #include | Benoit Steiner | 2017-10-22 | |
| | ||||
* | Merged in infinitei/eigen (pull request PR-328) | Gael Guennebaud | 2017-09-06 | |
|\ | | | | | | | | | | | | | bug #1464 : Fixes construction of EulerAngles from 3D vector expression. Approved-by: Tal Hadad <tal_hd@hotmail.com> Approved-by: Abhijit Kundu <abhijit.kundu@gatech.edu> | |||
* | | Added support for CUDA 9.0. | Benoit Steiner | 2017-08-31 | |
| | | ||||
| * | bug #1464 : Fixes construction of EulerAngles from 3D vector expression. | Abhijit Kundu | 2017-08-30 | |
|/ | ||||
* | Handle min/max/inf/etc issue in cuda_fp16.h directly in test/main.h | Gael Guennebaud | 2017-08-24 | |
| | ||||
* | bug #1462: remove all occurences of the deprecated __CUDACC_VER__ macro by ↵ | Gael Guennebaud | 2017-08-24 | |
| | | | | introducing EIGEN_CUDACC_VER | |||
* | Merged in benoitsteiner/opencl (pull request PR-323) | Benoit Steiner | 2017-07-07 | |
|\ | | | | | | | Improved support for OpenCL | |||
* | | Merged in tntnatbry/eigen (pull request PR-319) | Benoit Steiner | 2017-07-07 | |
| | | | | | | | | Tensor Trace op | |||
| * | Merged in mehdi_goli/upstr_benoit/TensorSYCLImageVolumePatchFixed (pull ↵ | Benoit Steiner | 2017-07-06 | |
|/ | | | | | | | | | | | | request PR-14) Applying Benoit's comment for Fixing ImageVolumePatch. * Applying Benoit's comment for Fixing ImageVolumePatch. Fixing conflict on cmake file. * Fixing dealocation of the memory in ImagePatch test for SYCL. * Fixing the automerge issue. | |||
* | Merged in benoitsteiner/opencl (pull request PR-318) | Benoit Steiner | 2017-06-13 | |
|\ | | | | | | | Improved support for OpenCL | |||
* | | fix compilation in C++98 | Gael Guennebaud | 2017-06-09 | |
| | | ||||
| * | Merged eigen/eigen into default | Benoit Steiner | 2017-05-26 | |
| |\ | ||||
| * \ | Merge changed from upstream | a-doumoulakis | 2017-05-24 | |
| |\ \ | ||||
* | | | | Specializing numeric_limits For AutoDiffScalar | Mmanu Chaturvedi | 2017-05-23 | |
| |_|/ |/| | | ||||
| | * | Fixing Cmake Dependency for SYCL | Mehdi Goli | 2017-05-22 | |
| | | | ||||
| * | | Add support for triSYCL | a-doumoulakis | 2017-05-05 | |
| |/ | | | | | | | | | | | Eigen is now able to use triSYCL with EIGEN_SYCL_TRISYCL and TRISYCL_INCLUDE_DIR options Fix contraction kernel with correct nd_item dimension | |||
* / | Use scalar_sum_op and scalar_quotient_op instead of operator+ and operator/ ↵ | RJ Ryan | 2017-04-14 | |
|/ | | | | | | | | | | in MeanReducer. Improves support for std::complex types when compiling for CUDA. Expands on e2e9cdd16970914cf0a892fea5e7c4402b3ede41 and 2bda1b0d93fb627d0c500ec48b20302d44c32cb7 . | |||
* | Preserve file naming conventions | Benoit Steiner | 2017-04-04 | |
| | ||||
* | Fixing TensorArgMaxSycl.h; Removing warning related to the hardcoded type of ↵ | Mehdi Goli | 2017-03-28 | |
| | | | | dims to be int in Argmax. | |||
* | Merged eigen/eigen into default | Benoit Steiner | 2017-03-15 | |
|\ | ||||
* | | Temporary: Disables cxx11_tensor_argmax_sycl test since it is causing zombie ↵ | Luke Iwanski | 2017-03-15 | |
| | | | | | | | | thread | |||
| * | Make the non-blocking threadpool more flexible and less wasteful of CPU ↵ | Rasmus Munk Larsen | 2017-03-09 | |
| | | | | | | | | | | | | | | | | | | | | | | | | cycles for high-latency use-cases. * Adds a hint to ThreadPool allowing us to turn off spin waiting. Currently each reader and record yielder op in a graph creates a threadpool with a thread that spins for 1000 iterations through the work stealing loop before yielding. This is wasteful for such ops that process I/O. * This also changes the number of iterations through the steal loop to be inversely proportional to the number of threads. Since the time of each iteration is proportional to the number of threads, this yields roughly a constant spin time. * Implement a separate worker loop for the num_threads == 1 case since there is no point in going through the expensive steal loop. Moreover, since Steal() calls PopBack() on the victim queues it might reverse the order in which ops are executed, compared to the order in which they are scheduled, which is usually counter-productive for the types of I/O workloads the single thread pools tend to be used for. * Store num_threads in a member variable for simplicity and to avoid a data race between the thread creation loop and worker threads calling threads_.size(). | |||
* | | Adding TensorIndexTuple and TensorTupleReduceOP backend (ArgMax/Min) for ↵ | Mehdi Goli | 2017-03-07 | |
| | | | | | | | | sycl; fixing the address space issue for const TensorMap; converting all discard_write to write due to data missmatch. | |||
* | | Adding sycl backend for TensorCustomOp; fixing the partial lhs modification ↵ | Mehdi Goli | 2017-02-28 | |
| | | | | | | | | issue on sycl when the rhs is TensorContraction, reduction or convolution; Fixing the partial modification for memset when sycl backend is used. | |||
* | | Adding TensorVolumePatchOP.h for sycl | Mehdi Goli | 2017-02-24 | |
| | | ||||
* | | Adding Sycl Backend for TensorGenerator.h. | Mehdi Goli | 2017-02-22 | |
| | | ||||
* | | Reducing the number of warnings. | Mehdi Goli | 2017-02-21 | |
| | | ||||
* | | Adding Sycl backend for TensorImagePatchOP.h; adding Sycl backend for ↵ | Mehdi Goli | 2017-02-20 | |
| | | | | | | | | TensorInflation.h. | |||
* | | Adding TensorLayoutSwapOp for sycl. | Mehdi Goli | 2017-02-15 | |
| | | ||||
* | | Adding TensorPatch.h for sycl backend. | Mehdi Goli | 2017-02-15 | |
|/ | ||||
* | Adding TensorChippingOP for sycl backend; fixing the index value in the ↵ | Mehdi Goli | 2017-02-13 | |
| | | | | verification operation for cxx11_tensorChipping.cpp test | |||
* | Adding mean to TensorReductionSycl.h | Mehdi Goli | 2017-02-07 | |
| | ||||
* | Fixing TensorReductionSycl for min and max. | Mehdi Goli | 2017-02-06 | |
| | ||||
* | Converting ptrdiff_t type to int64_t type in cxx11_tensor_contract_sycl.cpp ↵ | Mehdi Goli | 2017-02-01 | |
| | | | | in order to be the same as other tests. | |||
* | Reducing warnings in Sycl backend. | Mehdi Goli | 2017-02-01 | |
| | ||||
* | Merge latest changes from upstream | Benoit Steiner | 2017-01-30 | |
|\ | ||||
| * | Make NaN propagatation consistent between the pmax/pmin and ↵ | Rasmus Munk Larsen | 2017-01-24 | |
| | | | | | | | | | | | | std::max/std::min. This makes the NaN propagation consistent between the scalar and vectorized code paths of Eigen's scalar_max_op and scalar_min_op. See #1373 for details. | |||
* | | Adding non-deferrenciable pointer track for ComputeCpp backend; Adding ↵ | Mehdi Goli | 2017-01-19 | |
| | | | | | | | | TensorConvolutionOp for ComputeCpp; fixing typos. modifying TensorDeviceSycl to use the LegacyPointer class. | |||
* | | Adding Tensor ReverseOp; TensorStriding; TensorConversionOp; Modifying ↵ | Mehdi Goli | 2017-01-16 | |
| | | | | | | | | Tensor Contractsycl to be located in any place in the expression tree. | |||
| * | Simplified the way we link libxsmm | Benoit Steiner | 2016-12-21 | |
| | | ||||
| * | Added support for libxsmm in the eigen makefiles | Benoit Steiner | 2016-12-21 | |
|/ | ||||
* | Merged eigen/eigen into default | Benoit Steiner | 2016-12-20 | |
|\ | ||||
| * | Properly adjust precision when saving to Market format. | Gael Guennebaud | 2016-12-20 | |
| | | ||||
| * | Speed up parsing of sparse Market file. | Gael Guennebaud | 2016-12-20 | |
| | | ||||
* | | Added an OpenCL regression test | Benoit Steiner | 2016-12-19 | |
| | | ||||
| * | Fixed race condition in the tensor_shuffling_sycl test | Benoit Steiner | 2016-12-19 | |
|/ | ||||
* | Converting all parallel for lambda to functor in order to prevent kernel ↵ | Mehdi Goli | 2016-12-16 | |
| | | | | duplication name error; adding tensorConcatinationOp backend for sycl. | |||
* | Merged eigen/eigen into default | Benoit Steiner | 2016-12-14 | |
|\ | ||||
* | | Adding asynchronous execution as it improves the performance. | Mehdi Goli | 2016-12-14 | |
| | | ||||
* | | Adding tensor contraction operation backend for Sycl; adding test for ↵ | Mehdi Goli | 2016-12-14 | |
| | | | | | | | | contractionOp sycl backend; adding temporary solution to prevent memory leak in buffer; cleaning up cxx11_tensor_buildins_sycl.h |