aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11
Commit message (Expand)AuthorAge
* [SYCL] This PR adds the minimum modifications to the Eigen unsupported module...Gravatar Mehdi Goli2019-06-28
* Remove extra comma (causes warnings in C++03)Gravatar Christoph Hertzberg2019-06-26
* Optimize evaluation strategy for TensorSlicingOp and TensorChippingOpGravatar Eugene Zhulenev2019-06-25
* Clean up CUDA/NVCC version macros and their use in Eigen, and a few other CUD...Gravatar Rasmus Munk Larsen2019-05-31
* Merged in rmlarsen/eigen (pull request PR-643)Gravatar Rasmus Larsen2019-05-20
|\
* | Prevent potential division by zero in TensorExecutorGravatar Eugene Zhulenev2019-05-17
* | Always evaluate Tensor expressions with broadcasting via tiled evaluation cod...Gravatar Eugene Zhulenev2019-05-16
| * Make Eigen build with cuda 10 and clang.Gravatar Rasmus Munk Larsen2019-05-15
|/
* A) fix deadlocks in thread pool caused by EventCountGravatar Rasmus Munk Larsen2019-05-08
* Fix stupid shadow-warnings (with old clang versions)Gravatar Christoph Hertzberg2019-05-07
* Restore C++03 compatibilityGravatar Christoph Hertzberg2019-05-07
* Check if gpu_assert was overridden in TensorGpuHipCudaDefinesGravatar Eugene Zhulenev2019-04-25
* Remove deprecation annotation from typedef Eigen::Index Index, as it would ge...Gravatar Rasmus Munk Larsen2019-04-24
* Add missing EIGEN_DEPRECATED annotations to deprecated functions and fix few ...Gravatar Eugene Zhulenev2019-04-23
* Adding lowlevel APIs for optimized RHS packet load in TensorFlowGravatar Anuj Rawat2019-04-20
* Tweak cost model for tensor contraction when parallelizing over the inner dim...Gravatar Rasmus Munk Larsen2019-04-12
* Update TheadPoolDevice example to include ThreadPool creation and passing poi...Gravatar Jonathon Koyle2019-04-10
* adding EIGEN_DEVICE_FUNC to the recently added TensorContractionKernel constr...Gravatar Deven Desai2019-04-08
* Add missing semicolonGravatar Eugene Zhulenev2019-04-02
* Add support for custom packed Lhs/Rhs blocks in tensor contractionsGravatar Eugene Zhulenev2019-04-01
* Merged eigen/eigen into defaultGravatar Deven Desai2019-03-19
|\
| * Fix segfaults with cuda compilationGravatar Eugene Zhulenev2019-03-11
| * Fix a bug in TensorGenerator for 1d tensorsGravatar Eugene Zhulenev2019-03-11
| * Fix a data race in NonBlockingThreadPoolGravatar Eugene Zhulenev2019-03-11
| * Merge.Gravatar Rasmus Munk Larsen2019-03-06
| |\
| * | Add macro EIGEN_AVOID_THREAD_LOCAL to make it possible to manually disable th...Gravatar Rasmus Munk Larsen2019-03-06
| | * Fix placement of "#if defined(EIGEN_GPUCC)" guard region.Gravatar Rasmus Munk Larsen2019-03-06
| | |\
| | * | Fix placement of "#if defined(EIGEN_GPUCC)" guard region.Gravatar Rasmus Munk Larsen2019-03-06
| |/ /
| | * Add missing return to NonBlockingThreadPool::LocalStealGravatar Eugene Zhulenev2019-03-06
| | * Remove redundant steal loopGravatar Eugene Zhulenev2019-03-06
| |/
| * Check that inner block dimension is continuousGravatar Eugene Zhulenev2019-03-05
| * Block evaluation for TensorGeneratorOpGravatar Eugene Zhulenev2019-03-05
| * Tune tensor contraction threadpool heuristicsGravatar Eugene Zhulenev2019-03-05
| * Add an extra check for the RunQueue size estimateGravatar Eugene Zhulenev2019-03-05
| * Do not initialize invalid fast_strides in TensorGeneratorOpGravatar Eugene Zhulenev2019-03-04
| * Add tiled evaluation for TensorForcedEvalOpGravatar Eugene Zhulenev2019-03-04
| * Use fast divisors in TensorGeneratorOpGravatar Eugene Zhulenev2019-03-04
| * Fix specialization for conjugate on non-complex types in TensorBase.h.Gravatar Rasmus Munk Larsen2019-03-01
| * Improve EventCount used by the non-blocking threadpool.Gravatar Rasmus Munk Larsen2019-02-22
| * Fix incorrect value of NumDimensions in TensorContraction traits.Gravatar Rasmus Munk Larsen2019-02-19
| * Merged in ezhulenev/eigen-01 (pull request PR-590)Gravatar Rasmus Larsen2019-02-14
| |\
| * | Fix signed-unsigned return in RuqQueueGravatar Eugene Zhulenev2019-02-14
| * | Fix signed-unsigned comparison warning in RunQueueGravatar Eugene Zhulenev2019-02-14
| | * Do not generate no-op cast() and conjugate() expressionsGravatar Eugene Zhulenev2019-02-14
| |/
| * Speedup Tensor ThreadPool RunQueu::Empty()Gravatar Eugene Zhulenev2019-02-13
| * Add PacketConv implementation for non-vectorizable src expressionsGravatar Eugene Zhulenev2019-02-08
| * Optimize TensorConversion evaluator: do not convert same typeGravatar Eugene Zhulenev2019-02-08
| * Don't do parallel_pack if we can use thread_local memory in tensor contractionsGravatar Eugene Zhulenev2019-02-07
| * Do not reduce parallelism too much in contractions with small number of threadsGravatar Eugene Zhulenev2019-02-04
| * Parallelize tensor contraction only by sharding dimension and use 'thread-loc...Gravatar Eugene Zhulenev2019-02-04