path: root/unsupported/Eigen/CXX11/src/Tensor
Commit message [Author, Age]
* Fix bugs to make min- and max reducers work correctly with IEEE infinities. [Rasmus Munk Larsen, 2016-08-31]
|
* bug #1167: simplify installation of header files using cmake's install(DIRECTORY ...) command. [Gael Guennebaud, 2016-08-29]
|
* Add missing log1p method [Gael Guennebaud, 2016-08-26]
|
* Made the cost model cwiseMax and cwiseMin methods const to help the PowerPC cuda compiler compile this code. [Benoit Steiner, 2016-08-18]
|
* Force the inlining of a simple accessor. [Benoit Steiner, 2016-08-18]
|
* Merged in ibab/eigen/double-tensor-reduction (pull request PR-216) [Benoit Steiner, 2016-08-18]
|\  Enable efficient Tensor reduction for doubles on the GPU (continued)
| * Fix remaining CUDA >= 300 checks [Igor Babuschkin, 2016-08-18]
| |
| * Add the necessary CUDA >= 300 checks back [Igor Babuschkin, 2016-08-18]
| |
* | Properly detect the type of the result of a contraction. [Benoit Steiner, 2016-08-16]
| |
* | Use array_prod instead of calling TotalSize since TotalSize is only available on DSize. [Benoit Steiner, 2016-08-15]
| |
* | Fixed a bug in the documentation. [Benoit Steiner, 2016-08-12]
| |
* | Don't attempt to optimize partial reductions when the optimized implementation doesn't buy anything. [Benoit Steiner, 2016-08-08]
| |
| * Remove CUDA >= 300 checks and enable outer reduction for doubles [Igor Babuschkin, 2016-08-06]
| |
| * Merge upstream changes [Igor Babuschkin, 2016-08-05]
| |\
| |/
|/|
| * Make use of atomicExch for atomicExchCustom [Igor Babuschkin, 2016-08-05]
| |
* | Merged in ibab/eigen (pull request PR-206) [Benoit Steiner, 2016-08-03]
|\ \  Expose real and imag methods on Tensors
* | | CUDA_ARCH isn't always defined, so avoid relying on it too much when figuring out which implementation to use for reductions. Instead rely on the device to tell us on which hardware version we're running. [Benoit Steiner, 2016-08-03]
| | |
* | | Use numext::conj instead of std::conj [Benoit Steiner, 2016-08-01]
| | |
* | | Avoid unnecessary object copies [Benoit Steiner, 2016-08-01]
| | |
* | | bug #1266: half implementation has been moved to half_impl namespace [Benoit Steiner, 2016-07-29]
| | |
* | | Deleted dead code. [Benoit Steiner, 2016-07-25]
| | |
* | | bug #1255: comment out broken and unused line. [Gael Guennebaud, 2016-07-25]
| | |
* | | Add minimal support for Array<string>, and fix Tensor<string> [Gael Guennebaud, 2016-07-25]
| | |
* | | Improved partial reductions in more cases [Benoit Steiner, 2016-07-22]
| | |
* | | Fix CUDA compilation [Gael Guennebaud, 2016-07-21]
| | |
* | | An evalTo expression is only aligned iff both the lhs and the rhs are aligned. [Benoit Steiner, 2016-07-12]
| | |
* | | Improved the contraction mapper to properly support tensor products [Benoit Steiner, 2016-07-11]
| | |
* | | Improved the detection of packet size in the tensor scan evaluator. [Benoit Steiner, 2016-07-11]
| | |
* | | Fix assertion (it did not make sense for static_val types) [Gael Guennebaud, 2016-07-11]
| | |
* | | Emulate _BitScanReverse64 for 32-bit builds [Gael Guennebaud, 2016-07-11]
| | |
* | | Change runtime to compile-time conditional. [Gael Guennebaud, 2016-07-08]
| | |
* | | Fix warnings [Gael Guennebaud, 2016-07-08]
| | |
* | | Fix warning [Gael Guennebaud, 2016-07-07]
| | |
* | | fix clang compilation [Gael Guennebaud, 2016-07-04]
| | |
* | | Workaround compilation issue with msvc [Gael Guennebaud, 2016-07-04]
| | |
| | * Enable efficient Tensor reduction for doubles [Igor Babuschkin, 2016-07-01]
| |/
|/|
| * Expose real and imag methods on Tensors [Igor Babuschkin, 2016-07-01]
|/
* Made it possible to compile reductions for an old cuda architecture and run them on a recent gpu. [Benoit Steiner, 2016-06-29]
|
* Made the code compile when using CUDA architecture < 300 [Benoit Steiner, 2016-06-29]
|
* Add missing CUDA kernel to tensor scan op [Igor Babuschkin, 2016-06-29]
|  The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation.
* Don't store the scan axis in the evaluator of the tensor scan operation since it's only used in the constructor. Also avoid taking references to values that may become stale after a copy construction. [Benoit Steiner, 2016-06-27]
|
* Return -1 from CurrentThreadId when called by thread outside the pool. [Rasmus Munk Larsen, 2016-06-23]
|
* Resolve merge. [Rasmus Munk Larsen, 2016-06-23]
|\
| * bug #1241: does not emit anything for empty tensors [Gael Guennebaud, 2016-06-23]
| |
| * merge PR 194 [Gael Guennebaud, 2016-06-23]
| |\
| * | Handle empty tensors in the print functions [Benoit Steiner, 2016-06-21]
| | |
| * | Fixed the printing of rank-0 tensors [Benoit Steiner, 2016-06-20]
| | |
| * | Implement exclusive scan option [Igor Babuschkin, 2016-06-14]
| | |
| | * merge [Gael Guennebaud, 2016-06-14]
| | |\
| | |/
| |/|
| | * Update Tensor module to use bind1st_op and bind2nd_op [Gael Guennebaud, 2016-06-14]
| | |