Commit message | Author | Age | |
---|---|---|---|
* | Fix bugs to make min and max reducers work correctly with IEEE infinities. | 2016-08-31 | |
| | |||
* | bug #1167: simplify installation of header files using cmake's install(DIRECTORY ...) command. | 2016-08-29 | |
| | |||
* | Add missing log1p method | 2016-08-26 | |
| | |||
* | Made the cost model cwiseMax and cwiseMin methods const to help the PowerPC CUDA compiler compile this code. | 2016-08-18 | |
| | |||
* | Force the inlining of a simple accessor. | 2016-08-18 | |
| | |||
* | Merged in ibab/eigen/double-tensor-reduction (pull request PR-216) | 2016-08-18 | |
|\ | | | | | | | Enable efficient Tensor reduction for doubles on the GPU (continued) | ||
| * | Fix remaining CUDA >= 300 checks | 2016-08-18 | |
| | | |||
| * | Add the necessary CUDA >= 300 checks back | 2016-08-18 | |
| | | |||
* | | Properly detect the type of the result of a contraction. | 2016-08-16 | |
| | | |||
* | | Use array_prod instead of calling TotalSize since TotalSize is only available on DSize. | 2016-08-15 | |
| | | |||
* | | Fixed a bug in the documentation. | 2016-08-12 | |
| | | |||
* | | Don't attempt to optimize partial reductions when the optimized implementation doesn't buy anything. | 2016-08-08 | |
| | | |||
| * | Remove CUDA >= 300 checks and enable outer reduction for doubles | 2016-08-06 | |
| | | |||
| * | Merge upstream changes | 2016-08-05 | |
| |\ | |/ |/| | |||
| * | Make use of atomicExch for atomicExchCustom | 2016-08-05 | |
| | | |||
* | | Merged in ibab/eigen (pull request PR-206) | 2016-08-03 | |
|\ \ | | | | | | | | | | Expose real and imag methods on Tensors | ||
* | | | CUDA_ARCH isn't always defined, so avoid relying on it too much when figuring out which implementation to use for reductions. Instead rely on the device to tell us on which hardware version we're running. | 2016-08-03 | |
| | | | |||
* | | | Use numext::conj instead of std::conj | 2016-08-01 | |
| | | | |||
* | | | Avoid unnecessary object copies | 2016-08-01 | |
| | | | |||
* | | | bug #1266: half implementation has been moved to half_impl namespace | 2016-07-29 | |
| | | | |||
* | | | Deleted dead code. | 2016-07-25 | |
| | | | |||
* | | | bug #1255: comment out broken and unused line. | 2016-07-25 | |
| | | | |||
* | | | Add minimal support for Array<string>, and fix Tensor<string> | 2016-07-25 | |
| | | | |||
* | | | Improved partial reductions in more cases | 2016-07-22 | |
| | | | |||
* | | | Fix CUDA compilation | 2016-07-21 | |
| | | | |||
* | | | An evalTo expression is aligned iff both the lhs and the rhs are aligned. | 2016-07-12 | |
| | | | |||
* | | | Improved the contraction mapper to properly support tensor products | 2016-07-11 | |
| | | | |||
* | | | Improved the detection of packet size in the tensor scan evaluator. | 2016-07-11 | |
| | | | |||
* | | | Fix assertion (it did not make sense for static_val types) | 2016-07-11 | |
| | | | |||
* | | | Emulate _BitScanReverse64 for 32-bit builds | 2016-07-11 | |
| | | | |||
* | | | Change runtime to compile-time conditional. | 2016-07-08 | |
| | | | |||
* | | | Fix warnings | 2016-07-08 | |
| | | | |||
* | | | Fix warning | 2016-07-07 | |
| | | | |||
* | | | fix clang compilation | 2016-07-04 | |
| | | | |||
* | | | Work around a compilation issue with MSVC | 2016-07-04 | |
| | | | |||
| | * | Enable efficient Tensor reduction for doubles | 2016-07-01 | |
| |/ |/| | |||
| * | Expose real and imag methods on Tensors | 2016-07-01 | |
|/ | |||
* | Made it possible to compile reductions for an old CUDA architecture and run them on a recent GPU. | 2016-06-29 | |
| | |||
* | Made the code compile when using CUDA architecture < 300 | 2016-06-29 | |
| | |||
* | Add missing CUDA kernel to tensor scan op | 2016-06-29 | |
| | | | | | The TensorScanOp implementation was missing a CUDA kernel launch. This adds a simple placeholder implementation. | ||
* | Don't store the scan axis in the evaluator of the tensor scan operation since it's only used in the constructor. Also avoid taking references to values that may become stale after a copy construction. | 2016-06-27 | |
| | |||
* | Return -1 from CurrentThreadId when called by thread outside the pool. | 2016-06-23 | |
| | |||
* | Resolve merge. | 2016-06-23 | |
|\ | |||
| * | bug #1241: does not emit anything for empty tensors | 2016-06-23 | |
| | | |||
| * | merge PR 194 | 2016-06-23 | |
| |\ | |||
| * | | Handle empty tensors in the print functions | 2016-06-21 | |
| | | | |||
| * | | Fixed the printing of rank-0 tensors | 2016-06-20 | |
| | | | |||
| * | | Implement exclusive scan option | 2016-06-14 | |
| | | | |||
| | * | merge | 2016-06-14 | |
| | |\ | | |/ | |/| | |||
| | * | Update Tensor module to use bind1st_op and bind2nd_op | 2016-06-14 | |
| | | |