Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Added the ability to query the minor version of a cuda device | 2016-02-19 | |
| | |||
* | Fixed a bug in the tensor type converter | 2016-02-19 | |
| | |||
* | Added a method to conjugate the content of a tensor or the result of a ↵ | 2016-02-11 | |
| | | | | tensor expression. | ||
* | Worked around a few clang compilation warnings | 2016-02-10 | |
| | |||
* | Fixed clang comilation warnings | 2016-02-10 | |
| | |||
* | Fixed some clang compilation warnings | 2016-02-09 | |
| | |||
* | Updated the TensorIntDivisor code to work properly on LLP64 systems | 2016-02-08 | |
| | |||
* | Avoid unecessary type conversions | 2016-02-05 | |
| | |||
* | Added support for vectorized type casting of int to char. | 2016-02-03 | |
| | |||
* | Merged in ville-k/eigen/explicit_long_constructors (pull request PR-158) | 2016-02-02 | |
|\ | | | | | | | Add constructor for long types. | ||
| * | Use EIGEN_STATIC_ASSERT for backward compatibility. | 2016-02-02 | |
| | | |||
* | | Don't try to use direct offsets when computing a tensor product, since the ↵ | 2016-02-02 | |
| | | | | | | | | required stride isn't available. | ||
| * | Replace separate low word constructors with a single templated constructor. | 2016-02-01 | |
| | | |||
| * | Rebase to latest. | 2016-02-01 | |
| |\ | |/ |/| | |||
* | | Made it possible to limit the number of blocks that will be used to evaluate ↵ | 2016-02-01 | |
| | | | | | | | | a tensor expression on a CUDA device. This makesit possible to set aside streaming multiprocessors for other computations. | ||
* | | Fixed a number of compilation warnings generated by the cuda tests | 2016-01-31 | |
| | | |||
* | | Fixed a few compilation warnings | 2016-01-31 | |
| | | |||
* | | Marked several methods EIGEN_DEVICE_FUNC | 2016-01-28 | |
| | | |||
* | | Fixed a couple of compilation warnings. | 2016-01-28 | |
| | | |||
* | | merge | 2016-01-28 | |
|\ \ | |||
* | | | Deleted an invalid assertion that prevented the assignment of empty tensors. | 2016-01-27 | |
| | | | |||
* | | | Fixed some compilation problems with nvcc + clang | 2016-01-27 | |
| | | | |||
| | * | Add constructor for long types. | 2016-01-26 | |
| | | | |||
* | | | Don't explicitely evaluate the subexpression from ↵ | 2016-01-24 | |
| | | | | | | | | | | | | TensorForcedEval::evalSubExprIfNeeded, as it will be done when executing the EvalTo subexpression | ||
* | | | Added missing EIGEN_DEVICE_FUNC qualifier | 2016-01-24 | |
| | | | |||
* | | | Merged in ville-k/eigen/tensorflow_fix (pull request PR-153) | 2016-01-22 | |
|\ \ \ | | | | | | | | | | | | | Add ctor for long | ||
* | | | | Leverage the new blocking code in the tensor contraction code. | 2016-01-22 | |
| |_|/ |/| | | |||
* | | | Created a mechanism to enable contraction mappers to determine the best ↵ | 2016-01-22 | |
| | | | | | | | | | | | | blocking strategy. | ||
* | | | Backout changeset 690bc950f70c61075d396671e63480bbd64bb297 | 2016-01-22 | |
| | | | |||
| * | | Update to latest default branch | 2016-01-21 | |
| |\ \ | |/ / |/| | | |||
* | | | Fixed a constness bug | 2016-01-21 | |
| | | | |||
* | | | fix clang warnings | 2016-01-20 | |
| | | | | | | | | | | | | "braces around scalar initializer" | ||
* | | | Small cleanup and small fix to the contraction of row major tensors | 2016-01-20 | |
| | | | |||
* | | | Reduce the register pressure exerted by the tensor mappers whenever ↵ | 2016-01-20 | |
| | | | | | | | | | | | | possible. This improves the performance of the contraction of a matrix with a vector by about 35%. | ||
| * | | Use explicitly 32 bit integer types in constructors. | 2016-01-19 | |
| | | | |||
* | | | Improved the formatting of the code | 2016-01-19 | |
| | | | |||
* | | | Moved the contraction mapping code to its own file to make the code more ↵ | 2016-01-19 | |
| | | | | | | | | | | | | manageable. | ||
* | | | Improved code indentation | 2016-01-19 | |
| | | | |||
* | | | Record whether the underlying tensor storage can be accessed directly during ↵ | 2016-01-19 | |
| | | | | | | | | | | | | the evaluation of an expression. | ||
| * | | Add ctor for long | 2016-01-17 | |
| | | | |||
* | | | Fixed a race condition that could affect some reductions on CUDA devices. | 2016-01-15 | |
| | | | |||
* | | | Made it possible to compare tensor dimensions inside a CUDA kernel. | 2016-01-15 | |
| | | | |||
* | | | Use warp shuffles instead of shared memory access to speedup the inner ↵ | 2016-01-14 | |
| | | | | | | | | | | | | reduction kernel. | ||
* | | | Fixed a boundary condition bug in the outer reduction kernel | 2016-01-14 | |
| | | | |||
* | | | Properly record the rank of reduced tensors in the tensor traits. | 2016-01-13 | |
| | | | |||
* | | | Trigger the optimized matrix vector path more conservatively. | 2016-01-12 | |
| | | | |||
* | | | Improved the performance of the contraction of a 2d tensor with a 1d tensor ↵ | 2016-01-12 | |
| | | | | | | | | | | | | by a factor of 3 or more. This helps speedup LSTM neural networks. | ||
* | | | Reverted a previous change that tripped nvcc when compiling in debug mode. | 2016-01-11 | |
| | | | |||
* | | | Silenced a few compilation warnings. | 2016-01-11 | |
| | | | |||
* | | | Updated the tensor traits: the alignment is not part of the Flags enum anymore | 2016-01-11 | |
| | | |