aboutsummaryrefslogtreecommitdiffhomepage
path: root/unsupported/Eigen/CXX11/src/Tensor
Commit message (Collapse)AuthorAge
* Added the ability to query the minor version of a cuda deviceGravatar Benoit Steiner2016-02-19
|
* Fixed a bug in the tensor type converterGravatar Benoit Steiner2016-02-19
|
* Added a method to conjugate the content of a tensor or the result of a ↵Gravatar Benoit Steiner2016-02-11
| | | | tensor expression.
* Worked around a few clang compilation warningsGravatar Benoit Steiner2016-02-10
|
* Fixed clang comilation warningsGravatar Benoit Steiner2016-02-10
|
* Fixed some clang compilation warningsGravatar Benoit Steiner2016-02-09
|
* Updated the TensorIntDivisor code to work properly on LLP64 systemsGravatar Benoit Steiner2016-02-08
|
* Avoid unecessary type conversionsGravatar Benoit Steiner2016-02-05
|
* Added support for vectorized type casting of int to char.Gravatar Benoit Steiner2016-02-03
|
* Merged in ville-k/eigen/explicit_long_constructors (pull request PR-158)Gravatar Benoit Steiner2016-02-02
|\ | | | | | | Add constructor for long types.
| * Use EIGEN_STATIC_ASSERT for backward compatibility.Gravatar Ville Kallioniemi2016-02-02
| |
* | Don't try to use direct offsets when computing a tensor product, since the ↵Gravatar Benoit Steiner2016-02-02
| | | | | | | | required stride isn't available.
| * Replace separate low word constructors with a single templated constructor.Gravatar Ville Kallioniemi2016-02-01
| |
| * Rebase to latest.Gravatar Ville Kallioniemi2016-02-01
| |\ | |/ |/|
* | Made it possible to limit the number of blocks that will be used to evaluate ↵Gravatar Benoit Steiner2016-02-01
| | | | | | | | a tensor expression on a CUDA device. This makesit possible to set aside streaming multiprocessors for other computations.
* | Fixed a number of compilation warnings generated by the cuda testsGravatar Benoit Steiner2016-01-31
| |
* | Fixed a few compilation warningsGravatar Benoit Steiner2016-01-31
| |
* | Marked several methods EIGEN_DEVICE_FUNCGravatar Benoit Steiner2016-01-28
| |
* | Fixed a couple of compilation warnings.Gravatar Benoit Steiner2016-01-28
| |
* | mergeGravatar Gael Guennebaud2016-01-28
|\ \
* | | Deleted an invalid assertion that prevented the assignment of empty tensors.Gravatar Benoit Steiner2016-01-27
| | |
* | | Fixed some compilation problems with nvcc + clangGravatar Benoit Steiner2016-01-27
| | |
| | * Add constructor for long types.Gravatar Ville Kallioniemi2016-01-26
| | |
* | | Don't explicitely evaluate the subexpression from ↵Gravatar Benoit Steiner2016-01-24
| | | | | | | | | | | | TensorForcedEval::evalSubExprIfNeeded, as it will be done when executing the EvalTo subexpression
* | | Added missing EIGEN_DEVICE_FUNC qualifierGravatar Benoit Steiner2016-01-24
| | |
* | | Merged in ville-k/eigen/tensorflow_fix (pull request PR-153)Gravatar Benoit Steiner2016-01-22
|\ \ \ | | | | | | | | | | | | Add ctor for long
* | | | Leverage the new blocking code in the tensor contraction code.Gravatar Benoit Steiner2016-01-22
| |_|/ |/| |
* | | Created a mechanism to enable contraction mappers to determine the best ↵Gravatar Benoit Steiner2016-01-22
| | | | | | | | | | | | blocking strategy.
* | | Backout changeset 690bc950f70c61075d396671e63480bbd64bb297Gravatar Gael Guennebaud2016-01-22
| | |
| * | Update to latest default branchGravatar Ville Kallioniemi2016-01-21
| |\ \ | |/ / |/| |
* | | Fixed a constness bugGravatar Benoit Steiner2016-01-21
| | |
* | | fix clang warningsGravatar Jan Prach2016-01-20
| | | | | | | | | | | | "braces around scalar initializer"
* | | Small cleanup and small fix to the contraction of row major tensorsGravatar Benoit Steiner2016-01-20
| | |
* | | Reduce the register pressure exerted by the tensor mappers whenever ↵Gravatar Benoit Steiner2016-01-20
| | | | | | | | | | | | possible. This improves the performance of the contraction of a matrix with a vector by about 35%.
| * | Use explicitly 32 bit integer types in constructors.Gravatar Ville Kallioniemi2016-01-19
| | |
* | | Improved the formatting of the codeGravatar Benoit Steiner2016-01-19
| | |
* | | Moved the contraction mapping code to its own file to make the code more ↵Gravatar Benoit Steiner2016-01-19
| | | | | | | | | | | | manageable.
* | | Improved code indentationGravatar Benoit Steiner2016-01-19
| | |
* | | Record whether the underlying tensor storage can be accessed directly during ↵Gravatar Benoit Steiner2016-01-19
| | | | | | | | | | | | the evaluation of an expression.
| * | Add ctor for longGravatar Ville Kallioniemi2016-01-17
| | |
* | | Fixed a race condition that could affect some reductions on CUDA devices.Gravatar Benoit Steiner2016-01-15
| | |
* | | Made it possible to compare tensor dimensions inside a CUDA kernel.Gravatar Benoit Steiner2016-01-15
| | |
* | | Use warp shuffles instead of shared memory access to speedup the inner ↵Gravatar Benoit Steiner2016-01-14
| | | | | | | | | | | | reduction kernel.
* | | Fixed a boundary condition bug in the outer reduction kernelGravatar Benoit Steiner2016-01-14
| | |
* | | Properly record the rank of reduced tensors in the tensor traits.Gravatar Benoit Steiner2016-01-13
| | |
* | | Trigger the optimized matrix vector path more conservatively.Gravatar Benoit Steiner2016-01-12
| | |
* | | Improved the performance of the contraction of a 2d tensor with a 1d tensor ↵Gravatar Benoit Steiner2016-01-12
| | | | | | | | | | | | by a factor of 3 or more. This helps speedup LSTM neural networks.
* | | Reverted a previous change that tripped nvcc when compiling in debug mode.Gravatar Benoit Steiner2016-01-11
| | |
* | | Silenced a few compilation warnings.Gravatar Benoit Steiner2016-01-11
| | |
* | | Updated the tensor traits: the alignment is not part of the Flags enum anymoreGravatar Benoit Steiner2016-01-11
| | |