Commit message [Author, Date]
* Merge upstream updates. [Rasmus Munk Larsen, 2016-04-14]
|\
* | Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions. [Rasmus Munk Larsen, 2016-04-14]
| |
| * Add extreme values to the imaginary part for SVD unit tests. [Gael Guennebaud, 2016-04-14]
| |
| * Improve numerical robustness of JacobiSVD: [Gael Guennebaud, 2016-04-14]
| |   - avoid noise amplification in complex to real conversion
| |   - compare off-diagonal entries to the current biggest diagonal entry: no need to bother about a 2x2 block containing ridiculously small entries compared to the rest of the matrix.
| * Force the inlining of the << operator on half floats [Benoit Steiner, 2016-04-14]
| |
| * Inline the << operator on half floats [Benoit Steiner, 2016-04-14]
| |
| * Silenced a compilation warning [Benoit Steiner, 2016-04-14]
| |
| * Added tests to validate flooring and ceiling of fp16 [Benoit Steiner, 2016-04-14]
| |
| * Added simple test for numext::sqrt and numext::pow on fp16 [Benoit Steiner, 2016-04-14]
| |
| * Added basic test for trigonometric functions on fp16 [Benoit Steiner, 2016-04-14]
| |
| * Added support for fp16 to test_isApprox, test_isMuchSmallerThan, and test_isApproxOrLessThan [Benoit Steiner, 2016-04-14]
| |
| * Added ability to printf fp16 [Benoit Steiner, 2016-04-14]
| |
| * Added support for fp16 to the sigmoid function [Benoit Steiner, 2016-04-14]
| |
| * Made the test msvc friendly [Benoit Steiner, 2016-04-14]
|/
* Cleaning pass on rcond estimator. [Gael Guennebaud, 2016-04-14]
|
* remove useless include [Gael Guennebaud, 2016-04-14]
|
* Better use .data() than &coeffRef(0) [Gael Guennebaud, 2016-04-14]
|
* Merged in rmlarsen/eigen (pull request PR-174) [Gael Guennebaud, 2016-04-14]
|\    Add matrix condition number estimation module.
* | Properly gate the definition of the error and gamma functions for fp16 [Benoit Steiner, 2016-04-13]
| |
* | Improved support for trigonometric functions on GPU [Benoit Steiner, 2016-04-13]
| |
* | Added basic implementation of the lgamma, digamma, igamma, igammac, polygamma, and zeta function for fp16 [Benoit Steiner, 2016-04-13]
| |
* | merge [Gael Guennebaud, 2016-04-13]
|\ \
* | | Fix JacobiSVD for complex when the complex-to-real update already gives a diagonal 2x2 block. [Gael Guennebaud, 2016-04-13]
| | |
| * | Cleaned up the implementation of digamma [Benoit Steiner, 2016-04-13]
| | |
| * | Pulled latest updates from trunk [Benoit Steiner, 2016-04-13]
| |\ \
| | * | Added support for sin, cos, tan, and tanh on fp16 [Benoit Steiner, 2016-04-13]
| | | |
* | | | Add debug output for random unit test [Gael Guennebaud, 2016-04-13]
|/ / /
* | | Turn a convergence check into a warning [Gael Guennebaud, 2016-04-13]
| | |
* | | Fix underflow in JacobiSVD's complex to real preconditioner [Gael Guennebaud, 2016-04-13]
| | |
* | | Fix corner case in unit test. [Gael Guennebaud, 2016-04-13]
| | |
* | | Fix warning in unit tests [Gael Guennebaud, 2016-04-13]
|/ /
* | Added support for computing cos, sin, tan, and tanh on GPU. [Benoit Steiner, 2016-04-13]
| |
* | Added constructors to convert unsigned integers into fp16 [Benoit Steiner, 2016-04-13]
| |
* | Workaround a division by zero when outerstride==0 [Gael Guennebaud, 2016-04-13]
| |
* | Make use of is_same_dense helper instead of extract_data to detect whether inputs/outputs are the same. [Gael Guennebaud, 2016-04-13]
| |
* | Fix incomplete previous patch on matrix comparison. [Gael Guennebaud, 2016-04-13]
| |
* | Fix detection of same matrices when both matrices are not handled by extract_data. [Gael Guennebaud, 2016-04-13]
| |
* | Add regression unit tests for half-packet vectorization [Gael Guennebaud, 2016-04-13]
| |
* | Improve half-packet vectorization logic to distinguish linear versus inner traversal modes. [Gael Guennebaud, 2016-04-13]
| |
* | Enable half-packet in reduxions. [Gael Guennebaud, 2016-04-13]
| |
* | Fixed compilation warnings generated by clang [Benoit Steiner, 2016-04-12]
| |
* | Enable the benchmarks for algebraic and transcendental functions on fp16. [Benoit Steiner, 2016-04-12]
| |
* | Don't put a comma at the end of an enumerator list [Benoit Steiner, 2016-04-12]
| |
* | Pulled latest update from trunk [Benoit Steiner, 2016-04-12]
|\ \
* | | Fixed the vectorization logic test [Benoit Steiner, 2016-04-12]
| | |
* | | Turned on the contraction benchmarks for fp16 [Benoit Steiner, 2016-04-12]
| | |
| * | Enable the use of half-packet in coeff-based product. [Gael Guennebaud, 2016-04-12]
|/ /    For instance, Matrix4f*Vector4f is now vectorized again when using AVX.
* | Fixed the zeta test [Benoit Steiner, 2016-04-12]
| |
* | Defer the decision to vectorize tensor CUDA code to the meta kernel. This makes it possible to decide to vectorize or not depending on the capability of the target CUDA architecture. In particular, this enables us to vectorize the processing of fp16 when running on devices of capability >= 5.3 [Benoit Steiner, 2016-04-12]
| |
| * Merged eigen/eigen into default [Rasmus Larsen, 2016-04-11]
| |\