Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Merge upstream updates. | 2016-04-14 | |
|\ | |||
* | | Eigen cost model part 1. This implements a basic recursive framework to ↵ | 2016-04-14 | |
| | | | | | | | | estimate the cost of evaluating tensor expressions. | ||
| * | Add extreme values to the imaginary part for SVD unit tests. | 2016-04-14 | |
| | | |||
| * | Improve numerical robustness of JacoviSVD: | 2016-04-14 | |
| | | | | | | | | | | - avoid noise amplification in complex to real conversion - compare off-diagonal entries to the current biggest diagonal entry: no need to bother about a 2x2 block containing ridiculously small entries compared to the rest of the matrix. | ||
| * | Force the inlining of the << operator on half floats | 2016-04-14 | |
| | | |||
| * | Inline the << operator on half floats | 2016-04-14 | |
| | | |||
| * | Silenced a compilation warning | 2016-04-14 | |
| | | |||
| * | Added tests to validate flooring and ceiling of fp16 | 2016-04-14 | |
| | | |||
| * | Added simple test for numext::sqrt and numext::pow on fp16 | 2016-04-14 | |
| | | |||
| * | Added basic test for trigonometric functions on fp16 | 2016-04-14 | |
| | | |||
| * | Added support for fp16 to test_isApprox, test_isMuchSmallerThan, and ↵ | 2016-04-14 | |
| | | | | | | | | test_isApproxOrLessThan | ||
| * | Added ability to printf fp16 | 2016-04-14 | |
| | | |||
| * | Added support for fp16 to the sigmoid function | 2016-04-14 | |
| | | |||
| * | Made the test msvc friendly | 2016-04-14 | |
|/ | |||
* | Cleaning pass on rcond estimator. | 2016-04-14 | |
| | |||
* | remove useless include | 2016-04-14 | |
| | |||
* | Better use .data() than &coeffRef(0) | 2016-04-14 | |
| | |||
* | Merged in rmlarsen/eigen (pull request PR-174) | 2016-04-14 | |
|\ | | | | | | | Add matrix condition number estimation module. | ||
* | | Properly gate the definition of the error and gamma functions for fp16 | 2016-04-13 | |
| | | |||
* | | Improved support for trigonometric functions on GPU | 2016-04-13 | |
| | | |||
* | | Added basic implementation of the lgamma, digamma, igamma, igammac, ↵ | 2016-04-13 | |
| | | | | | | | | polygamma, and zeta function for fp16 | ||
* | | merge | 2016-04-13 | |
|\ \ | |||
* | | | Fix JacobiSVD for complex when the complex-to-real update already gives a ↵ | 2016-04-13 | |
| | | | | | | | | | | | | diagonal 2x2 block. | ||
| * | | Cleaned up the implementation of digamma | 2016-04-13 | |
| | | | |||
| * | | Pulled latest updates from trunk | 2016-04-13 | |
| |\ \ | |||
| | * | | Added support for sin, cos, tan, and tanh on fp16 | 2016-04-13 | |
| | | | | |||
* | | | | Add debug output for random unit test | 2016-04-13 | |
|/ / / | |||
* | | | Turn a converge check to a warning | 2016-04-13 | |
| | | | |||
* | | | Fix underflow in JacoviSVD's complex to real preconditioner | 2016-04-13 | |
| | | | |||
* | | | Fix corner case in unit test. | 2016-04-13 | |
| | | | |||
* | | | Fix warning in unit tests | 2016-04-13 | |
|/ / | |||
* | | Added support for computing cos, sin, tan, and tanh on GPU. | 2016-04-13 | |
| | | |||
* | | Added constructors to convert unsigned integers into fp16 | 2016-04-13 | |
| | | |||
* | | Workaround a division by zero when outerstride==0 | 2016-04-13 | |
| | | |||
* | | Make use of is_same_dense helper instead of extract_data to detect ↵ | 2016-04-13 | |
| | | | | | | | | input/outputs are the same. | ||
* | | Fix incomplete previous patch on matrix comparision. | 2016-04-13 | |
| | | |||
* | | Fix detection of same matrices when both matrices are not handled by ↵ | 2016-04-13 | |
| | | | | | | | | extract_data. | ||
* | | Add regression unit tests for half-packet vectorization | 2016-04-13 | |
| | | |||
* | | Improve half-packet vectorization logic to distinguish linear versus inner ↵ | 2016-04-13 | |
| | | | | | | | | traversal modes. | ||
* | | Enable half-packet in reduxions. | 2016-04-13 | |
| | | |||
* | | Fixed compilation warnings generated by clang | 2016-04-12 | |
| | | |||
* | | Enable the benchmarks for algebraic and transcendental fnctions on fp16. | 2016-04-12 | |
| | | |||
* | | Don't put a command at the end of an enumerator list | 2016-04-12 | |
| | | |||
* | | Pulled latest update from trunk | 2016-04-12 | |
|\ \ | |||
* | | | Fixed the vectorization logic test | 2016-04-12 | |
| | | | |||
* | | | Turned on the contraction benchmarks for fp16 | 2016-04-12 | |
| | | | |||
| * | | Enable the use of half-packet in coeff-based product. | 2016-04-12 | |
|/ / | | | | | | | For instance, Matrix4f*Vector4f is now vectorized again when using AVX. | ||
* | | Fixed the zeta test | 2016-04-12 | |
| | | |||
* | | Defer the decision to vectorize tensor CUDA code to the meta kernel. This ↵ | 2016-04-12 | |
| | | | | | | | | makes it possible to decide to vectorize or not depending on the capability of the target cuda architecture. In particular, this enables us to vectorize the processing of fp16 when running on device of capability >= 5.3 | ||
| * | Merged eigen/eigen into default | 2016-04-11 | |
| |\ |