aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen
Commit message (Collapse)AuthorAge
* Merged eigen/eigen into defaultGravatar Benoit Steiner2016-05-03
|\
| * Fixed compilation error with cuda >= 7.5Gravatar Benoit Steiner2016-05-03
| |
| * Made a cast explicitGravatar Benoit Steiner2016-05-02
| |
| * Fix performance regression: with AVX, unaligned stores were emitted instead ↵Gravatar Gael Guennebaud2016-05-01
| | | | | | | | of aligned ones for fixed size assignement.
| * Fixed compilation errors generated by clangGravatar Benoit Steiner2016-04-29
| |
| * Don't turn on const expressions when compiling with gcc >= 4.8 unless the ↵Gravatar Benoit Steiner2016-04-29
| | | | | | | | -std=c++11 option has been used
* | Pulled latest updates from upstreamGravatar Benoit Steiner2016-04-29
|\|
* | Implemented palign_impl for AVX512Gravatar Benoit Steiner2016-04-29
| |
* | Fixed the AVX512 packet traitsGravatar Benoit Steiner2016-04-29
| |
* | Added pdiv packet primitives for avx512Gravatar Benoit Steiner2016-04-29
| |
* | Implemented preduxp for AVX512Gravatar Benoit Steiner2016-04-29
| |
* | Implemented the pabs and preverse primitives for avx512.Gravatar Benoit Steiner2016-04-29
| |
* | Disabled some of the AVX512 primitives on compilers that don't support themGravatar Benoit Steiner2016-04-29
| |
| * Fix compilation of sparse.cast<>().transpose().Gravatar Gael Guennebaud2016-04-29
| |
| * Fixed the igamma and igammac implementations to make them callable from a ↵Gravatar Benoit Steiner2016-04-28
| | | | | | | | gpu kernel.
| * Deleted unused variableGravatar Benoit Steiner2016-04-28
| |
| * Eliminate mutual recursion in igamma{,c}_impl::Run.Gravatar Justin Lebar2016-04-28
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Presently, igammac_impl::Run calls igamma_impl::Run, which in turn calls igammac_impl::Run. This isn't actually mutual recursion; the calls are guarded such that we never get into a loop. Nonetheless, it's a stretch for clang to prove this. As a result, clang emits a recursive call in both igammac_impl::Run and igamma_impl::Run. That this is suboptimal code is bad enough, but it's particularly bad when compiling for CUDA/nvptx. nvptx allows recursion, but only begrudgingly: If you have recursive calls in a kernel, it's on you to manually specify the kernel's stack size. Otherwise, ptxas will dump a warning, make a guess, and who knows if it's right. This change explicitly eliminates the mutual recursion in igammac_impl::Run and igamma_impl::Run.
| * Merged in rmlarsen/eigen2 (pull request PR-183)Gravatar Benoit Steiner2016-04-27
| |\ | | | | | | | | | Detect cxx_constexpr support when compiling with clang.
| | * Depend on the more extensive support for constexpr in clang:Gravatar Rasmus Munk Larsen2016-04-27
| | | | | | | | | | | | http://clang.llvm.org/docs/LanguageExtensions.html#c-1y-relaxed-constexpr
| | * Detect cxx_constexpr support when compiling with clang.Gravatar Rasmus Munk Larsen2016-04-27
| | |
| * | fpclassify isn't portable enough. In particular, the return values of the ↵Gravatar Benoit Steiner2016-04-27
| | | | | | | | | | | | function are not available on all the platforms Eigen supportes: remove it from Eigen.
| * | Made the index type a template parameter to evaluateProductBlockingSizesGravatar Benoit Steiner2016-04-27
| |/ | | | | | | Use numext::mini and numext::maxi instead of std::min/std::max to compute blocking sizes.
| * Improved support for min and max on 16 bit floats when running on recent ↵Gravatar Benoit Steiner2016-04-27
| | | | | | | | cuda gpus
| * Added support for fpclassify in Eigen::NumextGravatar Benoit Steiner2016-04-27
| |
| * Merged in rmlarsen/eigen (pull request PR-179)Gravatar Benoit Steiner2016-04-21
| |\ | | | | | | | | | Prevent crash in CompleteOrthogonalDecomposition if object was default constructed.
| | * Prevent crash in CompleteOrthogonalDecomposition if object was default ↵Gravatar Rasmus Munk Larsen2016-04-21
| | | | | | | | | | | | constructed.
| * | Don't attempt to leverage the _cvtss_sh and _cvtsh_ss instructions when ↵Gravatar Benoit Steiner2016-04-20
| | | | | | | | | | | | compiling with clang since it's unclear which versions of clang actually support these instruction.
| * | Made sure all the required header files are included when trying to use fp16Gravatar Benoit Steiner2016-04-19
| |/
| * Enable lazy-coeff-based-product for vector*(1x1) productsGravatar Gael Guennebaud2016-04-16
| |
| * Deleted extraneous comma.Gravatar Benoit Steiner2016-04-15
| |
| * bug #1203: by-pass large stack-allocation in stableNorm if ↵Gravatar Gael Guennebaud2016-04-15
| | | | | | | | EIGEN_STACK_ALLOCATION_LIMIT is too small
| * Improved the matrix multiplication blocking in the case where mr is not a ↵Gravatar Benoit Steiner2016-04-15
| | | | | | | | power of 2 (e.g on Haswell CPUs).
| * Fix trmv for mixing types.Gravatar Gael Guennebaud2016-04-15
| |
| * Added ability to access the cache sizes from the tensor devicesGravatar Benoit Steiner2016-04-14
| |
| * Added support for exclusive orGravatar Benoit Steiner2016-04-14
| |
| * Improve numerical robustness of JacoviSVD:Gravatar Gael Guennebaud2016-04-14
| | | | | | | | | | - avoid noise amplification in complex to real conversion - compare off-diagonal entries to the current biggest diagonal entry: no need to bother about a 2x2 block containing ridiculously small entries compared to the rest of the matrix.
| * Force the inlining of the << operator on half floatsGravatar Benoit Steiner2016-04-14
| |
| * Inline the << operator on half floatsGravatar Benoit Steiner2016-04-14
| |
| * Added ability to printf fp16Gravatar Benoit Steiner2016-04-14
| |
| * Cleaning pass on rcond estimator.Gravatar Gael Guennebaud2016-04-14
| |
| * Better use .data() than &coeffRef(0)Gravatar Gael Guennebaud2016-04-14
| |
| * Merged in rmlarsen/eigen (pull request PR-174)Gravatar Gael Guennebaud2016-04-14
| |\ | | | | | | | | | Add matrix condition number estimation module.
| * | Properly gate the definition of the error and gamma functions for fp16Gravatar Benoit Steiner2016-04-13
| | |
| * | Improved support for trigonometric functions on GPUGravatar Benoit Steiner2016-04-13
| | |
| * | Added basic implementation of the lgamma, digamma, igamma, igammac, ↵Gravatar Benoit Steiner2016-04-13
| | | | | | | | | | | | polygamma, and zeta function for fp16
| * | mergeGravatar Gael Guennebaud2016-04-13
| |\ \
| * | | Fix JacobiSVD for complex when the complex-to-real update already gives a ↵Gravatar Gael Guennebaud2016-04-13
| | | | | | | | | | | | | | | | diagonal 2x2 block.
| | * | Cleaned up the implementation of digammaGravatar Benoit Steiner2016-04-13
| | | |
| | * | Pulled latest updates from trunkGravatar Benoit Steiner2016-04-13
| |/| |
| | * | Added support for sin, cos, tan, and tanh on fp16Gravatar Benoit Steiner2016-04-13
| | | |