aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
...
* | Don't turn on const expressions when compiling with gcc >= 4.8 unless the ↵Gravatar Benoit Steiner2016-04-29
| | | | | | | | -std=c++11 option has been used
* | Restore Tensor support for non c++11 compilersGravatar Benoit Steiner2016-04-29
| |
* | Fixed include pathGravatar Benoit Steiner2016-04-29
| |
* | Fix compilation of sparse.cast<>().transpose().Gravatar Gael Guennebaud2016-04-29
| |
* | Fixed a few memory leaksGravatar Benoit Steiner2016-04-28
| |
* | Fixed the igamma and igammac implementations to make them callable from a ↵Gravatar Benoit Steiner2016-04-28
| | | | | | | | gpu kernel.
* | Deleted unused variableGravatar Benoit Steiner2016-04-28
| |
* | Eliminate mutual recursion in igamma{,c}_impl::Run.Gravatar Justin Lebar2016-04-28
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Presently, igammac_impl::Run calls igamma_impl::Run, which in turn calls igammac_impl::Run. This isn't actually mutual recursion; the calls are guarded such that we never get into a loop. Nonetheless, it's a stretch for clang to prove this. As a result, clang emits a recursive call in both igammac_impl::Run and igamma_impl::Run. That this is suboptimal code is bad enough, but it's particularly bad when compiling for CUDA/nvptx. nvptx allows recursion, but only begrudgingly: If you have recursive calls in a kernel, it's on you to manually specify the kernel's stack size. Otherwise, ptxas will dump a warning, make a guess, and who knows if it's right. This change explicitly eliminates the mutual recursion in igammac_impl::Run and igamma_impl::Run.
* | Fixed compilation error with clang.Gravatar Benoit Steiner2016-04-27
| |
* | Merged in rmlarsen/eigen2 (pull request PR-183)Gravatar Benoit Steiner2016-04-27
|\ \ | | | | | | | | | Detect cxx_constexpr support when compiling with clang.
| * | Depend on the more extensive support for constexpr in clang:Gravatar Rasmus Munk Larsen2016-04-27
| | | | | | | | | | | | http://clang.llvm.org/docs/LanguageExtensions.html#c-1y-relaxed-constexpr
| * | Detect cxx_constexpr support when compiling with clang.Gravatar Rasmus Munk Larsen2016-04-27
| | |
* | | Merged latest update from trunkGravatar Benoit Steiner2016-04-27
|\| |
* | | fpclassify isn't portable enough. In particular, the return values of the ↵Gravatar Benoit Steiner2016-04-27
| | | | | | | | | | | | function are not available on all the platforms Eigen supportes: remove it from Eigen.
| * | Fix missing inclusion of Eigen/CoreGravatar Gael Guennebaud2016-04-27
| | |
* | | Made the index type a template parameter to evaluateProductBlockingSizesGravatar Benoit Steiner2016-04-27
|/ / | | | | | | Use numext::mini and numext::maxi instead of std::min/std::max to compute blocking sizes.
* | Merged latest updates from trunkGravatar Benoit Steiner2016-04-27
|\ \
* | | Improved support for min and max on 16 bit floats when running on recent ↵Gravatar Benoit Steiner2016-04-27
| | | | | | | | | | | | cuda gpus
| * | Merged eigen/eigen into defaultGravatar Rasmus Larsen2016-04-27
| |\ \
| * | | Use computeProductBlockingSizes to compute blocking for both ShardByCol and ↵Gravatar Rasmus Munk Larsen2016-04-27
| | | | | | | | | | | | | | | | ShardByRow cases.
* | | | Added support for fpclassify in Eigen::NumextGravatar Benoit Steiner2016-04-27
| |/ / |/| |
* | | Implement stricter argument checking for SYRK and SY2K and real matrices. To ↵Gravatar Rasmus Munk Larsen2016-04-27
|/ / | | | | | | implement the BLAS API they should return info=2 if op='C' is passed for a complex matrix. Without this change, the Eigen BLAS fails the strict zblat3 and cblat3 tests in LAPACK 3.5.
* | Refactor the unsupported CXX11/Core module to internal headers only.Gravatar Gael Guennebaud2016-04-26
| |
* | Fixed the partial evaluation of non vectorizable tensor subexpressionsGravatar Benoit Steiner2016-04-25
| |
* | Refined the cost of the striding operation.Gravatar Benoit Steiner2016-04-25
| |
* | Merged in rmlarsen/eigen (pull request PR-179)Gravatar Benoit Steiner2016-04-21
|\ \ | | | | | | | | | Prevent crash in CompleteOrthogonalDecomposition if object was default constructed.
* | | Provide access to the base threadpool classesGravatar Benoit Steiner2016-04-21
| | |
| * | Prevent crash in CompleteOrthogonalDecomposition if object was default ↵Gravatar Rasmus Munk Larsen2016-04-21
| | | | | | | | | | | | constructed.
* | | Added the ability to switch to the new thread pool with a #defineGravatar Benoit Steiner2016-04-21
| | |
* | | Use index list for the striding benchmarksGravatar Benoit Steiner2016-04-21
| | |
* | | Fixed several compilation warningsGravatar Benoit Steiner2016-04-21
| | |
* | | Added an option to enable the use of the F16C instruction setGravatar Benoit Steiner2016-04-21
| | |
* | | Use EIGEN_THREAD_YIELD instead of std::this_thread::yield to make the code ↵Gravatar Benoit Steiner2016-04-21
| | | | | | | | | | | | more portable.
* | | Don't crash when attempting to reduce empty tensors.Gravatar Benoit Steiner2016-04-20
| | |
* | | Added more testsGravatar Benoit Steiner2016-04-20
| | |
* | | Don't attempt to leverage the _cvtss_sh and _cvtsh_ss instructions when ↵Gravatar Benoit Steiner2016-04-20
| | | | | | | | | | | | compiling with clang since it's unclear which versions of clang actually support these instruction.
* | | Started to implement a portable way to yield.Gravatar Benoit Steiner2016-04-19
| | |
* | | Made sure all the required header files are included when trying to use fp16Gravatar Benoit Steiner2016-04-19
| | |
* | | Implemented a more portable version of thread local variablesGravatar Benoit Steiner2016-04-19
| | |
* | | Fixed a few typosGravatar Benoit Steiner2016-04-19
| | |
* | | Fixed a compilation error with nvcc 7.Gravatar Benoit Steiner2016-04-19
| | |
* | | Simplified the code that launches cuda kernels.Gravatar Benoit Steiner2016-04-19
| | |
* | | Don't take the address of a kernel on CUDA devices that don't support this ↵Gravatar Benoit Steiner2016-04-19
| | | | | | | | | | | | feature.
* | | Use numext::ceil instead of std::ceilGravatar Benoit Steiner2016-04-19
| | |
* | | Avoid an unnecessary copy of the evaluator.Gravatar Benoit Steiner2016-04-19
| | |
* | | Fixed 2 recent regression testsGravatar Benoit Steiner2016-04-19
|/ /
* | Use DenseIndex in the MeanReducer to avoid overflows when processing very ↵Gravatar Benoit Steiner2016-04-19
| | | | | | | | large tensors.
* | Worked around the lack of a rand_r function on windows systemsGravatar Benoit Steiner2016-04-17
| |
* | Worked around the lack of a rand_r function on windows systemsGravatar Benoit Steiner2016-04-17
| |
* | Enable lazy-coeff-based-product for vector*(1x1) productsGravatar Gael Guennebaud2016-04-16
| |