aboutsummaryrefslogtreecommitdiffhomepage
Commit message (Collapse)AuthorAge
...
* | | | Implement custom inplace triangular product to avoid a temporaryGravatar Gael Guennebaud2018-07-03
| | | |
* | | | Make is_same_dense compatible with different scalar types.Gravatar Gael Guennebaud2018-07-03
| | | |
* | | | Activate dgmres unit testGravatar Gael Guennebaud2018-07-02
| | | |
* | | | Fix regression in changeset f05dea6b2326836e5e0243fbaffbece84b833d64Gravatar Gael Guennebaud2018-07-02
| | | | | | | | | | | | | | | | : computeFromHessenberg can take any expression for matrixQ, not only an HouseholderSequence.
* | | | Simplify redux_evaluator using inheritance, and properly rename parameters ↵Gravatar Gael Guennebaud2018-07-02
| | | | | | | | | | | | | | | | in reducers.
* | | | bug #1562: optimize evaluation of small products of the form s*A*B by ↵Gravatar Gael Guennebaud2018-07-02
| | | | | | | | | | | | | | | | rewriting them as: s*(A.lazyProduct(B)) to save a costly temporary. Measured speedup from 2x to 5x...
* | | | Fix unit testGravatar Gael Guennebaud2018-07-01
| | | |
* | | | update commentGravatar Gael Guennebaud2018-06-29
| | | |
* | | | Merged in net147/eigen (pull request PR-411)Gravatar Gael Guennebaud2018-06-28
|\ \ \ \ | | | | | | | | | | | | | | | Use std::complex constructor instead of assignment from scalar
* | | | | Fix order of EIGEN_DEVICE_FUNC and returned typeGravatar Gael Guennebaud2018-06-28
| | | | |
| * | | | Use std::complex constructor instead of assignment from scalarGravatar Jonathan Liu2018-06-28
|/ / / / | | | | | | | | | | | | | | | | Fixes GCC conversion to non-scalar type requested compile error when using boost::multiprecision::cpp_dec_float_50 as scalar type.
* | | | First step towards a generic vectorised quaternion productGravatar Gael Guennebaud2018-06-25
| | | |
* | | | bug #1560 fix product with a 1x1 diagonal matrixGravatar Gael Guennebaud2018-06-25
| | | |
* | | | mergeGravatar Rasmus Munk Larsen2018-06-22
|\ \ \ \
* | | | | Fix typo in pbend for AltiVec.Gravatar Rasmus Munk Larsen2018-06-22
| | | | |
| * | | | Merged in rmlarsen/eigen2 (pull request PR-409)Gravatar Benoit Steiner2018-06-21
| |\ \ \ \ | |/ / / / |/| | | | | | | | | Fix oversharding bug in parallelFor.
| * | | | bug #1555: compilation fix with XLCGravatar Gael Guennebaud2018-06-21
| | |_|/ | |/| |
* / | | Fix oversharding bug in parallelFor.Gravatar Rasmus Munk Larsen2018-06-20
|/ / /
* | / fix md5sum of lapack_addonsGravatar Gael Guennebaud2018-06-15
| |/ |/|
* | Merged in mfigurnov/eigen/gamma-der-a (pull request PR-403)Gravatar Benoit Steiner2018-06-11
|\ \ | | | | | | | | | | | | | | | Derivative of the incomplete Gamma function and the sample of a Gamma random variable Approved-by: Benoit Steiner <benoit.steiner.goog@gmail.com>
* | | bug #1531: make dedicatd unit testing for NumDimensionsGravatar Gael Guennebaud2018-06-08
| | |
* | | bug #1531: expose NumDimensions for solve and sparse expressions.Gravatar Gael Guennebaud2018-06-08
| | |
* | | bug #1531: expose NumDimensions for compatibility with TensorGravatar Gael Guennebaud2018-06-08
| | |
* | | bug #1550: prevent avoidable memory allocation in RealSchurGravatar Gael Guennebaud2018-06-08
| | |
* | | fix prototypeGravatar Gael Guennebaud2018-06-08
| | |
* | | Fix the way matrix folder is passed to the tests.Gravatar Gael Guennebaud2018-06-08
| | |
* | | Don't use std::equal_to inside cuda kernels since it's not supported.Gravatar Benoit Steiner2018-06-07
| | |
* | | Missing line during manual rebase of PR-374Gravatar Christoph Hertzberg2018-06-07
| | |
| * | Merge from eigen/eigenGravatar Michael Figurnov2018-06-07
| |\ \
| * | | Updated the stopping criteria in igammac_cf_impl.Gravatar Michael Figurnov2018-06-07
| | | | | | | | | | | | | | | | Previously, when computing the derivative, it used a relative error threshold. Now it uses an absolute error threshold. The behavior for computing the value is unchanged. This makes more sense since we do not expect the derivative to often be close to zero. This change makes the derivatives about 30% faster across the board. The error for the igamma_der_a is almost unchanged, while for gamma_sample_der_alpha it is a bit worse for float32 and unchanged for float64.
| * | | Derivative of the incomplete Gamma function and the sample of a Gamma random ↵Gravatar Michael Figurnov2018-06-06
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | variable. In addition to igamma(a, x), this code implements: * igamma_der_a(a, x) = d igamma(a, x) / da -- derivative of igamma with respect to the parameter * gamma_sample_der_alpha(alpha, sample) -- reparameterization derivative of a Gamma(alpha, 1) random variable sample with respect to the alpha parameter The derivatives are computed by forward mode differentiation of the igamma(a, x) code. Although gamma_sample_der_alpha can be implemented via igamma_der_a, a separate function is more accurate and efficient due to analytical cancellation of some terms. All three functions are implemented by a method parameterized with "mode" that always computes the derivatives, but does not return them unless required by the mode. The compiler is expected to (and, based on benchmarks, does) skip the unnecessary computations depending on the mode.
| | | * Adding support for using Eigen in HIP kernels.Gravatar Deven Desai2018-06-06
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | This commit enables the use of Eigen on HIP kernels / AMD GPUs. Support has been added along the same lines as what already exists for using Eigen in CUDA kernels / NVidia GPUs. Application code needs to explicitly define EIGEN_USE_HIP when using Eigen in HIP kernels. This is because some of the CUDA headers get picked up by default during Eigen compile (irrespective of whether or not the underlying compiler is CUDACC/NVCC, for e.g. Eigen/src/Core/arch/CUDA/Half.h). In order to maintain this behavior, the EIGEN_USE_HIP macro is used to switch to using the HIP version of those header files (see Eigen/Core and unsupported/Eigen/CXX11/Tensor) Use the "-DEIGEN_TEST_HIP" cmake option to enable the HIP specific unit tests.
* | | Adding EIGEN_DEVICE_FUNC to Products, especially Dense2Dense AssignmentGravatar Robert Lukierski2018-03-14
| |/ |/| | | | | | | specializations. Otherwise causes problems with small fixed size matrix multiplication (call to 0x00 in call_assignment_no_alias in debug mode or trap in release with CUDA 9.1).
* | Merged in mfigurnov/eigen/fix-bessel (pull request PR-404)Gravatar Benoit Steiner2018-06-07
|\ \ | | | | | | | | | Fix compilation of special functions without C99 math.
| * \ Merge from eigen/eigen.Gravatar Michael Figurnov2018-06-07
| |\ \ | |/ / |/| |
* | | Fiw some warnings in dox examplesGravatar Gael Guennebaud2018-06-07
| | |
* | | Fix int versus IndexGravatar Gael Guennebaud2018-06-07
| | |
* | | Fix warningGravatar Gael Guennebaud2018-06-07
| | |
* | | Fix warningGravatar Gael Guennebaud2018-06-07
| | |
* | | Fix MSVC warning C4290: C++ exception specification ignored except to ↵Gravatar Gael Guennebaud2018-06-07
| | | | | | | | | | | | indicate a function is not __declspec(nothrow)
| * | Fix compilation of special functions without C99 math.Gravatar Michael Figurnov2018-06-07
| |/ | | | | | | | | | | The commit with Bessel functions i0e and i1e placed the ifdef/endif incorrectly, causing i0e/i1e to be undefined when EIGEN_HAS_C99_MATH=0. These functions do not actually require C99 math, so now they are always available.
* | Fix short vs longGravatar Gael Guennebaud2018-06-07
| |
* | Avoid unnecessary C++11 dependencyGravatar Christoph Hertzberg2018-06-07
| |
* | Fix typos found using codespellGravatar Gael Guennebaud2018-06-07
| |
* | Set EIGEN_IDEAL_MAX_ALIGN_BYTES correctly for AVX512 buildsGravatar Mark D Ryan2018-05-17
| | | | | | | | | | | | | | | | | | | | | | | | bug #1548 The macro EIGEN_IDEAL_MAX_ALIGN_BYTES is being incorrectly set to 32 on AVX512 builds. It should be set to 64. In the current code it is only set to 64 if the macro EIGEN_VECTORIZE_AVX512 is defined. This macro does get defined in AVX512 builds in Core, but only after Macros.h, the file that defines EIGEN_IDEAL_MAX_ALIGN_BYTES, has been included. This commit fixes the issue by setting EIGEN_IDEAL_MAX_ALIGN_BYTES to 64 if __AVX512F__ is defined.
* | Performance improvements to tensor broadcast operationGravatar Vamsi Sripathi2018-05-23
| | | | | | | | | | | | 1. Added new packet functions using SIMD for NByOne, OneByN cases 2. Modified existing packet functions to reduce index calculations when input stride is non-SIMD 3. Added 4 test cases to cover the new packet functions
* | Fix compilation with MSVC by reverting to char* for _mm_prefetch except for ↵Gravatar Gael Guennebaud2018-06-07
|/ | | | PGI (the later being the one that has the wrong prototype).
* Merged in mfigurnov/eigen (pull request PR-400)Gravatar Benoit Steiner2018-06-05
|\ | | | | | | | | | | Exponentially scaled modified Bessel functions of order zero and one. Approved-by: Benoit Steiner <benoit.steiner.goog@gmail.com>
* | Add a ThreadPoolInterface* getter for ThreadPoolDevice.Gravatar Penporn Koanantakool2018-06-02
| |
* | Don't run hg on non mercurial cloneGravatar Gael Guennebaud2018-05-31
| |