eigen - C++ library for linear algebra

	Commit message (Collapse)	Author	Age
*	Fix more enum arithmetic.	Rasmus Munk Larsen	2021-06-15
\|
*	Fix c++20 warnings about using enums in arithmetic expressions.	Rasmus Munk Larsen	2021-06-10
\|
*	Add missing NEON ptranspose implementations.	Antonio Sanchez	2021-05-25
\| \| \| \|	Unified implementation using only `vzip`.
*	predux_half_dowto4 test extended to all applicable packets	Jakub Lichman	2021-05-21
\|
*	Use derived object type in conservative_resize_like_impl	Niall Murphy	2021-05-20
\| \| \| \| \| \| \| \| \|	When calling conservativeResize() on a matrix with DontAlign flag, the temporary variable used to perform the resize should have the same Options as the original matrix to ensure that the correct override of swap is called (i.e. PlainObjectBase::swap(DenseBase<OtherDerived> & other)). Calling the base class swap (i.e. in DenseBase) results in assertion errors or memory corruption.
*	ptranspose test for non-square kernels added	Jakub Lichman	2021-05-19
\|
*	Ensure all generated matrices for inverse_4x4 tests are invertible, this ↵	Guoqiang QI	2021-05-13
\| \| \| \|	fixes #2248.
*	Fix numext::arg return type.	Antonio Sanchez	2021-05-07
\| \| \| \| \| \| \| \|	The cxx11 path for `numext::arg` incorrectly returned the complex type instead of the real type, leading to compile errors. Fixed this and added tests. Related to !477, which uncovered the issue.
*	Added complex matrix unit tests for SelfAdjointEigenSolve	Theo Fletcher	2021-04-26
\|
*	Tests added and AVX512 bug fixed for pcmp_lt_or_nan	Jakub Lichman	2021-04-25
\|
*	Tests for pcmp_lt and pcmp_le added	Jakub Lichman	2021-04-23
\|
*	DenseStorage safely copy/swap.	Antonio Sanchez	2021-04-22
\| \| \| \| \| \| \| \|	Fixes #2229. For dynamic matrices with fixed-sized storage, only copy/swap elements that have been set. Otherwise, this leads to inefficient copying, and potential UB for non-initialized elements.
*	Modify googlehash use to account for namespace issues.	Antonio Sanchez	2021-04-12
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The namespace declaration for googlehash is a configurable macro that can be disabled. In particular, it is disabled within google, causing compile errors since `dense_hash_map`/`sparse_hash_map` are then in the global namespace instead of in `::google`. Here we play a bit of gymnastics to allow for both `google::_hash_map` and `_hash_map`, while limiting namespace pollution. Symbols within the `::google` namespace are imported into `Eigen::google`. We also remove checks based on `_SPARSE_HASH_MAP_H_`, as this is fragile, and instead require `EIGEN_GOOGLEHASH_SUPPORT` to be defined.
*	Make iterators default constructible and assignable, by making...	Christoph Hertzberg	2021-04-09
\|
*	Fix clang tidy warnings in AnnoyingScalar.	Antonio Sanchez	2021-04-05
\| \| \| \| \| \| \| \|	Clang-tidy complains that full specializations in headers can cause ODR violations. Marked these as `inline` to fix. It also complains about renaming arguments in specializations. Set the argument names to match.
*	Fix two bugs in commit	Rasmus Munk Larsen	2021-04-02
\|
*	Add CMake infrastructure for smoke testing	David Tellenbach	2021-03-31
\| \| \| \| \|	Necessary CMake changes to implement pre-merge smoke tests running via CI.
*	Add an info() method to the SVDBase class to make it possible to tell the ↵	Rasmus Munk Larsen	2021-03-31
\| \| \| \| \| \|	user that the computation failed, possibly due to invalid input. Make Jacobi and divide-and-conquer fail fast and return info() == InvalidInput if the matrix contains NaN or +/-Inf.
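The fail-fast check described above can be sketched outside Eigen as follows. This is a minimal illustration under stated assumptions, not Eigen's implementation: `Status` and `check_finite` are hypothetical names, and a flat `std::vector<double>` stands in for the input matrix.

```cpp
#include <cmath>
#include <vector>

// Hypothetical status type; it only mirrors the idea of reporting invalid input.
enum class Status { Success, InvalidInput };

// Scan the coefficients once and bail out on the first NaN or +/-Inf, so the
// caller can report the failure instead of iterating on garbage.
inline Status check_finite(const std::vector<double>& coeffs) {
  for (double v : coeffs) {
    if (!std::isfinite(v)) return Status::InvalidInput;
  }
  return Status::Success;
}
```

A solver built along these lines would run the check before any decomposition work and store the result for a later status query.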
*	Fix CUDA constexpr issues for numeric_limits.	Antonio Sanchez	2021-03-30
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Some CUDA/HIP constants fail on device with `constexpr` since they internally rely on non-constexpr functions, e.g. ``` #define CUDART_INF_F __int_as_float(0x7f800000) ``` This fails for cuda-clang (though passes with nvcc). These constants are currently used by `device::numeric_limits`. For portability, we need to remove `constexpr` from the affected functions. For C++11 or higher, we should be able to rely on the `std::numeric_limits` versions anyway, since the methods themselves are now `constexpr`, so should be supported on device (clang/hipcc natively, nvcc with `--expt-relaxed-constexpr`).
*	Fixed performance issues for complex VSX and P10 MMA in gebp_kernel (level 3).	Chip Kerchner	2021-03-25
\|
*	Eliminate mixingtypes_7 warning.	Antonio Sanchez	2021-03-24
\| \| \| \| \| \|	`g_called` is not used in subtest 7, so was generating a `-Wunneeded-internal-declaration` warning. Here we silence it by initializing the static variable.
*	Split test commainitializer into two subtests	David Tellenbach	2021-03-18
\|
*	Use singleton pattern for static registered tests.	Antonio Sanchez	2021-03-18
\| \| \| \| \| \| \| \| \| \|	The original fails with nvcc+msvc - there's a static order of initialization issue leading to registered tests being cleared. The test then fails on ``` VERIFY(EigenTest::all().size()>0); ``` since `EigenTest` no longer contains any tests. The singleton pattern fixes this.
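The singleton approach mentioned above can be sketched as follows. This is a minimal illustration, not the code from Eigen's test harness: `TestRegistrySketch` and `RegistrarSketch` are hypothetical names, and test names are stored as plain strings.

```cpp
#include <string>
#include <vector>

// The registry lives in a function-local static, so it is constructed on first
// use. Registrations that run during static initialization therefore always
// see a fully constructed container, whatever the initialization order is.
class TestRegistrySketch {
 public:
  static std::vector<std::string>& all() {
    static std::vector<std::string> tests;
    return tests;
  }
};

// Registers a test name from its constructor, i.e. during static initialization.
struct RegistrarSketch {
  explicit RegistrarSketch(const char* name) {
    TestRegistrySketch::all().push_back(name);
  }
};

static RegistrarSketch register_first("first_test");
static RegistrarSketch register_second("second_test");
```

With a plain static container instead, a registrar in another translation unit could run before the container's constructor, and the later construction would discard what had been registered.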
*	Augment NumTraits with min/max_exponent() again.	Antonio Sanchez	2021-03-16
\| \| \| \| \| \| \| \| \| \| \| \|	Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase where possible. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. The previous MR !443 failed for c++03 due to lack of `constexpr`. Because of this, we need to keep around the `std::numeric_limits` version in enum expressions until the switch to c++11. Fixes #2148
*	Revert "Augment NumTraits with min/max_exponent()."	David Tellenbach	2021-03-17
\| \| \| \|	This reverts commit 75ce9cd2a7aefaaea8543e2db14ce4dc149eeb03.
*	Augment NumTraits with min/max_exponent().	Antonio Sanchez	2021-03-17
\| \| \| \| \| \| \| \|	Replace usage of `std::numeric_limits<...>::min/max_exponent` in codebase. Also replaced some other `numeric_limits` usages in affected tests with the `NumTraits` equivalent. Fixes #2148
*	Add NaN propagation options to minCoeff/maxCoeff visitors.	Rasmus Munk Larsen	2021-03-16
\|
*	Add fmod(half, half).	Antonio Sanchez	2021-03-15
\| \| \| \|	This is to support TensorFlow's `tf.math.floormod` for half.
*	Fix NVCC+ICC issues.	Antonio Sanchez	2021-03-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	NVCC does not understand `__forceinline`, so we need to use `inline` when compiling for GPU. ICC specializes `std::complex` operators for `float` and `double` by default, which cannot be used on device and conflict with Eigen's workaround in CUDA/Complex.h. This can be prevented by defining `_OVERRIDE_COMPLEX_SPECIALIZATION_` before including `<complex>`. Added this define to the tests and to `Eigen/Core`, but this will not work if the user includes `<complex>` before `<Eigen/Core>`. ICC also seems to generate a duplicate `Map` symbol in `PlainObjectBase`: ``` error: "Map" has already been declared in the current scope static ConstMapType Map(const Scalar *data) ``` I tracked this down to `friend class Eigen::Map`. Putting the `friend` statements at the bottom of the class seems to resolve this issue. Fixes #2180
*	Add increment/decrement operators to Eigen::half.	Antonio Sanchez	2021-03-15
\| \| \| \| \|	This is for consistency with bfloat16, and to support initialization with `std::iota`.
*	Bump up rand histogram threshold.	Antonio Sanchez	2021-03-10
\| \| \| \| \| \| \|	The previous one sometimes fails for MSVC which has a poor random number generator. Fixes #2182
*	Re-implement move assignments.	Antonio Sanchez	2021-03-10
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original swap approach leads to potential undefined behavior (reading uninitialized memory) and results in unnecessary copying of data for static storage. Here we pass down the move assignment to the underlying storage. Static storage does a one-way copy, dynamic storage does a swap. Modified the tests to no longer read from the moved-from matrix/tensor, since that can lead to UB. Added a test to ensure we do not access uninitialized memory in a move. Fixes: #2119
*	Define EIGEN_CPLUSPLUS and replace most __cplusplus checks.	Antonio Sanchez	2021-03-05
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The macro `__cplusplus` is not defined correctly in MSVC unless building with the `/Zc:__cplusplus` flag. Instead, it defines `_MSVC_LANG` to the specified c++ standard version number. Here we introduce `EIGEN_CPLUSPLUS` which will contain the c++ version number both for MSVC and otherwise. This simplifies checks for supported features. Also replaced most instances of standard version checking via `__cplusplus` with the existing `EIGEN_COMP_CXXVER` macro for better clarity. Fixes: #2170
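The detection logic described above might look roughly like the sketch below. `SKETCH_CPLUSPLUS` and `sketch_has_cxx11` are hypothetical names used for illustration; Eigen's actual definition of `EIGEN_CPLUSPLUS` may differ in detail.

```cpp
// Prefer _MSVC_LANG when it exists, because MSVC keeps __cplusplus at an old
// value unless /Zc:__cplusplus is passed. Otherwise fall back to __cplusplus.
#if defined(_MSVC_LANG)
#define SKETCH_CPLUSPLUS _MSVC_LANG
#elif defined(__cplusplus)
#define SKETCH_CPLUSPLUS __cplusplus
#else
#define SKETCH_CPLUSPLUS 0
#endif

// Feature checks can then use a single macro on every compiler.
inline bool sketch_has_cxx11() { return SKETCH_CPLUSPLUS >= 201103L; }
```

Assuming the translation unit is built as C++11 or newer, `sketch_has_cxx11()` returns true on MSVC and non-MSVC compilers alike.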
*	Fix rint SSE/NEON again, using optimization barrier.	Antonio Sanchez	2021-03-05
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a new version of !423, which failed for MSVC. Defined `EIGEN_OPTIMIZATION_BARRIER(X)` that uses inline assembly to prevent operations involving `X` from crossing that barrier. Should work on most `GNUC` compatible compilers (MSVC doesn't seem to need this). This is a modified version adapted from what was used in `psincos_float` and tested on more platforms (see #1674, https://godbolt.org/z/73ezTG). Modified `rint` to use the barrier to prevent the add/subtract rounding trick from being optimized away. Also fixed an edge case for large inputs that get bumped up a power of two and end up rounding away more than just the fractional part. If we are over `2^digits` then just return the input. This edge case was missed in the test since the test was comparing approximate equality, which was still satisfied. Adding a strict equality option catches it.
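A scalar sketch of the add/subtract rounding trick, including the large-input guard, is shown below. It is an illustration only: `rint_sketch` is a hypothetical name, the threshold 2^23 assumes IEEE-754 single precision, and a `volatile` temporary stands in for the inline-assembly barrier (simpler to show portably, at the cost of the extra memory traffic noted in the earlier attempt).

```cpp
#include <cmath>

// Round to an integral value using the current rounding mode.
inline float rint_sketch(float a) {
  const float limit = 8388608.0f;  // 2^23: floats at or above this have no fractional bits
  const float abs_a = std::fabs(a);
  // Large inputs, infinities and NaN are returned unchanged. Without this
  // guard the addition could bump the value up a power of two and lose more
  // than the fractional part.
  if (!(abs_a < limit)) return a;
  // In [2^23, 2^24) the spacing between floats is 1, so the addition rounds
  // to an integer. volatile keeps the compiler from folding (x + c) - c to x.
  volatile float shifted = abs_a + limit;
  const float rounded = shifted - limit;
  return std::copysign(rounded, a);
}
```

Under the default round-to-nearest-even mode this gives `rint_sketch(2.5f) == 2.0f` and `rint_sketch(3.5f) == 4.0f`, which is why a strict equality comparison against the reference is a stronger test than approximate equality.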
*	Revert "Fix rint for SSE/NEON."	Antonio Sánchez	2021-03-03
\| \| \|	This reverts commit e72dfeb8b9fa5662831b5d0bb9d132521f9173dd
*	Fix rint for SSE/NEON.	Antonio Sanchez	2021-03-03
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	It seems sometimes with aggressive optimizations the combination `psub(padd(a, b), b)` trick to force rounding is compiled away. Here we replace with inline assembly to prevent this (I tried `volatile`, but that leads to additional loads from memory). Also fixed an edge case for large inputs `a` where adding `b` bumps the value up a power of two and ends up rounding away more than just the fractional part. If we are over `2^digits` then just return the input. This edge case was missed in the test since the test was comparing approximate equality, which was still satisfied. Adding a strict equality option catches it.
*	geo_alignedbox_5 was failing with AVX enabled, due to storing `Vector4d` in ↵	Christoph Hertzberg	2021-03-01
\| \| \| \| \| \| \|	a `std::vector` without using an aligned allocator. Got rid of using `std::vector` and simplified the code. Avoid leading `_`
*	Add print for SSE/NEON, use NEON rounding intrinsics if available.	Antonio Sanchez	2021-02-27
\| \| \| \| \| \| \| \| \| \|	In SSE, by adding/subtracting 2^MantissaBits, we force rounding according to the current rounding mode. For NEON, we use the provided intrinsics for rint/floor/ceil if available (armv8). Related to #1969.
*	Make half/bfloat16 constructor take inputs by value, fix powerpc test.	Antonio Sanchez	2021-02-27
\| \| \| \| \| \| \| \| \| \| \| \|	Since `numeric_limits<half>::max_exponent` is a static inline constant, it cannot be directly passed by reference. This triggers a linker error in recent versions of `g++-powerpc64le`. Changing `half` to take inputs by value fixes this. Wrapping `max_exponent` with `int(...)` to make an addressable integer also fixes this and may help with other custom `Scalar` types down the road. Also eliminated some compile warnings for powerpc.
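The `int(...)` workaround can be illustrated with the sketch below. `LimitsSketch`, `pass_by_ref` and `exponent_via_copy` are hypothetical names, and the value 16 is only a placeholder; this is not Eigen's `numeric_limits<half>`.

```cpp
// Stand-in for a traits class with an in-class static constant that has no
// out-of-class definition.
struct LimitsSketch {
  static const int max_exponent = 16;
};

// Takes its argument by const reference, as many generic helpers do.
inline int pass_by_ref(const int& value) { return value; }

// int(...) produces a temporary copy of the constant, so the reference binds
// to that temporary rather than to the static member itself. The member is
// only read as a constant and needs no definition at link time.
inline int exponent_via_copy() {
  return pass_by_ref(int(LimitsSketch::max_exponent));
}
```

Passing `LimitsSketch::max_exponent` directly to `pass_by_ref` would bind the reference to the member itself, which before C++17 requires an out-of-class definition and can otherwise fail at link time.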
*	Fixed/masked more implicit copy constructor warnings	Christoph Hertzberg	2021-02-27
\| \| \| \|	(cherry picked from commit 2883e91ce5a99c391fbf28e20160176b70854992)
*	Fix NEON sqrt for 32-bit, add prsqrt.	Antonio Sanchez	2021-02-26
\| \| \| \| \| \| \| \| \| \| \| \|	With !406, we accidentally broke arm 32-bit NEON builds, since `vsqrt_f32` is only available for 64-bit. Here we add back the `rsqrt` implementation for 32-bit, relying on a `prsqrt` implementation with better handling of edge cases. Note that several of the 32-bit NEON packet tests are currently failing - either due to denormal handling (NEON versions flush to zero, but scalar paths don't) or due to accuracy (e.g. sin/cos).
*	Merge branch 'rmlarsen1/eigen-nan_prop'	Rasmus Munk Larsen	2021-02-26
\|\
* \|	Fix floor/ceil for NEON fp16.	Antonio Sanchez	2021-02-25
\| \| \| \| \| \| \| \|	Forgot to test this. Fixes bug introduced in !416.
* \|	Fix SSE/NEON pfloor/pceil for saturated values.	Antonio Sanchez	2021-02-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The original will saturate if the input does not fit into an integer type. Here we fix this, returning the input if it doesn't have enough precision to have a fractional part. Also added `pceil` for NEON. Fixes #1969.
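The saturation fix can be sketched in scalar form as follows. This is an illustration under stated assumptions rather than the packet code: `floor_sketch` is a hypothetical name, and the 2^23 threshold assumes IEEE-754 single precision.

```cpp
#include <cmath>

// floor() implemented via an integer round trip, with a guard for inputs that
// do not fit the integer conversion.
inline float floor_sketch(float a) {
  const float limit = 8388608.0f;  // 2^23: at or above this, floats have no fractional part
  // Inputs this large (and NaN or infinities) are returned as-is instead of
  // being pushed through an integer conversion that would saturate.
  if (!(std::fabs(a) < limit)) return a;
  // Safe conversion: |a| < 2^23 always fits in a 32-bit int. Truncates toward zero.
  const float truncated = static_cast<float>(static_cast<int>(a));
  // Truncation rounds negative non-integers up, so step down by one in that case.
  return truncated > a ? truncated - 1.0f : truncated;
}
```

A ceiling function follows the same pattern, stepping up by one when the truncated value is below the input.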
\| *	Make it possible to specify NaN propagation strategy for maxCoeff/minCoeff ↵	Rasmus Munk Larsen	2021-02-25
\|/ \| \| \|	reductions.
*	Disable new/delete test for HIP	Antonio Sanchez	2021-02-25
\|
*	Fix CUDA device new and delete, and add test.	Antonio Sanchez	2021-02-24
\| \| \| \|	HIP does not support new/delete on device, so test is skipped.
*	Eliminate CMake FindPackageHandleStandardArgs warnings.	Antonio Sanchez	2021-02-24
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	CMake complains that the package name does not match when the case differs, e.g.: ``` CMake Warning (dev) at /usr/share/cmake-3.18/Modules/FindPackageHandleStandardArgs.cmake:273 (message): The package name passed to `find_package_handle_standard_args` (UMFPACK) does not match the name of the calling package (Umfpack). This can lead to problems in calling code that expects `find_package` result variables (e.g., `_FOUND`) to follow a certain pattern. Call Stack (most recent call first): cmake/FindUmfpack.cmake:50 (find_package_handle_standard_args) bench/spbench/CMakeLists.txt:24 (find_package) This warning is for project developers. Use -Wno-dev to suppress it. ``` Here we rename the libraries to match their true cases.
*	Fixed sparse conservativeResize() when both num cols and rows decreased.	Adam Shapiro	2021-02-23
\| \| \| \| \|	The previous implementation caused a buffer overflow trying to calculate non-zero counts for columns that no longer exist.
*	Missing change regarding #1910	Christoph Hertzberg	2021-02-19
\|