eigen - C++ library for linear algebra

	Commit message	Author	Age
...
*	Add EIGEN_UNUSED_VARIABLE to unused variable in Memory.h	Rasmus Munk Larsen	2020-09-15
\|
*	Fix bfloat16 round on gcc 4.8	Pedro Caldeira	2020-09-14
\|
*	Fix issue #1968. Don't discard return value from "new" in C++17.	Rasmus Munk Larsen	2020-09-13
\|
*	Unified sse pldexp_double api	Guoqiang QI	2020-09-12
\|
*	Make blueNorm threadsafe if C++11 atomics are available.	Rasmus Munk Larsen	2020-09-12
\|
*	New CI infrastructure, including AArch64 runners	David Tellenbach	2020-09-11
\|
*	Fix half_impl::float_to_half_rtne(float) warning: '<<' causes overflow	Niels Dekker	2020-09-10
\| \| \| \| \| \|	Fixed Visual Studio 2019 Code Analysis (C++ Core Guidelines) warning C26450 from inside `half_impl::float_to_half_rtne(float)`: > Arithmetic overflow: '<<' operation causes overflow at compile time.
*	Add missing functions for Packet8bf in Altivec architecture.	Pedro Caldeira	2020-09-08
\| \| \| \| \|	Including new tests for bfloat16 Packets. Fix prsqrt on GenericPacketMath.
*	Add Neon psqrt<Packet2d> and pexp<Packet2d>	Guoqiang QI	2020-09-08
\|
*	remove semi triggering -Wextra-semi-stmt	Alexander Neumann	2020-09-07
\|
*	Add Inverse_NEON.h	Stephen Zheng	2020-09-04
\|	Implemented fast size-4 matrix inverse (mimicking Inverse_SSE.h) using NEON intrinsics.
\|
\|	```
\|	Benchmark      Time      CPU  Time Old  Time New  CPU Old  CPU New
\|	------------------------------------------------------------------
\|	BM_float    -0.1285  -0.1275       568       495      572      499
\|	BM_double   -0.2265  -0.2254       638       494      641      496
\|	```
*	MatrixProduct enhancements:	Everton Constantino	2020-09-02
\|	- Changes to Altivec/MatrixProduct
\|	  Adapting code to gcc 10.
\|	  Generic code style and performance enhancements.
\|	  Adding PanelMode support.
\|	  Adding stride/offset support.
\|	  Enabling float64, std::complex and std::complex.
\|	  Fixing lack of symm_pack.
\|	  Enabling mixedtypes.
\|	- Adding std::complex tests to blasutil.
\|	- Adding an implementation of storePacketBlock when Incr != 1.
*	Changing u/int8_t to un/signed char because clang does not understand	Everton Constantino	2020-09-02
\| \| \| \| \| \|	it. Implementing pcmp_eq to Packet8 and Packet16.
*	fix #1901: warning in Mode==(Upper\|Lower)	Gael Guennebaud	2020-09-02
\|
*	BUG: cmake_minimum_required must be the first command	Hans Johnson	2020-08-28
\| \| \| \| \| \| \| \| \|	https://cmake.org/cmake/help/v3.5/command/project.html Note: Call the cmake_minimum_required() command at the beginning of the top-level CMakeLists.txt file even before calling the project() command. It is important to establish version and policy settings before invoking other commands whose behavior they may affect. See also policy CMP0000.
*	Change Packet8s and Packet8us to use vector commands on Power for pmadd, ↵	Chip Kerchner	2020-08-28
\| \| \| \|	pmul and psub.
*	Fix #1974: assertion when reserving an empty sparse matrix	Gael Guennebaud	2020-08-26
\|
*	add psqrt ops support packet2f/packet4f for NEON	Guoqiang QI	2020-08-21
\|
*	adding attributes to constructors to support hip-clang on ROCm 3.5	Georg Jäger	2020-08-20
\|
*	Fixing a CUDA / P100 regression introduced by PR 181	Deven Desai	2020-08-20
\| \| \| \| \| \|	PR 181 ( https://gitlab.com/libeigen/eigen/-/merge_requests/181 ) adds the `__launch_bounds__(1024)` attribute to GPU kernels that did not have that attribute explicitly specified. That PR seems to cause regressions on the CUDA platform. This commit makes the changes from PR 181 apply to HIP only.
*	Fix nightly CI configuration	David Tellenbach	2020-08-19
\|
*	Add possibility to split test suite build targets and improved CI configuration	David Tellenbach	2020-08-19
\|	- Introduce CMake option `EIGEN_SPLIT_TESTSUITE` that allows dividing the single test build target into several subtargets
\|	- Add CI pipeline for merge requests that can be run by GitLab's shared runners
\|	- Add nightly CI pipeline
*	Add missing inline keyword in Quaternion.h.	Rasmus Munk Larsen	2020-08-14
\|
*	Disable min/max NaN propagation in test cxx11_tensor_expr	David Tellenbach	2020-08-14
\| \| \| \| \| \| \|	The current pmin/pmax implementation for Arm Neon propagates NaNs differently than std::min/std::max. See issue https://gitlab.com/libeigen/eigen/-/issues/1937
*	Fix compilation error in blasutil test	David Tellenbach	2020-08-14
\|
*	Replace the call to int64_t in the blasutil test by explicit types	David Tellenbach	2020-08-14
\| \| \| \| \| \| \| \| \|	Some platforms define int64_t to be long long even for C++03. If this is the case we miss the definition of internal::make_unsigned for this type. If we just define the template we get duplicate-definition errors on platforms defining int64_t as signed long for C++03. We need to find a way to distinguish both cases at compile-time.
*	bfloat16 packetmath for Arm Neon backend	David Tellenbach	2020-08-13
\|
*	Add support for Bfloat16 to use vector instructions on Altivec	Pedro Caldeira	2020-08-10
\| \| \| \|	architecture
*	Adding an explicit launch_bounds(1024) attribute for GPU kernels.	Deven Desai	2020-08-05
\| \| \| \| \| \| \| \| \| \|	Starting with ROCm 3.5, the HIP compiler will change from HCC to hip-clang. This compiler change introduces a change in the default value of the `__launch_bounds__` attribute associated with a GPU kernel. (The default value is the value the compiler assumes for the `__launch_bounds__` attribute when the user does not specify it explicitly.) Currently (i.e. for HIP with ROCm 3.3 and older), the default value is 1024. That changes to 256 with ROCm 3.5 (i.e. the hip-clang compiler). As a consequence of this change, if a GPU kernel with a `__launch_bounds__` attribute of 256 is launched at runtime with a threads_per_block value > 256, it leads to a runtime error. This is causing a couple of Eigen unit test failures with ROCm 3.5. This commit adds an explicit `__launch_bounds__(1024)` attribute to every GPU kernel that currently does not have it explicitly specified (and hence would end up with the default value of 256 after the change to hip-clang).
*	Temporarily turn off the NEON implementation of pfloor as it does not work ↵	Zachary Garrett	2020-08-04
\| \| \| \| \| \|	for large values. The NEON implementation mimics the SSE implementation, but did not mention the caveat that, due to the unsigned or signed integer conversions, not all values representable in the original floating-point type are supported.
*	Disable CI buildstage again	David Tellenbach	2020-08-03
\|
*	add a banner to advertise the survey	Gael Guennebaud	2020-07-29
\|
*	Fix StlDeque for GCC 10	David Tellenbach	2020-07-29
\| \| \| \| \|	StlDeque extends std::deque by accessing some of its internal members. Since GCC 10 these are not accessible anymore.
*	Fix undefined BF16 union behavior in AVX512.	Teng Lu	2020-07-29
\|
*	Inherit alignment trait from argument in TensorBroadcasting to avoid ↵	Rasmus Munk Larsen	2020-07-28
\| \| \| \|	segfault when the argument is unaligned.
*	Fix clang-tidy warnings in generic bfloat16 implementation	David Tellenbach	2020-07-27
\| \| \| \|	See !172 for related discussions.
*	Fix CMake install command	qxxxb	2020-07-25
\|
*	Don't allow failure for CI build stage anymore	David Tellenbach	2020-07-24
\|
*	Improve CI configuration	David Tellenbach	2020-07-24
\|	- Fix docker Fedora image to Fedora:31
\|	- Fix gcc version to gcc-9.2.1
\|	- Use GitLab CI dag
\|	- Fix usage of build cache
\|	- Introduce build artifacts
*	Add missing footer declaration	Gael Guennebaud	2020-07-24
\|
*	Fix bfloat16 casts	David Tellenbach	2020-07-23
\| \| \| \| \| \| \|	If we have explicit conversion operators available (C++11) we define explicit casts from bfloat16 to other types. If not (C++03), we don't define conversion operators but rely on implicit conversion chains from bfloat16 over float to other types.
*	remove piwik tracker	Gael Guennebaud	2020-07-23
\|
*	Revert change that made conversion from bfloat16 to {float, double} implicit.	Rasmus Munk Larsen	2020-07-22
\| \| \| \|	Add roundtrip tests for casting between bfloat16 and complex types.
*	Fix cast of bfloat16 to std::complex<T>	David Tellenbach	2020-07-22
\| \| \| \|	This fixes https://gitlab.com/libeigen/eigen/-/issues/1951
*	Make sure we take the little-endian path if __BYTE_ORDER__ is not defined.	Rasmus Munk Larsen	2020-07-22
\|
*	Faster conversion from integer types to bfloat16	Niels Dekker	2020-07-22
\| \| \| \| \| \|	Specialized `bfloat16_impl::float_to_bfloat16_rtne(float)` for normal floating point numbers, infinity and zero, in order to improve the performance of `bfloat16::bfloat16(const T&)` for integer argument types. A reduction of more than 20% of the runtime duration of conversion from int to bfloat16 was observed, using Visual C++ 2019 on Windows 10.
*	Avoid division by zero in nonZerosEstimate() for empty blocks.	Rasmus Munk Larsen	2020-07-22
\|
*	Update tensor reduction test to avoid undefined division of bfloat16 by int.	Rasmus Munk Larsen	2020-07-22
\|
*	Make numext::as_uint a device function.	Rasmus Munk Larsen	2020-07-22
\|
*	user-defined copy operations removed in favor of compiler-generated ones	Alexander Turkin	2020-07-20
\|