eigen - C++ library for linear algebra

	Commit message (Collapse)	Author	Age
...
* \|	Revert the specialization for scalar_logistic_op<float> introduced in:	Rasmus Munk Larsen	2019-12-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	https://bitbucket.org/eigen/eigen/commits/77b447c24e3344e43ff64eb932d4bb35a2db01ce While providing a 50% speedup on Haswell+ processors, the large relative error outside [-18, 18] in this approximation causes problems, e.g., when computing gradients of activation functions like softplus in neural networks.
* \|	Merged in ezhulenev/eigen-02 (pull request PR-767)	Rasmus Larsen	2019-12-02
\|\ \ \| \| \| \| \| \| \| \| \|	Fix shadow warnings in AlignedBox and SparseBlock
* \| \|	Fix for the HIP build+test errors.	Deven Desai	2019-12-02
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Recent changes have introduced the following build error when compiling with HIPCC --------- unsupported/test/../../Eigen/src/Core/GenericPacketMath.h:254:58: error: 'ldexp': no overloaded function has restriction specifiers that are compatible with the ambient context 'pldexp' --------- The fix for the error is to pick the math function(s) from the global namespace (where they are declared as device functions in the HIP header files) when compiling with HIPCC.
* \| \|	Merged in codeplaysoftware/eigen/SYCL-Backend (pull request PR-691)	Rasmus Larsen	2019-11-28
\|\ \ \ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	SYCL Backend Approved-by: Rasmus Larsen <rmlarsen@google.com>
\| * \| \|	[SYCL] Rebasing the SYCL support branch on top of the Einge upstream master ↵	Mehdi Goli	2019-11-28
\|/ / / \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	branch. * Unifying all loadLocalTile from lhs and rhs to an extract_block function. * Adding get_tensor operation which was missing in TensorContractionMapper. * Adding the -D method missing from cmake for Disable_Skinny Contraction operation. * Wrapping all the indices in TensorScanSycl into Scan parameter struct. * Fixing typo in Device SYCL * Unifying load to private register for tall/skinny no shared * Unifying load to vector tile for tensor-vector/vector-tensor operation * Removing all the LHS/RHS class for extracting data from global * Removing Outputfunction from TensorContractionSkinnyNoshared. * Combining the local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining General Tensor-Vector and VectorTensor contraction into one kernel. * Making double buffering optional for Tensor contraction when local memory is version is used. * Modifying benchmark to accept custom Reduction Sizes * Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host * Adding Test for SYCL * Modifying SYCL CMake
\| * /	Fix shadow warnings in AlignedBox and SparseBlock	Eugene Zhulenev	2019-11-27
\|/ /
* \|	Add missing EIGEN_DEVICE_FUNC attribute to template specializations for pexp ↵	Rasmus Munk Larsen	2019-11-27
\| \| \| \| \| \| \| \|	to fix GPU build.
* \|	Fix warning due to missing cast for exponent arguments for std::frexp and ↵	Rasmus Munk Larsen	2019-11-26
\| \| \| \| \| \| \| \|	std::lexp.
* \|	Merged in realjhol/eigen/fix-warnings (pull request PR-760)	Rasmus Larsen	2019-11-26
\|\ \ \| \| \| \| \| \| \| \| \|	Fix warnings
* \| \|	Add async evaluation support to TensorReverse	Eugene Zhulenev	2019-11-26
\| \| \|
* \| \|	Add async evaluation support to TensorPadding/TensorImagePatch/TensorShuffling	Eugene Zhulenev	2019-11-26
\| \| \|
\| * \|	SparseRef: Fixed alignment warning on ARM GCC	Joel Holdsworth	2019-11-07
\| \| \|
\| \| *	Update ConfigureVectorization.h to not optimize fp16 routines when compiling ↵	Anshul Jaiswal	2019-11-06
\| \| \| \| \| \| \| \| \| \| \| \|	with cuda.
\| * \|	test/packetmath: Silence alignment warnings	Joel Holdsworth	2019-11-05
\|/ /
* \|	Add an explicit example for auto and re-evaluation	Gael Guennebaud	2019-11-20
\| \|
* \|	COMP: Simplify install commands for Eigen	Hans Johnson	2019-11-17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Confirm that install directory is identical before and after this simplifying patch. ```bash hg clone <<Eigen>> mkdir eigen-bld cd eigen-bld cmake ../Eigen -DCMAKE_INSTALL_PREFIX:PATH=/tmp/bef make install find /tmp/pre_eigen_modernize >/tmp/bef # Apply this patch cmake ../Eigen -DCMAKE_INSTALL_PREFIX:PATH=/tmp/aft make install find /tmp/post_eigen_modernize \|sed 's/post_e/pre_e/g' >/tmp/aft diff /tmp/bef /tmp/aft ```
* \|	COMP: target_compile_definitions requires cmake 2.8.11	Hans Johnson	2019-11-17
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	Features committed in 2016 have required cmake verison 2.8.11. `sergiu Tue Nov 22 12:25:06 2016 +0100: target_compile_definitions` Set the minimum cmake version to the minimum version that is capable of compiling or installing the code base.
* \|	Fix duplicate symbol linking error.	Gael Guennebaud	2019-11-20
\| \|
* \|	STYLE: Convert CMake-language commands to lower case	Hans Johnson	2019-10-31
\| \| \| \| \| \| \| \| \| \|	Ancient CMake versions required upper-case commands. Later command names became case-insensitive. Now the preferred style is lower-case.
* \|	STYLE: Remove CMake-language block-end command arguments	Hans Johnson	2019-10-31
\| \| \| \| \| \| \| \| \| \| \| \|	Ancient versions of CMake required else(), endif(), and similar block termination commands to have arguments matching the command starting the block. This is no longer the preferred style.
* \|	1. Fix a bug in psqrt and make it return 0 for +inf arguments.	Rasmus Munk Larsen	2019-11-15
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	2. Simplify handling of special cases by taking advantage of the fact that the builtin vrsqrt approximation handles negative, zero and +inf arguments correctly. This speeds up the SSE and AVX implementations by ~20%. 3. Make the Newton-Raphson formula used for rsqrt more numerically robust: Before: y = y * (1.5 - x/2 * y^2) After: y = y * (1.5 - y * (x/2) * y) Forming y^2 can overflow for very large or very small (denormalized) values of x, while x*y ~= 1. For AVX512, this makes it possible to compute accurate results for denormal inputs down to ~1e-42 in single precision. 4. Add a faster double precision implementation for Knights Landing using the vrsqrt28 instruction and a single Newton-Raphson iteration. Benchmark results: https://bitbucket.org/snippets/rmlarsen/5LBq9o
* \|	bug #1744: fix compilation with MSVC 2017 and AVX512, plog1p/pexpm1 require ↵	Gael Guennebaud	2019-11-15
\| \| \| \| \| \| \| \|	plog/pexp, but the later was disabled on some compilers
* \|	bug #1747: one more fix for MSVC regarding the Bessel implementation.	Gael Guennebaud	2019-11-15
\| \|
* \|	bug #1281: fix AutoDiffScalar's make_coherent for nested expression of ↵	Gael Guennebaud	2019-11-14
\| \| \| \| \| \| \| \|	constant ADs.
* \|	Fix case issue with Lapack unit tests	Gael Guennebaud	2019-11-14
\| \|
* \|	bug #1774: fix VectorwiseOp::begin()/end() return types regarding constness.	Gael Guennebaud	2019-11-14
\| \|
* \|	PR 751: Fixed compilation issue when compiling using MSVC with /arch:AVX512 flag	Sakshi Goynar	2019-10-31
\| \|
* \|	Enable CompleteOrthogonalDecomposition::pseudoInverse with non-square ↵	Gael Guennebaud	2019-11-13
\| \| \| \| \| \| \| \|	fixed-size matrices.
* \|	Move to my.cdash.org	Gael Guennebaud	2019-11-13
\| \|
* \|	Remove legacy block evaluation support	Eugene Zhulenev	2019-11-12
\| \|
* \|	Disable AVX on broken xcode versions. See PR 748.	Gael Guennebaud	2019-11-12
\| \| \| \| \| \| \| \|	Patch adapted from Hans Johnson's PR 748.
* \|	Fix a race in async tensor evaluation: Don't run on_done() until after ↵	Rasmus Munk Larsen	2019-11-11
\| \| \| \| \| \| \| \|	device.deallocate() / evaluator.cleanup() complete, since the device might be destroyed after on_done() runs.
* \|	Break loop dependence in TensorGenerator block access	Eugene Zhulenev	2019-11-11
\| \|
* \|	Fix data race in css11_tensor_notification test.	Rasmus Munk Larsen	2019-11-08
\|/
*	Add EIGEN_HAS_INTRINSIC_INT128 macro	Rasmus Munk Larsen	2019-11-06
\| \| \| \|	Add a new EIGEN_HAS_INTRINSIC_INT128 macro, and use this instead of __SIZEOF_INT128__. This fixes related issues with TensorIntDiv.h when building with Clang for Windows, where support for 128-bit integer arithmetic is advertised but broken in practice.
*	Rollback or PR-746 and partial rollback of ↵	Rasmus Munk Larsen	2019-11-05
\| \| \| \| \| \| \| \|	https://bitbucket.org/eigen/eigen/commits/668ab3fc474e54c7919eda4fbaf11f3a99246494 . std::array is still not supported in CUDA device code on Windows.
*	Merged in ezhulenev/eigen-01 (pull request PR-746)	Rasmus Larsen	2019-11-04
\|\ \| \| \| \| \| \|	Remove internal::smart_copy and replace with std::copy
* \|	Cleanup includes in Tensor module after switch to C++11 and above	Eugene Zhulenev	2019-10-29
\| \|
\| *	Remove internal::smart_copy and replace with std::copy	Eugene Zhulenev	2019-10-29
\|/
*	Fix CXX11Meta compilation with MSVC	Eugene Zhulenev	2019-10-28
\|
*	Prevent potential ODR in TensorExecutor	Eugene Zhulenev	2019-10-28
\|
*	This PR fixes:	Mehdi Goli	2019-10-23
\| \| \| \| \|	* The specialization of array class in the different namespace for GCC<=6.4 * The implicit call to `std::array` constructor using the initializer list for GCC <=6.1
*	Merged in deven-amd/eigen-hip-fix-191018 (pull request PR-738)	Rasmus Larsen	2019-10-22
\|\ \| \| \| \| \| \|	Fix for the HIP build+test errors.
* \|	Add block evaluation V2 to TensorAsyncExecutor.	Rasmus Munk Larsen	2019-10-22
\| \| \| \| \| \| \| \|	Add async evaluation to a number of ops.
\| *	Fix for the HIP build+test errors.	Deven Desai	2019-10-22
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	The errors were introduced by this commit : After the above mentioned commit, some of the tests started failing with the following error ``` Built target cxx11_tensor_reduction Building HIPCC object unsupported/test/CMakeFiles/cxx11_tensor_reduction_gpu_5.dir/cxx11_tensor_reduction_gpu_5_generated_cxx11_tensor_reduction_gpu.cu.o In file included from /home/rocm-user/eigen/unsupported/test/cxx11_tensor_reduction_gpu.cu:16: In file included from /home/rocm-user/eigen/unsupported/Eigen/CXX11/Tensor:117: /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:155:5: error: the field type is not amp-compatible DestinationBufferKind m_kind; ^ /home/rocm-user/eigen/unsupported/Eigen/CXX11/src/Tensor/TensorBlockV2.h:211:3: error: the field type is not amp-compatible DestinationBuffer m_destination; ^ ``` For some reason HIPCC does not like device code to contain enum types which do not have the base-type explicitly declared. The fix is trivial, explicitly state "int" as the basetype
* \|	Drop support for c++03 in Eigen tensor. Get rid of some code used to emulate ↵	Rasmus Munk Larsen	2019-10-18
\|/ \| \| \|	c++11 functionality with older compilers.
*	Propagate block evaluation preference through rvalue tensor expressions	Eugene Zhulenev	2019-10-17
\|
*	Cleanup Tensor block destination and materialized block storage allocation	Eugene Zhulenev	2019-10-16
\|
*	TensorBroadcasting support for random/uniform blocks	Eugene Zhulenev	2019-10-16
\|
*	Block evaluation for TensorGenerator/TensorReverse/TensorShuffling	Eugene Zhulenev	2019-10-14
\|