eigen - C++ library for linear algebra

	Commit message (Collapse)	Author	Age
*	Replace a static assert by a runtime one, fixes the build of unit tests on ARM	Benoit Jacob	2015-02-27
\| \| \| \| \|	Also safely assert in the non-implemented path that should never be taken in practice, and would return wrong results.
*	Avoid packing rhs multiple times when blocking on the lhs only.	Gael Guennebaud	2015-02-26
\|
*	Make sure that the block size computation is tested by our unit test.	Gael Guennebaud	2015-02-26
\|
*	Implement a more generic blocking-size selection algorithm. See explanations ↵	Gael Guennebaud	2015-02-26
\| \| \| \| \| \| \|	inline. It performs extremely well on Haswell. The main issue is to reliably and quickly find the actual cache size to be used for our 2nd level of blocking, that is: max(l2,l3/nb_core_sharing_l3)
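The cache size named in this message for the 2nd level of blocking can be written out directly. This is a minimal sketch, assuming the L2 size, the L3 size, and the number of cores sharing L3 are already known; the function name `second_level_cache_bytes` and its parameters are hypothetical and not part of Eigen, and the hard part mentioned above (finding those values reliably) is not addressed here.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Hypothetical helper (not Eigen API): cache budget for the 2nd level of
// blocking, computed as max(l2, l3 / nb_core_sharing_l3).
// All sizes are in bytes; nb_core_sharing_l3 must be at least 1.
std::size_t second_level_cache_bytes(std::size_t l2,
                                     std::size_t l3,
                                     std::size_t nb_core_sharing_l3) {
    assert(nb_core_sharing_l3 >= 1);
    return std::max(l2, l3 / nb_core_sharing_l3);
}
```

With a 256 KiB L2 and an 8 MiB L3 shared by 4 cores, the per-core L3 share (2 MiB) is the larger value, so it would be the one selected.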
*	Fix typos in block-size testing code, and set peeling on k to 8.	Gael Guennebaud	2015-02-26
\|
*	So I extensively measured the impact of the offset in this prefetch. I tried ↵	Benoit Jacob	2015-02-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \|	offset values from 0 to 128 (on this float* pointer, so implicitly times 4 bytes). On x86, I tested a Sandy Bridge with AVX with 12M cache and a Haswell with AVX+FMA with 6M cache on MatrixXf sizes up to 2400. I could not see any significant impact of this offset. On Nexus 5, the offset has a slight effect: values around 32 (times sizeof float) are worst. Anything else is the same: the current 64 (8*pk), or... 0. So let's just go with 0! Note that we needed a fix anyway for not accounting for the value of RhsProgress. 0 nicely avoids the issue altogether!
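The "implicitly times 4 bytes" remark follows from C++ pointer arithmetic: an offset added to a `float*` is counted in elements, not bytes. A minimal sketch that measures this; the function name `byte_offset_on_float_ptr` and the local buffer are illustrative assumptions, not code from the gebp kernel.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper: returns how many bytes separate p and p + element_offset
// when p is a float* pointer. Valid for element offsets 0..128, the range
// that the commit message reports testing.
std::size_t byte_offset_on_float_ptr(std::ptrdiff_t element_offset) {
    static const float buffer[129] = {};  // indices 0..128 stay in bounds
    assert(element_offset >= 0 && element_offset <= 128);
    const float* p = buffer;
    const char* from = reinterpret_cast<const char*>(p);
    const char* to = reinterpret_cast<const char*>(p + element_offset);
    return static_cast<std::size_t>(to - from);
}
```

An element offset of 64 therefore lands `64 * sizeof(float)` bytes ahead, which is why multiplying the offset by the scalar size a second time overshoots the intended address.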
*	bug #970: Add EIGEN_DEVICE_FUNC to RValue functions, in case Cuda supports ↵	Christoph Hertzberg	2015-02-24
\| \| \| \|	RValue-references.
*	Fix my recent prefetch changes:	Benoit Jacob	2015-02-23
\| \| \| \| \| \| \| \| \| \| \|	- the first prefetch is actually harmful on Haswell with FMA, but it is the most beneficial on ARM. - the second prefetch... I was very stupid and multiplied by sizeof(scalar) an offset on a scalar* pointer. The old offset was 64; pk = 8, so 64=pk*8. So this effectively restores the older offset. Actually, there were two prefetches here, one with offset 48 and one with offset 64. I could not confirm any benefit from this strange 48 offset on either the Haswell or my ARM device.
*	Fix two trivial warnings	Christoph Hertzberg	2015-02-22
\|
*	log1p is defined only for real Scalars in C++11	Christoph Hertzberg	2015-02-21
\|
*	Fix compilation of unit tests disabling assertion checking	Gael Guennebaud	2015-02-21
\|
*	Fix doc of Ref<>	Gael Guennebaud	2015-02-20
\|
*	In C++11 destructors do not throw by default (fix CommaInitializer unit test)	Gael Guennebaud	2015-02-20
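The language rule behind this entry can be checked at compile time. This is a minimal sketch with two hypothetical types; it illustrates the C++11 default only, and whether the unit test fix itself used `noexcept(false)` is not stated in the log.

```cpp
#include <type_traits>

// Hypothetical type: a user-provided destructor with no exception
// specification is implicitly noexcept(true) in C++11 and later.
struct DefaultDtor {
    ~DefaultDtor() {}
};

// Hypothetical type: a destructor must opt in explicitly to be allowed to throw.
struct ThrowingDtor {
    ~ThrowingDtor() noexcept(false) {}
};

static_assert(std::is_nothrow_destructible<DefaultDtor>::value,
              "destructors are noexcept by default in C++11");
static_assert(!std::is_nothrow_destructible<ThrowingDtor>::value,
              "noexcept(false) permits a throwing destructor");
```

An exception escaping a destructor that is implicitly `noexcept` calls `std::terminate`, so tests that rely on a throwing destructor need the explicit opt-in.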
\|
*	Pulled latest changes from trunk	Benoit Steiner	2015-02-19
\|\
* \|	Marked the CUDA packet primitives as EIGEN_DEVICE_FUNC since they'll end up ↵	Benoit Steiner	2015-02-19
\| \| \| \| \| \| \| \|	being executed on the GPU device.
\| *	Fix regression with C++11 support of lambda: now internal::result_of falls ↵	Gael Guennebaud	2015-02-19
\| \| \| \| \| \| \| \|	back to std::result_of in C++11.
\| *	Fix some calls to result_of on binary functors as unary ones.	Gael Guennebaud	2015-02-19
\| \|
\| *	Declare const some const variables	Gael Guennebaud	2015-02-19
\|/
*	Add support for C++11 result_of/lambdas	Gael Guennebaud	2015-02-19
\|
*	rotating kernel: avoid compiling anything outside of ARM	Benoit Jacob	2015-02-18
\|
*	remove a newly introduced redundant typedef - sorry.	Benoit Jacob	2015-02-18
\|
*	bug #955 - Implement a rotating kernel alternative in the 3px4 gebp path	Benoit Jacob	2015-02-18
\| \| \| \| \| \| \| \|	This is substantially faster on ARM, where it's important to minimize the number of loads. This is specific to the case where all packet types are of size 4. I made my best attempt to minimize how dirty this is... opinions welcome. Eventually one could have a generic rotated kernel, but it would take some work to get there. Also, on Sandy Bridge, in my experience, it's not beneficial (even about 1% slower).
*	Fixed template parameter.	Hauke Heibel	2015-02-18
\|
*	merge	Gael Guennebaud	2015-02-18
\|\
* \|	Clean a bit computeProductBlockingSizes (use Index type, remove CEIL macro)	Gael Guennebaud	2015-02-18
\| \|
\| *	bug #958 - Allow testing specific blocking sizes	Benoit Jacob	2015-02-18
\|/ \| \| \| \| \| \| \| \| \| \| \| \| \|	This is only a debugging/testing patch. It allows testing specific product blocking sizes, typically to study the impact on performance. Example usage:
	int testk, testm, testn;
	#define EIGEN_TEST_SPECIFIC_BLOCKING_SIZES
	#define EIGEN_TEST_SPECIFIC_BLOCKING_SIZE_K testk
	#define EIGEN_TEST_SPECIFIC_BLOCKING_SIZE_M testm
	#define EIGEN_TEST_SPECIFIC_BLOCKING_SIZE_N testn
	#include <Eigen/Core>
*	Fix a regression when using OpenMP, and fix bug #714: the number of threads ↵	Gael Guennebaud	2015-02-18
\| \| \| \|	might be lower than the number of requested ones
*	Fix bug #945: workaround MSVC warning	Gael Guennebaud	2015-02-18
\|
*	Add missing install directives for arch/CUDA	Gael Guennebaud	2015-02-18
\|
*	Add an internal assertion in makeCompressed to catch a possible risk of ↵	Gael Guennebaud	2015-02-18
\| \| \| \|	null-pointer access.
*	Remove some dead stores.	Gael Guennebaud	2015-02-18
\|
*	Fix possible usage of a null pointer in CholmodSupport	Gael Guennebaud	2015-02-18
\|
*	Bug 957, workaround MSVC/ICC compilation issue	Gael Guennebaud	2015-02-18
\|
*	Packet must be passed by const reference and not by value to avoid alignment ↵	Gael Guennebaud	2015-02-17
\| \| \| \|	issue.
*	Suppress some remaining Index conversion warnings	Christoph Hertzberg	2015-02-17
\|
*	Disable __m128* wrappers when compiling with AVX and -fabi-version=4	Gael Guennebaud	2015-02-17
\|
*	Fix compilation with GCC/AVX (workaround __m128 and __m256 being the same ↵	Gael Guennebaud	2015-02-17
\| \| \| \|	type with default ABI)
*	Fix compilation of Cholmod*(matrix) ctor	Gael Guennebaud	2015-02-17
\|
*	Fix compilation of int*complex with gcc	Gael Guennebaud	2015-02-16
\|
*	Fix SparseLU::signDeterminant() method, and add a SparseLU::determinant() ↵	Gael Guennebaud	2015-02-16
\| \| \| \|	method.
*	Add PermutationMatrix::determinant method.	Gael Guennebaud	2015-02-16
\|
*	bug #956: Fixed bug in move constructors of DenseStorage which caused ↵	Martin Drozdik	2015-02-16
\| \| \| \|	"moved-from" objects to be in an invalid state.
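A move constructor leaves its source in a valid state when it resets the source after taking its resources. A minimal sketch of that pattern; the class `Storage` is a hypothetical stand-in and is not Eigen's DenseStorage, whose actual fix is not shown in this log.

```cpp
#include <cstddef>
#include <utility>

// Hypothetical heap-backed storage class illustrating a safe move constructor.
class Storage {
public:
    explicit Storage(std::size_t n)
        : data_(n ? new double[n]() : nullptr), size_(n) {}

    // Steal the buffer, then reset the source so it is empty but still valid:
    // its destructor and size() remain safe to call.
    Storage(Storage&& other) noexcept
        : data_(other.data_), size_(other.size_) {
        other.data_ = nullptr;
        other.size_ = 0;
    }

    Storage(const Storage&) = delete;
    Storage& operator=(const Storage&) = delete;

    ~Storage() { delete[] data_; }  // delete[] on nullptr is a no-op

    std::size_t size() const { return size_; }
    const double* data() const { return data_; }

private:
    double* data_;
    std::size_t size_;
};
```

After `Storage b(std::move(a));`, `a` reports size 0 and holds a null pointer, so the pointer and the size stay consistent and destroying `a` does not free the buffer now owned by `b`.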
*	Fix unused variable warning.	Gael Guennebaud	2015-02-16
\|
*	bug #897: fix regression in BiCGSTAB(mat) ctor (and all other iterative solvers).	Gael Guennebaud	2015-02-16
\| \| \| \|	Add respective regression unit test.
*	Remove some useless typedefs	Gael Guennebaud	2015-02-16
\|
*	Doc: explain how to free allocated memory in SparseMatrix	Gael Guennebaud	2015-02-16
\|
*	Merged in chtz/eigen-indexconversion (pull request PR-92)	Gael Guennebaud	2015-02-16
\|\ \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	bug #877, bug #572: Get rid of Index conversion warnings, summary of changes: - Introduce a global typedef Eigen::Index making Eigen::DenseIndex and AnyExpr<>::Index deprecated (default is std::ptrdiff_t). - Eigen::Index is used throughout the API to represent indices, offsets, and sizes. - Classes storing an array of indices use the type StorageIndex to store them. This is a template parameter of the class. Default is int. - Methods that explicitly set or return an element of such an array take or return a StorageIndex type. In all other cases, the Index type is used.
\| *	The usage of DenseIndex is deprecated, so let's replace DenseIndex by Index	Gael Guennebaud	2015-02-16
\| \|
\| *	Remove deprecated usage of expr::Index.	Gael Guennebaud	2015-02-16
\| \|
\| *	Fix many long to int conversion warnings:	Gael Guennebaud	2015-02-16
\| \| \| \| \| \| \| \| \| \| \| \|	- fix usage of Index (API) versus StorageIndex (when multiple indices are stored) - use StorageIndex(val) when the input has already been checked - use internal::convert_index<StorageIndex>(val) when val is potentially unsafe (directly comes from user input)
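The distinction between a plain cast for already-checked values and a checked conversion for user input can be sketched as follows. The function `checked_index_cast` is hypothetical and is not Eigen's `internal::convert_index`; throwing on overflow is an assumption made for illustration, and the log does not say how Eigen reports an out-of-range value.

```cpp
#include <cstddef>
#include <limits>
#include <stdexcept>

// Hypothetical checked narrowing conversion from a wide API index
// (std::ptrdiff_t, the stated default for Eigen::Index) to a narrower
// StorageIndex. Assumes StorageIndex is a signed integer type that is
// no wider than std::ptrdiff_t.
template <typename StorageIndex>
StorageIndex checked_index_cast(std::ptrdiff_t val) {
    const std::ptrdiff_t lo = std::numeric_limits<StorageIndex>::min();
    const std::ptrdiff_t hi = std::numeric_limits<StorageIndex>::max();
    if (val < lo || val > hi) {
        throw std::out_of_range("index does not fit in StorageIndex");
    }
    return static_cast<StorageIndex>(val);
}
```

A value that has already been validated can use the plain `StorageIndex(val)` form described above and skip the range test.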