path: root/unsupported
Commit message (Author, Date)
* Added more tests (Benoit Steiner, 2016-04-20)
|
* Started to implement a portable way to yield. (Benoit Steiner, 2016-04-19)
|
* Implemented a more portable version of thread local variables (Benoit Steiner, 2016-04-19)
|
* Fixed a few typos (Benoit Steiner, 2016-04-19)
|
* Fixed a compilation error with nvcc 7. (Benoit Steiner, 2016-04-19)
|
* Simplified the code that launches CUDA kernels. (Benoit Steiner, 2016-04-19)
|
* Don't take the address of a kernel on CUDA devices that don't support this feature. (Benoit Steiner, 2016-04-19)
|
* Use numext::ceil instead of std::ceil (Benoit Steiner, 2016-04-19)
|
* Avoid an unnecessary copy of the evaluator. (Benoit Steiner, 2016-04-19)
|
* Fixed 2 recent regression tests (Benoit Steiner, 2016-04-19)
|
* Use DenseIndex in the MeanReducer to avoid overflows when processing very large tensors. (Benoit Steiner, 2016-04-19)
|
* Worked around the lack of a rand_r function on Windows systems (Benoit Steiner, 2016-04-17)
|
* Worked around the lack of a rand_r function on Windows systems (Benoit Steiner, 2016-04-17)
|
* Move the evalGemm method into the TensorContractionEvaluatorBase class to make it accessible from both the single and multithreaded contraction evaluators. (Benoit Steiner, 2016-04-15)
|
* Deleted unnecessary variable (Benoit Steiner, 2016-04-15)
|
* Fixed a few compilation warnings (Benoit Steiner, 2016-04-15)
|
* Merged in rmlarsen/eigen (pull request PR-178): Eigen Tensor cost model part 2: Thread scheduling for standard evaluators and reductions. (Benoit Steiner, 2016-04-15)
|\
| * Get rid of void* casting when calling EvalRange::run. (Rasmus Munk Larsen, 2016-04-15)
| |
* | Fixed compilation errors with MSVC (Benoit Steiner, 2016-04-15)
| |
* | Added ability to access the cache sizes from the tensor devices (Benoit Steiner, 2016-04-14)
| |
* | Added support for exclusive or (Benoit Steiner, 2016-04-14)
| |
| * Eigen Tensor cost model part 2: Thread scheduling for standard evaluators and reductions. The cost model is turned off by default. (Rasmus Munk Larsen, 2016-04-14)
| |
* | Added missing definition of PacketSize in the GPU evaluator of convolution (Benoit Steiner, 2016-04-14)
| |
* | Merged in rmlarsen/eigen (pull request PR-177): Eigen Tensor cost model part 1. (Benoit Steiner, 2016-04-14)
|\|
* | Enabled the new threadpool tests (Benoit Steiner, 2016-04-14)
| |
* | Cleanup (Benoit Steiner, 2016-04-14)
| |
* | Prepared the migration to the new non-blocking thread pool (Benoit Steiner, 2016-04-14)
| |
| * Improvements to cost model. (Rasmus Munk Larsen, 2016-04-14)
| |
* | Added tests for the non-blocking thread pool (Benoit Steiner, 2016-04-14)
| |
* | Added a more scalable non-blocking thread pool (Benoit Steiner, 2016-04-14)
| |
| * Merge upstream updates. (Rasmus Munk Larsen, 2016-04-14)
| |\
| |/
|/|
| * Eigen cost model part 1. This implements a basic recursive framework to estimate the cost of evaluating tensor expressions. (Rasmus Munk Larsen, 2016-04-14)
| |
* | Silenced a compilation warning (Benoit Steiner, 2016-04-14)
| |
* | Added tests to validate flooring and ceiling of fp16 (Benoit Steiner, 2016-04-14)
| |
* | Added simple test for numext::sqrt and numext::pow on fp16 (Benoit Steiner, 2016-04-14)
| |
* | Added basic test for trigonometric functions on fp16 (Benoit Steiner, 2016-04-14)
| |
* | Added support for fp16 to the sigmoid function (Benoit Steiner, 2016-04-14)
| |
* | Made the test MSVC-friendly (Benoit Steiner, 2016-04-14)
|/
* Turned a convergence check into a warning (Gael Guennebaud, 2016-04-13)
|
* Fixed compilation warnings generated by clang (Benoit Steiner, 2016-04-12)
|
* Fixed the zeta test (Benoit Steiner, 2016-04-12)
|
* Defer the decision to vectorize tensor CUDA code to the meta kernel. This makes it possible to decide whether to vectorize depending on the capability of the target CUDA architecture. In particular, this enables us to vectorize the processing of fp16 when running on devices of capability >= 5.3. (Benoit Steiner, 2016-04-12)
|
* bug #1197: fix/relax some LM unit tests (Gael Guennebaud, 2016-04-09)
|
* bug #1160: fix and relax some LM unit tests by turning failures into warnings (Gael Guennebaud, 2016-04-09)
|
* Disabled the use of half2 on CUDA devices of compute capability < 5.3 (Benoit Steiner, 2016-04-08)
|
* Created the new EIGEN_TEST_CUDA_CLANG option to compile the CUDA tests using clang instead of nvcc (Benoit Steiner, 2016-04-08)
|
* Don't test division by 0 on float16 when compiling with MSVC, since MSVC detects and errors out on divisions by 0. (Benoit Steiner, 2016-04-08)
|
* Renamed float16 into cxx11_float16 since the test relies on C++11 features (Benoit Steiner, 2016-04-07)
|
* Added missing EIGEN_DEVICE_FUNC to the tensor conversion code. (Benoit Steiner, 2016-04-07)
|
* Worked around numerical noise in the test for the zeta function. (Benoit Steiner, 2016-04-07)
|