Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Added a test to validate tensor casting on cuda devices | Benoit Steiner | 2015-02-10 |
| | |||
* | Silenced a few compilation warnings generated by nvcc | Benoit Steiner | 2015-02-10 |
| | |||
* | Marked a few functions as EIGEN_DEVICE_FUNC to enable the use of tensors in ↵ | Benoit Steiner | 2015-02-10 |
| | | | | cuda kernels. | ||
* | Fixed compilation error triggered when trying to vectorize a non ↵ | Benoit Steiner | 2015-02-10 |
| | | | | vectorizable cuda kernel. | ||
* | Silenced the last batch of compilation warnings triggered by gcc 4.8 | Benoit Steiner | 2015-02-10 |
| | |||
* | Fixed a few more compilation warnings | Benoit Steiner | 2015-02-10 |
| | |||
* | Silenced more compilation warnings | Benoit Steiner | 2015-02-10 |
| | |||
* | Silcenced a few compilation warnings | Benoit Steiner | 2015-02-10 |
| | |||
* | Silenced several compilation warnings | Benoit Steiner | 2015-02-10 |
| | |||
* | Fixed compilation error | Benoit Steiner | 2015-02-10 |
| | |||
* | Removed a debug printf statement. | Benoit Steiner | 2015-02-10 |
| | |||
* | Fixed the cxx11_meta test | Benoit Steiner | 2015-02-06 |
| | |||
* | Pulled latest fixes | Benoit Steiner | 2015-02-06 |
|\ | |||
| * | merge | Gael Guennebaud | 2015-02-06 |
| |\ | |||
| * | | Fix symmetric product | Gael Guennebaud | 2015-02-06 |
| | | | |||
| * | | Fix clang compilation | Gael Guennebaud | 2015-02-06 |
| | | | |||
| * | | Fix non initialized entries and comparison of very small numbers | Gael Guennebaud | 2015-02-06 |
| | | | |||
* | | | Pulled the latest changes from the trunk | Benoit Steiner | 2015-02-06 |
|\ \ \ | |_|/ |/| | | |||
* | | | Fixed compilation error in the tensor broadcasting test | Benoit Steiner | 2015-02-06 |
| | | | |||
* | | | Added the EIGEN_HAS_CONSTEXPR define | Benoit Steiner | 2015-02-06 |
| |/ |/| | | | | | Gate the tensor index list code based on the value of EIGEN_HAS_CONSTEXPR | ||
| * | Using numext::pow instead of std::pow in poly_eval function. | Filippo Basso | 2015-02-04 |
| | | |||
| * | Use Ref<> to ensure that both x and b in Ax=b are compatible with ↵ | Gael Guennebaud | 2015-02-03 |
| | | | | | | | | Umfpack/SuperLU expectations | ||
| * | SPQR: fix default threshold value | Gael Guennebaud | 2015-02-03 |
| | | |||
* | | Silenced a few more compilation warnings | Benoit Steiner | 2015-01-30 |
| | | |||
* | | Silenced some compilation warnings | Benoit Steiner | 2015-01-30 |
| | | |||
| * | bug #936, patch 3/3: Properly detect FMA support on ARM (requires VFPv4) | Benoit Jacob | 2015-01-30 |
| | | | | | | | | | | and use it instead of MLA when available, because it's both more accurate, and faster. | ||
| * | bug #936, patch 2/3: Remove EIGEN_VECTORIZE_FMA, was redundant with ↵ | Benoit Jacob | 2015-01-30 |
| | | | | | | | | EIGEN_HAS_SINGLE_INSTRUCTION_MADD | ||
| * | bug #936, patch 1.5/3: rename _FUSED_ macros to _SINGLE_INSTRUCTION_, | Benoit Jacob | 2015-01-31 |
| | | | | | | | | | | | | | | | | | | because this is what they are about. "Fused" means "no intermediate rounding between the mul and the add, only one rounding at the end". Instead, what we are concerned about here is whether a temporary register is needed, i.e. whether the MUL and ADD are separate instructions. Concretely, on ARM NEON, a single-instruction mul-add is always available: VMLA. But a true fused mul-add is only available on VFPv4: VFMA. | ||
| * | bug #936, patch 1/3: some cleanup and renaming for consistency. | Benoit Jacob | 2015-01-30 |
| | | |||
| * | bug #935: Add asm comments in GEBP kernels to work around a bug | Benoit Jacob | 2015-01-30 |
| | | | | | | | | | | | | | | | | | | in both GCC and Clang on ARM/NEON, whereby they spill registers, severely harming performance. The reason why the asm comments make a difference is that they prevent the compiler from reordering code across these boundaries, which has the effect of extending the lifetime of local variables and increasing register pressure on this register-tight code. | ||
| * | bug #941: fix accuracy issue in ColPivHouseholderQR, do not stop ↵ | Gael Guennebaud | 2015-01-30 |
| | | | | | | | | decomposition on a small pivot | ||
| * | Supernodes was disabled. | Gael Guennebaud | 2015-01-30 |
| | | |||
* | | Marked the contraction operation as read only, since its result can't be ↵ | Benoit Steiner | 2015-01-29 |
| | | | | | | | | assigned. | ||
* | | Added more tests to cover tensor reductions | Benoit Steiner | 2015-01-28 |
| | | |||
| * | bug #933: RealSchur, do not consider the input matrix norm to check ↵ | Gael Guennebaud | 2015-01-28 |
| | | | | | | | | negligible sub-diag entries. This also makes this test consistent with the complex and self-adjoint cases. | ||
* | | mproved the performance of tensor reductions that preserve the inner most ↵ | Benoit Steiner | 2015-01-27 |
| | | | | | | | | dimension(s). | ||
* | | Added a few benchmarks for the tensor code | Benoit Steiner | 2015-01-26 |
| | | |||
| * | Enable vectorization of transposeInPlace for PacketSize x PacketSize matrices | Gael Guennebaud | 2015-01-26 |
| | | |||
| * | Add support for dense ?= diagonal | Gael Guennebaud | 2015-01-24 |
| | | |||
| * | bug #329: fix typo | Gael Guennebaud | 2015-01-17 |
| | | |||
* | | gcc doesn't consider that | Benoit Steiner | 2015-01-16 |
| | | | | | | | | | | | | | | | | | | | | | | template<typename OtherDerived> TensorStridingOp& operator = (const OtherDerived& other) provides a valid assignment operator for the striding operation, and therefore refuses to compile code like: result.stride(foo) = source.stride(bar); Added the explicit TensorStridingOp& operator = (const TensorStridingOp& other) as a workaround to get the code to compile, and did the same in all the operations that can be used as lvalues. | ||
* | | Added cube() operation | Benoit Steiner | 2015-01-15 |
| | | |||
* | | Created many additional tests | Benoit Steiner | 2015-01-14 |
| | | |||
* | | Updated the list of include files | Benoit Steiner | 2015-01-14 |
| | | |||
* | | Improved support for RowMajor tensors | Benoit Steiner | 2015-01-14 |
| | | | | | | | | Misc fixes and API cleanups. | ||
* | | Ensured that each thread has it's own copy of the TensorEvaluator: this ↵ | Benoit Steiner | 2015-01-14 |
| | | | | | | | | avoid race conditions when the evaluator calls a non thread safe functor, eg when generating random numbers. | ||
* | | Improved the resizing of tensors | Benoit Steiner | 2015-01-14 |
| | | |||
* | | Misc improvements | Benoit Steiner | 2015-01-14 |
| | | |||
* | | Misc fixes | Benoit Steiner | 2015-01-14 |
| | | |||
* | | Expanded the functionality of index lists | Benoit Steiner | 2015-01-14 |
| | |