Commit message | Author | Age | |
---|---|---|---|
* | Make sure that HyperPlane::transform maintains a unit normal vector in the Affine case. | 2016-12-20 | |
| | |||
* | Fixed compilation errors with gcc6 when compiling the AVX512 intrinsics | 2016-12-19 | |
| | |||
* | Use 32 registers on ARM64 | 2016-12-19 | |
| | |||
* | bug #1360: fix sign issue with pmull on altivec | 2016-12-18 | |
| | |||
* | Fix unused warning | 2016-12-18 | |
| | |||
* | Transformation methods added to ParametrizedLine class. | 2016-12-17 | |
| | |||
* | Made sure EIGEN_HAS_C99_MATH is defined when compiling OpenCL code | 2016-12-17 | |
| | |||
* | NumTraits.h: | 2016-12-16 | |
| | | | | For the values 'ReadCost, AddCost and MulCost', information about value Eigen::HugeCost | ||
* | Reenabled the use of constexpr on OpenCL devices | 2016-12-15 | |
| | |||
* | bug #1363: fix mingw's ABI issue | 2016-12-15 | |
| | |||
* | bug #1358: fix compilation for sparse += sparse.selfadjointView(); | 2016-12-14 | |
| | |||
* | bug #1359: fix compilation of col_major_sparse.row() *= scalar | 2016-12-14 | |
| | | | | (used to work in 3.2.9 though the expression is not really writable) | ||
* | bug #1359: fix sparse /=scalar and *=scalar implementation. | 2016-12-14 | |
| | | | | InnerIterators must be obtained from an evaluator. | ||
* | bug #1361: fix compilation issue in mat=perm.inverse() | 2016-12-13 | |
| | |||
* | Merged in srvasude/eigen/fix_cuda_exp (pull request PR-268) | 2016-12-08 | |
|\ | | | | | | | Fix expm1 CUDA implementation (do not shadow exp CUDA implementation). | ||
* | | Revert vec/y to vec*(1/y) in row-major TRSM: | 2016-12-06 | |
| | | | | | | | | | | | | div is extremely costly; this is consistent with the column-major case; this is consistent with all other BLAS implementations | ||
* | | Fix BLAS backend for symmetric rank K updates. | 2016-12-06 | |
| | | |||
| * | Change comparisons to use Scalar instead of RealScalar. | 2016-12-05 | |
| | | |||
| * | Fix expm1 CUDA implementation (do not shadow exp CUDA implementation). | 2016-12-05 | |
| | | |||
* | | use numext::abs | 2016-12-02 | |
| | | | | | | | | | | (grafted from 0a08d4c60b652d1f24b2fa062c818c4b93890c59 ) | ||
* | | 1. Add explicit template to abs2 (resolves deduction for some arithmetic types) | 2016-12-02 | |
| | | | | | | | | | | | | 2. Avoid signed-unsigned conversion in comparison (warning in case Scalar is unsigned) (grafted from 4086187e49760d4bde72750dfa20ae9451263417 ) | ||
* | | Fix memory leak in Ref<Sparse> | 2016-12-05 | |
| | | |||
* | | Optimize SparseLU::solve for rhs vectors | 2016-12-05 | |
| | | |||
* | | remove temporary in SparseLU::solve | 2016-12-05 | |
| | | |||
* | | bug #1356: fix calls to evaluator::coeffRef(0,0) to get the address of the destination by adding a dstDataPtr() member to the kernel. | 2016-12-05 | |
| | | | | | | | | | | | | This fixes undefined behavior if dst is empty (nullptr). | ||
* | | typo | 2016-12-05 | |
| | | |||
* | | Improve performance of row-major-dense-matrix * vector products for recent CPUs. | 2016-12-05 | |
| | | | | | | | | | | This revised version does not bother about aligned loads/stores, and rather processes 8 rows at once for better instruction pipelining. | ||
* | | Clean debugging code | 2016-12-05 | |
| | | |||
* | | Merged in srvasude/eigen (pull request PR-265) | 2016-12-05 | |
|\| | | | | | | | Add Expm1 support to Eigen. | ||
* | | Complete rewrite of column-major-matrix * vector product to deliver higher performance on modern CPUs. | 2016-12-03 | |
| | | | | | | | | | | | | | | | | | | | | The previous code was optimized for Intel Core2, for which unaligned loads/stores were prohibitively expensive. This new version exhibits much higher instruction independence (better pipelining) and explicitly leverages FMA. According to my benchmark, on Haswell this new kernel is always faster than the previous one, and sometimes even twice as fast. Even higher performance could be achieved with a better blocking size heuristic and, perhaps, with explicit prefetching. We should also check triangular product/solve to optimally exploit this new kernel (working on vertical panels of 4 columns is probably not optimal anymore). | ||
| * | Fix small nit where I changed name of plog1p to pexpm1. | 2016-12-02 | |
| | | |||
| * | Sync from Head. | 2016-12-02 | |
| |\ | |/ |/| | |||
| * | Added support for expm1 in Eigen. | 2016-12-02 | |
| | | |||
* | | Ease compiler job to generate clean and efficient code in mat*vec. | 2016-12-02 | |
| | | |||
* | | Operators += and -= do not resize! | 2016-12-02 | |
| | | |||
* | | bug #1355: Fixed wrong line-endings on two files | 2016-12-02 | |
| | | |||
* | | typo UIntPtr | 2016-12-01 | |
| | | | | | | | | | | (grafted from b6f04a2dd4d68fe1858524709813a5df5b9a085b ) | ||
* | | fix two warnings(unused typedef, unused variable) and a typo | 2016-12-01 | |
|/ | | | | | (grafted from a9aa3bcf50d55b63c8adb493a06c903ec34251c6 ) | ||
* | Clean up SparseCore module regarding ReverseInnerIterator | 2016-12-01 | |
| | |||
* | fix member order | 2016-12-01 | |
| | |||
* | Merged in rmlarsen/eigen (pull request PR-256) | 2016-12-01 | |
|\ | | | | | | | Add a default constructor for the "fake" __half class when not using the __half class provided by CUDA. | ||
* \ | Merged eigen/eigen into default | 2016-12-01 | |
|\ \ | |||
| * | | Fix misleading-indentation warnings. | 2016-12-01 | |
| | | | |||
* | | | Adding sycl backend for TensorPadding.h; disabling __uint128 for sycl in TensorIntDiv.h; | 2016-12-01 | |
|/ / | | | | | | | disabling cache size for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code. | ||
* | | Merged eigen/eigen into default | 2016-11-30 | |
|\ \ | |||
| * | | Fix selection of product implementation for dynamic size matrices with fixed max size. | 2016-11-30 | |
| | | | |||
| * | | Fix a performance regression in (mat*mat)*vec for which mat*mat was evaluated multiple times. | 2016-11-30 | |
| | | | |||
| * | | bug #1351: fix compilation of random with old compilers | 2016-11-30 | |
| | | | |||
| | * | Add a default constructor for the "fake" __half class when not using the | 2016-11-29 | |
| |/ | | | | | | | __half class provided by CUDA. | ||
* | | Merged eigen/eigen into default | 2016-11-26 | |
|\| |