aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen
Commit message (Collapse)AuthorAge
* use numext::absGravatar Angelos Mantzaflaris2016-12-02
| | | | | (grafted from 0a08d4c60b652d1f24b2fa062c818c4b93890c59 )
* 1. Add explicit template to abs2 (resolves deduction for some arithmetic types)Gravatar Angelos Mantzaflaris2016-12-02
| | | | | | 2. Avoid signed-unsigned conversion in comparison (warning in case Scalar is unsigned) (grafted from 4086187e49760d4bde72750dfa20ae9451263417 )
* Fix memory leak in Ref<Sparse>Gravatar Gael Guennebaud2016-12-05
|
* Optimize SparseLU::solve for rhs vectorsGravatar Gael Guennebaud2016-12-05
|
* remove temporary in SparseLU::solveGravatar Gael Guennebaud2016-12-05
|
* bug #1356: fix calls to evaluator::coeffRef(0,0) to get the address of the ↵Gravatar Gael Guennebaud2016-12-05
| | | | | | destination by adding a dstDataPtr() member to the kernel. This fixes undefined behavior if dst is empty (nullptr).
* typoGravatar Gael Guennebaud2016-12-05
|
* Improve performance of row-major-dense-matrix * vector products for recent CPUs.Gravatar Gael Guennebaud2016-12-05
| | | | | This revised version does not bother about aligned loads/stores, and rather processes 8 rows at ones for better instruction pipelining.
* Clean debugging codeGravatar Gael Guennebaud2016-12-05
|
* Merged in srvasude/eigen (pull request PR-265)Gravatar Benoit Steiner2016-12-05
|\ | | | | | | Add Expm1 support to Eigen.
* | Complete rewrite of column-major-matrix * vector product to deliver higher ↵Gravatar Gael Guennebaud2016-12-03
| | | | | | | | | | | | | | | | | | | | performance of modern CPU. The previous code has been optimized for Intel core2 for which unaligned loads/stores were prohibitively expensive. This new version exhibits much higher instruction independence (better pipelining) and explicitly leverage FMA. According to my benchmark, on Haswell this new kernel is always faster than the previous one, and sometimes even twice as fast. Even higher performance could be achieved with a better blocking size heuristic and, perhaps, with explicit prefetching. We should also check triangular product/solve to optimally exploit this new kernel (working on vertical panel of 4 columns is probably not optimal anymore).
| * Fix small nit where I changed name of plog1p to pexpm1.Gravatar Srinivas Vasudevan2016-12-02
| |
| * Sync from Head.Gravatar Srinivas Vasudevan2016-12-02
| |\ | |/ |/|
| * Added support for expm1 in Eigen.Gravatar Srinivas Vasudevan2016-12-02
| |
* | Ease compiler job to generate clean and efficient code in mat*vec.Gravatar Gael Guennebaud2016-12-02
| |
* | Operators += and -= do not resize!Gravatar Gael Guennebaud2016-12-02
| |
* | bug #1355: Fixed wrong line-endings on two filesGravatar Christoph Hertzberg2016-12-02
| |
* | typo UIntPtrGravatar Angelos Mantzaflaris2016-12-01
| | | | | | | | | | (grafted from b6f04a2dd4d68fe1858524709813a5df5b9a085b )
* | fix two warnings(unused typedef, unused variable) and a typoGravatar Angelos Mantzaflaris2016-12-01
|/ | | | | (grafted from a9aa3bcf50d55b63c8adb493a06c903ec34251c6 )
* Clean up SparseCore module regarding ReverseInnerIteratorGravatar Gael Guennebaud2016-12-01
|
* fix member orderGravatar Gael Guennebaud2016-12-01
|
* Merged in rmlarsen/eigen (pull request PR-256)Gravatar Gael Guennebaud2016-12-01
|\ | | | | | | Add a default constructor for the "fake" __half class when not using the __half class provided by CUDA.
* \ Merged eigen/eigen into defaultGravatar Benoit Steiner2016-12-01
|\ \
| * | Fix misleading-indentation warnings.Gravatar Gael Guennebaud2016-12-01
| | |
* | | Adding sycl backend for TensorPadding.h; disbaling __unit128 for sycl in ↵Gravatar Mehdi Goli2016-12-01
|/ / | | | | | | TensorIntDiv.h; disabling cashsize for sycl in tensorDeviceDefault.h; adding sycl backend for StrideSliceOP ; removing sycl compiler warning for creating an array of size 0 in CXX11Meta.h; cleaning up the sycl backend code.
* | Merged eigen/eigen into defaultGravatar Benoit Steiner2016-11-30
|\ \
| * | Fix slection of product implementation for dynamic size matrices with fixed ↵Gravatar Gael Guennebaud2016-11-30
| | | | | | | | | | | | max size.
| * | Fix a performance regression in (mat*mat)*vec for which mat*mat was ↵Gravatar Gael Guennebaud2016-11-30
| | | | | | | | | | | | evaluated multiple times.
| * | bug #1351: fix compilation of random with old compilersGravatar Gael Guennebaud2016-11-30
| | |
| | * Add a default constructor for the "fake" __half class when not using theGravatar Rasmus Munk Larsen2016-11-29
| |/ | | | | | | __half class provided by CUDA.
* | Merged eigen/eigen into defaultGravatar Benoit Steiner2016-11-26
|\|
* | Fixing LLVM error on TensorMorphingSycl.h on GPU; fixing int64_t crash for ↵Gravatar Mehdi Goli2016-11-25
| | | | | | | | tensor_broadcast_sycl on GPU; adding get_sycl_supported_devices() on syclDevice.h.
| * Disabled the "remove the call to 'std::abs' since unsigned values cannot be ↵Gravatar Benoit Steiner2016-11-23
| | | | | | | | negative" warning introduced in clang 3.5
* | Removing unsupported device from test case; cleaning the tensor device sycl.Gravatar Mehdi Goli2016-11-23
| |
| * Fix compilation with gcc and old ABI versionGravatar Gael Guennebaud2016-11-23
| |
| * Fix compilation issue with MSVC:Gravatar Gael Guennebaud2016-11-23
| | | | | | | | | | | | | | | | | | MSVC always messes up with shadowed template arguments, for instance in: struct B { typedef float T; } template<typename T> struct A : B { T g; }; The type of A<double>::g will be float and not double.
| * Optimize predux<Packet8f> (AVX)Gravatar Gael Guennebaud2016-11-22
| |
| * Disable usage of SSE3 _mm_hadd_ps that is extremely slow.Gravatar Gael Guennebaud2016-11-22
| |
| * Optimize predux<Packet4d> (AVX)Gravatar Gael Guennebaud2016-11-22
| |
| * Disable usage of SSE3 haddpd that is extremely slow.Gravatar Gael Guennebaud2016-11-22
| |
| * Fix regression in assigment of sparse block to spasre block.Gravatar Gael Guennebaud2016-11-21
|/
* Enable the use of constant expressions with clang >= 3.6Gravatar Benoit Steiner2016-11-20
|
* Fix compilation issue in mat = permutation (regression introduced in ↵Gravatar Gael Guennebaud2016-11-20
| | | | | | 8193ffb3d38b56c9295f204dc57dc6bac74f58aa )
* Fixed merge conflictsGravatar Benoit Steiner2016-11-19
|
* Merged in benoitsteiner/opencl (pull request PR-253)Gravatar Benoit Steiner2016-11-19
|\ | | | | | | OpenCL improvements
* | Enable EIGEN_HAS_C99_MATH when building with the latest version of Visual StudioGravatar Benoit Steiner2016-11-18
| |
* | Merged in ChunW/eigen (pull request PR-252)Gravatar Gael Guennebaud2016-11-18
|\ \ | | | | | | | | | Workaround for error in VS2012 with /clr
| | * Added isnan, isfinite and isinf for SYCL device. Plus test for that.Gravatar Luke Iwanski2016-11-18
| | |
* | | bug #1343: fix compilation regression in mat+=selfadjoint_view.Gravatar Gael Guennebaud2016-11-18
| | | | | | | | | | | | Generic EigenBase2EigenBase assignment was incomplete.
* | | bug #1343: fix compilation regression in array = matrix_productGravatar Gael Guennebaud2016-11-18
| | |