Commit message [Author, Age]
* Reworked the threadpool cancellation mechanism to not depend on pthread_cancel since it turns out that pthread_cancel doesn't work properly on numerous platforms. [Benoit Steiner, 2016-12-09]
|
* Added a message to ease the detection of platforms on which thread cancellation isn't supported. [Benoit Steiner, 2016-12-08]
|
* Added a Flush method to the RunQueue [Benoit Steiner, 2016-12-08]
|
* Added the new threadpool cancel method to the threadpool interface based class. [Benoit Steiner, 2016-12-08]
|
* Added support for thread cancellation on Linux [Benoit Steiner, 2016-12-08]
|
* Merged in srvasude/eigen/fix_cuda_exp (pull request PR-268) [Benoit Steiner, 2016-12-08]
|\  Fix expm1 CUDA implementation (do not shadow exp CUDA implementation).
* | typo [Gael Guennebaud, 2016-12-08]
| |
* | make sure we use the right eigen version [Gael Guennebaud, 2016-12-08]
| |
* | fix usage of custom compiler [Gael Guennebaud, 2016-12-08]
| |
* | Add missing include and use -O3 [Gael Guennebaud, 2016-12-07]
| |
* | Fix call to convert on linux [Gael Guennebaud, 2016-12-07]
| |
* | Cleanup file structure [Gael Guennebaud, 2016-12-07]
| |
* | Update perf monitoring scripts to generate html/svg outputs [Gael Guennebaud, 2016-12-07]
| |
* | Add a script example to run and upload performance tests [Gael Guennebaud, 2016-12-06]
| |
* | generate png file for web upload [Gael Guennebaud, 2016-12-06]
| |
* | Mention the CMAKE_PREFIX_PATH variable. [Gael Guennebaud, 2016-12-06]
| |
* | Update monitored changeset lists [Gael Guennebaud, 2016-12-06]
| |
* | Revert vec/y to vec*(1/y) in row-major TRSM: [Gael Guennebaud, 2016-12-06]
| |   - div is extremely costly
| |   - this is consistent with the column-major case
| |   - this is consistent with all other BLAS implementations
* | Fix BLAS backend for symmetric rank K updates. [Gael Guennebaud, 2016-12-06]
| |
* | Explain how to choose your favorite Eigen version [Gael Guennebaud, 2016-12-06]
| |
* | Added relocatable cmake support also for CMake before 3.0 and after 2.8.8 [Silvio Traversaro, 2016-12-06]
| |
| * Change comparisons to use Scalar instead of RealScalar. [Srinivas Vasudevan, 2016-12-05]
| |
| * Fix expm1 CUDA implementation (do not shadow exp CUDA implementation). [Srinivas Vasudevan, 2016-12-05]
| |
* | Make CMake config file relocatable [Silvio Traversaro, 2016-12-05]
| |
* | fix compilation of BTL's blaze interface [Gael Guennebaud, 2016-12-05]
| |
* | Add performance monitoring for LLT [Gael Guennebaud, 2016-12-05]
| |
* | use numext::abs [Angelos Mantzaflaris, 2016-12-02]
| |   (grafted from 0a08d4c60b652d1f24b2fa062c818c4b93890c59)
* | 1. Add explicit template to abs2 (resolves deduction for some arithmetic types) [Angelos Mantzaflaris, 2016-12-02]
| |   2. Avoid signed-unsigned conversion in comparison (warning in case Scalar is unsigned)
| |   (grafted from 4086187e49760d4bde72750dfa20ae9451263417)
* | Fix memory leak in Ref<Sparse> [Gael Guennebaud, 2016-12-05]
| |
* | Optimize SparseLU::solve for rhs vectors [Gael Guennebaud, 2016-12-05]
| |
* | remove temporary in SparseLU::solve [Gael Guennebaud, 2016-12-05]
| |
* | bug #1356: fix calls to evaluator::coeffRef(0,0) to get the address of the destination by adding a dstDataPtr() member to the kernel. [Gael Guennebaud, 2016-12-05]
| |   This fixes undefined behavior if dst is empty (nullptr).
* | typo [Gael Guennebaud, 2016-12-05]
| |
* | extend monitoring benchmarks with transpose matrix-vector and triangular matrix-vectors. [Gael Guennebaud, 2016-12-05]
| |
* | Improve performance of row-major-dense-matrix * vector products for recent CPUs. [Gael Guennebaud, 2016-12-05]
| |   This revised version does not bother about aligned loads/stores, and rather processes 8 rows at once for better instruction pipelining.
* | Clean debugging code [Gael Guennebaud, 2016-12-05]
| |
* | Merged in srvasude/eigen (pull request PR-265) [Benoit Steiner, 2016-12-05]
|\|   Add Expm1 support to Eigen.
* | Add missing generic load methods. [Gael Guennebaud, 2016-12-03]
| |
* | Complete rewrite of column-major-matrix * vector product to deliver higher performance on modern CPUs. [Gael Guennebaud, 2016-12-03]
| |   The previous code was optimized for Intel core2, for which unaligned loads/stores were prohibitively expensive.
| |   This new version exhibits much higher instruction independence (better pipelining) and explicitly leverages FMA.
| |   According to my benchmark, on Haswell this new kernel is always faster than the previous one, and sometimes even twice as fast.
| |   Even higher performance could be achieved with a better blocking size heuristic and, perhaps, with explicit prefetching.
| |   We should also check triangular product/solve to optimally exploit this new kernel (working on vertical panels of 4 columns is probably not optimal anymore).
* | Merged eigen/eigen into default [Benoit Steiner, 2016-12-02]
|\ \
| | * Fix small nit where I changed name of plog1p to pexpm1. [Srinivas Vasudevan, 2016-12-02]
| | |
| | * Sync from Head. [Srinivas Vasudevan, 2016-12-02]
| | |\
| | |/
| |/|
| | * Added support for expm1 in Eigen. [Srinivas Vasudevan, 2016-12-02]
| | |
| * | Ease compiler job to generate clean and efficient code in mat*vec. [Gael Guennebaud, 2016-12-02]
| | |
| * | Operators += and -= do not resize! [Gael Guennebaud, 2016-12-02]
| | |
* | | Making default numeric_list work with sycl. [Mehdi Goli, 2016-12-02]
| | |
| * | merge [Gael Guennebaud, 2016-12-02]
| |\ \
| * | | Add perf monitoring for gemv [Gael Guennebaud, 2016-12-02]
| | | |
| * | | Re-enable A^T*A action in BTL [Gael Guennebaud, 2016-12-02]
| | | |
| | * | bug #1355: Fixed wrong line-endings on two files [Christoph Hertzberg, 2016-12-02]
| |/ /