Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Reworked the threadpool cancellation mechanism to not depend on ↵ | Benoit Steiner | 2016-12-09 |
| | | | | pthread_cancel since it turns out that pthread_cancel doesn't work properly on numerous platforms. | ||
* | Added a message to ease the detection of platforms on which thread ↵ | Benoit Steiner | 2016-12-08 |
| | | | | cancellation isn't supported. | ||
* | Added a Flush method to the RunQueue | Benoit Steiner | 2016-12-08 |
| | |||
* | Added the new threadpool cancel method to the threadpool interface based class. | Benoit Steiner | 2016-12-08 |
| | |||
* | Added support for thread cancellation on Linux | Benoit Steiner | 2016-12-08 |
| | |||
* | Merged in srvasude/eigen/fix_cuda_exp (pull request PR-268) | Benoit Steiner | 2016-12-08 |
|\ | | | | | | | Fix expm1 CUDA implementation (do not shadow exp CUDA implementation). | ||
* | | typo | Gael Guennebaud | 2016-12-08 |
| | | |||
* | | make sure we use the right eigen version | Gael Guennebaud | 2016-12-08 |
| | | |||
* | | fix usage of custom compiler | Gael Guennebaud | 2016-12-08 |
| | | |||
* | | Add missing include and use -O3 | Gael Guennebaud | 2016-12-07 |
| | | |||
* | | Fix call to convert on linux | Gael Guennebaud | 2016-12-07 |
| | | |||
* | | Cleanup file structure | Gael Guennebaud | 2016-12-07 |
| | | |||
* | | Update perf monitoring scripts to generate html/svg outputs | Gael Guennebaud | 2016-12-07 |
| | | |||
* | | Add a script example to run and upload performance tests | Gael Guennebaud | 2016-12-06 |
| | | |||
* | | generate png file for web upload | Gael Guennebaud | 2016-12-06 |
| | | |||
* | | Mention the CMAKE_PREFIX_PATH variable. | Gael Guennebaud | 2016-12-06 |
| | | |||
* | | Update monitored changeset lists | Gael Guennebaud | 2016-12-06 |
| | | |||
* | | Revert vec/y to vec*(1/y) in row-major TRSM: | Gael Guennebaud | 2016-12-06 |
| | | | | | | | | | | | | - div is extremely costly - this is consistent with the column-major case - this is consistent with all other BLAS implementations | ||
* | | Fix BLAS backend for symmetric rank K updates. | Gael Guennebaud | 2016-12-06 |
| | | |||
* | | Explain how to choose your favorite Eigen version | Gael Guennebaud | 2016-12-06 |
| | | |||
* | | Added relocatable cmake support also for CMake before 3.0 and after 2.8.8 | Silvio Traversaro | 2016-12-06 |
| | | |||
| * | Change comparisons to use Scalar instead of RealScalar. | Srinivas Vasudevan | 2016-12-05 |
| | | |||
| * | Fix expm1 CUDA implementation (do not shadow exp CUDA implementation). | Srinivas Vasudevan | 2016-12-05 |
| | | |||
* | | Make CMake config file relocatable | Silvio Traversaro | 2016-12-05 |
| | | |||
* | | fix compilation of BTL's blaze interface | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | Add performance monitoring for LLT | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | use numext::abs | Angelos Mantzaflaris | 2016-12-02 |
| | | | | | | | | | | (grafted from 0a08d4c60b652d1f24b2fa062c818c4b93890c59 ) | ||
* | | 1. Add explicit template to abs2 (resolves deduction for some arithmetic types) | Angelos Mantzaflaris | 2016-12-02 |
| | | | | | | | | | | | | 2. Avoid signed-unsigned conversion in comparison (warning in case Scalar is unsigned) (grafted from 4086187e49760d4bde72750dfa20ae9451263417 ) | ||
* | | Fix memory leak in Ref<Sparse> | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | Optimize SparseLU::solve for rhs vectors | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | remove temporary in SparseLU::solve | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | bug #1356: fix calls to evaluator::coeffRef(0,0) to get the address of the ↵ | Gael Guennebaud | 2016-12-05 |
| | | | | | | | | | | | | destination by adding a dstDataPtr() member to the kernel. This fixes undefined behavior if dst is empty (nullptr). | ||
* | | typo | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | extend monitoring benchmarks with transpose matrix-vector and triangular ↵ | Gael Guennebaud | 2016-12-05 |
| | | | | | | | | matrix-vectors. | ||
* | | Improve performance of row-major-dense-matrix * vector products for recent CPUs. | Gael Guennebaud | 2016-12-05 |
| | | | | | | | | | | This revised version does not bother about aligned loads/stores, and rather processes 8 rows at ones for better instruction pipelining. | ||
* | | Clean debugging code | Gael Guennebaud | 2016-12-05 |
| | | |||
* | | Merged in srvasude/eigen (pull request PR-265) | Benoit Steiner | 2016-12-05 |
|\| | | | | | | | Add Expm1 support to Eigen. | ||
* | | Add missing generic load methods. | Gael Guennebaud | 2016-12-03 |
| | | |||
* | | Complete rewrite of column-major-matrix * vector product to deliver higher ↵ | Gael Guennebaud | 2016-12-03 |
| | | | | | | | | | | | | | | | | | | | | performance of modern CPU. The previous code has been optimized for Intel core2 for which unaligned loads/stores were prohibitively expensive. This new version exhibits much higher instruction independence (better pipelining) and explicitly leverage FMA. According to my benchmark, on Haswell this new kernel is always faster than the previous one, and sometimes even twice as fast. Even higher performance could be achieved with a better blocking size heuristic and, perhaps, with explicit prefetching. We should also check triangular product/solve to optimally exploit this new kernel (working on vertical panel of 4 columns is probably not optimal anymore). | ||
* | | Merged eigen/eigen into default | Benoit Steiner | 2016-12-02 |
|\ \ | |||
| | * | Fix small nit where I changed name of plog1p to pexpm1. | Srinivas Vasudevan | 2016-12-02 |
| | | | |||
| | * | Sync from Head. | Srinivas Vasudevan | 2016-12-02 |
| | |\ | | |/ | |/| | |||
| | * | Added support for expm1 in Eigen. | Srinivas Vasudevan | 2016-12-02 |
| | | | |||
| * | | Ease compiler job to generate clean and efficient code in mat*vec. | Gael Guennebaud | 2016-12-02 |
| | | | |||
| * | | Operators += and -= do not resize! | Gael Guennebaud | 2016-12-02 |
| | | | |||
* | | | Makingt default numeric_list works with sycl. | Mehdi Goli | 2016-12-02 |
| | | | |||
| * | | merge | Gael Guennebaud | 2016-12-02 |
| |\ \ | |||
| * | | | Add perf monitoring for gemv | Gael Guennebaud | 2016-12-02 |
| | | | | |||
| * | | | Re-enable A^T*A action in BTL | Gael Guennebaud | 2016-12-02 |
| | | | | |||
| | * | | bug #1355: Fixed wrong line-endings on two files | Christoph Hertzberg | 2016-12-02 |
| |/ / |