path: root/Eigen/src/Core/products/GeneralBlockPanelKernel.h
Commit message  Author  Age
...
| * Fix product for custom complex type. (conjugation was ignored)  Gael Guennebaud  2016-09-14
| * Fix performance regression in dgemm introduced by changeset 5d51a7f12c69138ed...  Gael Guennebaud  2016-07-02
| * Fix performance regression introduced in changeset e56aabf205a1e8f581dd8a46d7...  Gael Guennebaud  2016-07-02
| * Relax mixing-type constraints for binary coefficient-wise operators:  Gael Guennebaud  2016-06-06
| * Remove the rotating kernel. It was only useful on some ARM CPUs (Qualcomm Kra...  Benoit Jacob  2016-05-24
| * Don't optimize the processing of the last rows of a matrix matrix product in ...  Benoit Steiner  2016-05-23
* | Pulled latest updates from upstream  Benoit Steiner  2016-04-29
|\|
| * Made the index type a template parameter to evaluateProductBlockingSizes  Benoit Steiner  2016-04-27
| * Deleted extraneous comma.  Benoit Steiner  2016-04-15
| * Improved the matrix multiplication blocking in the case where mr is not a pow...  Benoit Steiner  2016-04-15
| * Added ability to access the cache sizes from the tensor devices  Benoit Steiner  2016-04-14
* | Pull latest updates from upstream  Benoit Steiner  2016-04-11
|\|
| * bug #1161: fix division by zero for huge scalar types  Gael Guennebaud  2016-02-03
* | Updated the matrix multiplication code to make it compile with AVX512 enabled.  Benoit Steiner  2016-02-01
| * Make sure that block sizes are smaller than input matrix sizes.  Gael Guennebaud  2016-01-26
* | Disabled part of the matrix matrix peeling code that's incompatible with 512 ...  Benoit Steiner  2015-12-21
|/
* Fix degenerate cases in syrk and trsm  Gael Guennebaud  2015-11-30
* Use a class constructor to initialize CPU cache sizes  Chris Jones  2015-11-20
* bug #1043: Avoid integer conversion sign warning  Christoph Hertzberg  2015-08-19
* Enable runtime stack alignment in gemm_blocking_space.  Gael Guennebaud  2015-08-06
* Abandon blocking size lookup table approach. Not performing as well in real w...  Benoit Jacob  2015-05-19
* Improved the blocking strategy to speedup multithreaded tensor contractions.  Benoit Steiner  2015-04-09
* add a note on bug #992  Gael Guennebaud  2015-04-08
* bug #992: don't select a 3p GEMM path with non-vectorizable scalar types, thi...  Benoit Jacob  2015-04-07
* Fix computeProductBlockingSizes with m==0, and add respective unit test.  Gael Guennebaud  2015-03-31
* Similar to cset 3589a9c115a892ea3ca5dac74d71a1526764cb38  Benoit Jacob  2015-03-16
* Fix bug in case where EIGEN_TEST_SPECIFIC_BLOCKING_SIZE is defined but false  Benoit Jacob  2015-03-15
* actual_panel_rows computation should always be resilient to parameters not co...  Benoit Jacob  2015-03-15
* Refactor computeProductBlockingSizes to make room for the possibility of usin...  Benoit Jacob  2015-03-15
* organize a little our default cache sizes, and use a saner default L1 outside...  Benoit Jacob  2015-03-13
* Avoid undeflow when blocking size are tuned manually.  Gael Guennebaud  2015-03-06
* Improve blocking heuristic: if the lhs fit within L1, then block on the rhs i...  Gael Guennebaud  2015-03-06
* Improve product kernel: replace the previous dynamic loop swaping strategy by...  Gael Guennebaud  2015-03-06
* Product optimization: implement a dynamic loop-swapping startegy to improve m...  Gael Guennebaud  2015-03-05
* Fix asm comments in 1px1 kernel  Benoit Jacob  2015-03-03
* Add a benchmark-default-sizes action to benchmark-blocking-sizes.cpp  Benoit Jacob  2015-03-03
* Increase unit-test L1 cache size to ensure we are doing at least 2 peeled loo...  Gael Guennebaud  2015-02-27
* Re-enbale detection of min/max parentheses protection, and re-enable mpreal_s...  Gael Guennebaud  2015-02-27
* Reimplement the selection between rotating and non-rotating kernels  Benoit Jacob  2015-02-27
* Make sure that the block size computation is tested by our unit test.  Gael Guennebaud  2015-02-26
* Implement a more generic blocking-size selection algorithm. See explanations ...  Gael Guennebaud  2015-02-26
* Fix typos in block-size testing code, and set peeling on k to 8.  Gael Guennebaud  2015-02-26
* So I extensively measured the impact of the offset in this prefetch. I tried ...  Benoit Jacob  2015-02-25
* Fix my recent prefetch changes:  Benoit Jacob  2015-02-23
* rotating kernel: avoid compiling anything outside of ARM  Benoit Jacob  2015-02-18
* remove a newly introduced redundant typedef - sorry.  Benoit Jacob  2015-02-18
* bug #955 - Implement a rotating kernel alternative in the 3px4 gebp path  Benoit Jacob  2015-02-18
* Fixed template parameter.  Hauke Heibel  2015-02-18
* merge  Gael Guennebaud  2015-02-18
|\
* | Clean a bit computeProductBlockingSizes (use Index type, remove CEIL macro)  Gael Guennebaud  2015-02-18