| Commit message | Author | Date |
* | Made the index type a template parameter to evaluateProductBlockingSizes | Benoit Steiner | 2016-04-27 |
* | Deleted extraneous comma. | Benoit Steiner | 2016-04-15 |
* | Improved the matrix multiplication blocking in the case where mr is not a pow... | Benoit Steiner | 2016-04-15 |
* | Added ability to access the cache sizes from the tensor devices | Benoit Steiner | 2016-04-14 |
* | bug #1161: fix division by zero for huge scalar types | Gael Guennebaud | 2016-02-03 |
* | Make sure that block sizes are smaller than input matrix sizes. | Gael Guennebaud | 2016-01-26 |
* | Fix degenerate cases in syrk and trsm | Gael Guennebaud | 2015-11-30 |
* | Use a class constructor to initialize CPU cache sizes | Chris Jones | 2015-11-20 |
* | bug #1043: Avoid integer conversion sign warning | Christoph Hertzberg | 2015-08-19 |
* | Enable runtime stack alignment in gemm_blocking_space. | Gael Guennebaud | 2015-08-06 |
* | Abandon blocking size lookup table approach. Not performing as well in real w... | Benoit Jacob | 2015-05-19 |
* | Improved the blocking strategy to speedup multithreaded tensor contractions. | Benoit Steiner | 2015-04-09 |
* | add a note on bug #992 | Gael Guennebaud | 2015-04-08 |
* | bug #992: don't select a 3p GEMM path with non-vectorizable scalar types, thi... | Benoit Jacob | 2015-04-07 |
* | Fix computeProductBlockingSizes with m==0, and add respective unit test. | Gael Guennebaud | 2015-03-31 |
* | Similar to cset 3589a9c115a892ea3ca5dac74d71a1526764cb38 | Benoit Jacob | 2015-03-16 |
* | Fix bug in case where EIGEN_TEST_SPECIFIC_BLOCKING_SIZE is defined but false | Benoit Jacob | 2015-03-15 |
* | actual_panel_rows computation should always be resilient to parameters not co... | Benoit Jacob | 2015-03-15 |
* | Refactor computeProductBlockingSizes to make room for the possibility of usin... | Benoit Jacob | 2015-03-15 |
* | organize a little our default cache sizes, and use a saner default L1 outside... | Benoit Jacob | 2015-03-13 |
* | Avoid underflow when blocking sizes are tuned manually. | Gael Guennebaud | 2015-03-06 |
* | Improve blocking heuristic: if the lhs fit within L1, then block on the rhs i... | Gael Guennebaud | 2015-03-06 |
* | Improve product kernel: replace the previous dynamic loop swapping strategy by... | Gael Guennebaud | 2015-03-06 |
* | Product optimization: implement a dynamic loop-swapping strategy to improve m... | Gael Guennebaud | 2015-03-05 |
* | Fix asm comments in 1px1 kernel | Benoit Jacob | 2015-03-03 |
* | Add a benchmark-default-sizes action to benchmark-blocking-sizes.cpp | Benoit Jacob | 2015-03-03 |
* | Increase unit-test L1 cache size to ensure we are doing at least 2 peeled loo... | Gael Guennebaud | 2015-02-27 |
* | Re-enable detection of min/max parentheses protection, and re-enable mpreal_s... | Gael Guennebaud | 2015-02-27 |
* | Reimplement the selection between rotating and non-rotating kernels | Benoit Jacob | 2015-02-27 |
* | Make sure that the block size computation is tested by our unit test. | Gael Guennebaud | 2015-02-26 |
* | Implement a more generic blocking-size selection algorithm. See explanations ... | Gael Guennebaud | 2015-02-26 |
* | Fix typos in block-size testing code, and set peeling on k to 8. | Gael Guennebaud | 2015-02-26 |
* | So I extensively measured the impact of the offset in this prefetch. I tried ... | Benoit Jacob | 2015-02-25 |
* | Fix my recent prefetch changes: | Benoit Jacob | 2015-02-23 |
* | rotating kernel: avoid compiling anything outside of ARM | Benoit Jacob | 2015-02-18 |
* | remove a newly introduced redundant typedef - sorry. | Benoit Jacob | 2015-02-18 |
* | bug #955 - Implement a rotating kernel alternative in the 3px4 gebp path | Benoit Jacob | 2015-02-18 |
* | Fixed template parameter. | Hauke Heibel | 2015-02-18 |
* | merge | Gael Guennebaud | 2015-02-18 |
|\ |
* | | Clean a bit computeProductBlockingSizes (use Index type, remove CEIL macro) | Gael Guennebaud | 2015-02-18 |
| * | bug #958 - Allow testing specific blocking sizes | Benoit Jacob | 2015-02-18 |
|/ |
* | Fix bug #945: workaround MSVC warning | Gael Guennebaud | 2015-02-18 |
* | bug #953 - Fix prefetches in 3px4 product kernel | Benoit Jacob | 2015-02-13 |
* | Pulled the latest changes from the trunk | Benoit Steiner | 2015-02-06 |
|\ |
| * | bug #936, patch 1.5/3: rename _FUSED_ macros to _SINGLE_INSTRUCTION_, | Benoit Jacob | 2015-01-31 |
| * | bug #936, patch 1/3: some cleanup and renaming for consistency. | Benoit Jacob | 2015-01-30 |
| * | bug #935: Add asm comments in GEBP kernels to work around a bug | Benoit Jacob | 2015-01-30 |
* | | Made the blocking computation aware of the l3 cache | Benoit Steiner | 2014-10-15 |
* | | Generalized the gebp apis | Benoit Steiner | 2014-10-02 |
| * | Initial VSX commit | Konstantinos Margaritis | 2014-08-29 |
|/ |