aboutsummaryrefslogtreecommitdiffhomepage
path: root/Eigen/src/Core/products/GeneralBlockPanelKernel.h
Commit message (Expand)AuthorAge
* Small cleanup: Get rid of the macros EIGEN_HAS_SINGLE_INSTRUCTION_CJMADD and ...Gravatar Rasmus Munk Larsen2021-06-24
* Fix c++20 warnings about using enums in arithmetic expressions.Gravatar Rasmus Munk Larsen2021-06-10
* Fix excessive GEBP register spilling for 32-bit NEON.Gravatar Antonio Sanchez2021-02-03
* Fix failure in GEBP kernel when compiling with OpenMP and FMAGravatar David Tellenbach2020-09-30
* Fix unused variable warning on ArmGravatar David Tellenbach2020-06-15
* Fix static analyzer warning in SelfadjointProduct.h.Gravatar Rasmus Munk Larsen2020-06-08
* Fix #1874: it works on both MSVC 2017 and other platforms.Gravatar Kan Chen2020-05-21
* Fix #1874: workaround MSVC 2017 compilation issue.Gravatar Gael Guennebaud2020-05-15
* Possibility to specify user-defined default cache sizes for GEBP kernelGravatar David Tellenbach2020-05-08
* Speed up matrix multiplication for small to medium size matrices by using hal...Gravatar Rasmus Munk Larsen2020-04-07
* Adding correct cache sizes for PPC architecture.Gravatar Everton Constantino2020-01-13
* PR 719: fix real/imag namespace conflictGravatar Gael Guennebaud2019-10-08
* Speed up GEMV on AVX-512 builds, just as done for GEBP previously.Gravatar Gustavo Lima Chaves2019-04-26
* Remove duplicated comment lineGravatar Eugene Zhulenev2019-02-04
* Fix GeneralBlockPanelKernel Android compilationGravatar Eugene Zhulenev2019-02-04
* GEBP: improves pipelining in the 1pX4 path with FMA.Gravatar Gael Guennebaud2019-01-30
* Fix compilation with ARM64.Gravatar Gael Guennebaud2019-01-30
* Fix conflicts and mergeGravatar Gael Guennebaud2019-01-30
|\
* | According to https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89101, the previous...Gravatar Gael Guennebaud2019-01-30
* | ARM64 & GEBP: add specialization for double +30% speed upGravatar Gael Guennebaud2019-01-30
* | ARM64 & GEBP: Make use of vfmaq_laneq_f32 and workaround GCC's issue in gener...Gravatar Gael Guennebaud2019-01-30
* | Fix compilation error in NEON GEBP specializaition of madd.Gravatar Rasmus Munk Larsen2019-01-25
* | GEBP: fix swapped kernel mode with AVX512 and complex scalarsGravatar Gael Guennebaud2019-01-16
* | GEBP: cleanup logic to choose between a 4 packets of 1 packetGravatar Gael Guennebaud2019-01-16
* | bug #1661: fix regression in GEBP and AVX512Gravatar Gael Guennebaud2019-01-16
* | bug #1633: use proper type for madd temporaries, factorize RhsPacketx4.Gravatar Gael Guennebaud2019-01-16
* | Bug: 1633: refactor gebp kernel and optimize for neonGravatar Renjie Liu2019-01-16
* | Make code compile again for older compilers.Gravatar Christoph Hertzberg2018-12-22
| * gebp: Add new ½ and ¼ packet rows per (peeling) round on the lhsGravatar Gustavo Lima Chaves2018-12-21
|/
* Artificially increase l1-blocking size for AVX512. +10% speedup with current ...Gravatar Gael Guennebaud2018-12-11
* bug #1643: fix compilation issue with gcc and no optimizaionGravatar Gael Guennebaud2018-12-11
* enable spilling workaround on architectures with SSE/AVXGravatar Gael Guennebaud2018-12-10
* workaround "may be used uninitialized" warningGravatar Gael Guennebaud2018-12-08
* fix EIGEN_GEBP_2PX4_SPILLING_WORKAROUND for non vectorized type, and non x86/...Gravatar Gael Guennebaud2018-12-08
* bug #1515: disable gebp's 3pX4 micro kernel for MSVC<=19.14 because of regist...Gravatar Gael Guennebaud2018-12-07
* bug #1636: fix gemm performance issue with gcc>=6 and no FMAGravatar Gael Guennebaud2018-12-07
* Implement AVX512 vectorization of std::complex<float/double>Gravatar Gael Guennebaud2018-12-06
* do not read buffers out of bounds -- load only the 4 bytes we know exist here...Gravatar Benoit Jacob2018-11-27
* fix the build on 64-bit ARM when NEON is disabledGravatar Benoit Jacob2018-11-27
* bug #1624: improve matrix-matrix product on ARM 64, 20% speedupGravatar Gael Guennebaud2018-11-23
* Vectorize row-by-row gebp loop iterations on 16 packets as wellGravatar Gustavo Lima Chaves2018-11-06
* Fix regression introduced by the previous fix for AVX512.Gravatar Gael Guennebaud2018-09-20
* Fix gebp kernel for real+complex in case only reals are vectorized (e.g., AVX...Gravatar Gael Guennebaud2018-09-20
* bug #1578: Improve prefetching in matrix multiplication on MIPS.Gravatar Alexey Frunze2018-07-24
* Fix legitimate "declaration shadows a typedef" warningGravatar Gael Guennebaud2018-07-09
* Rename predux_downto4 to be more accurate on its semantic.Gravatar Gael Guennebaud2018-04-03
* MIsc. source and comment typosGravatar luz.paz2018-03-11
* Fix misleading-indentation warnings.Gravatar Gael Guennebaud2016-12-01
* Renamed predux_half into predux_downto4Gravatar Benoit Steiner2016-10-06
* Merged latest updates from trunkGravatar Benoit Steiner2016-10-05
|\