aboutsummaryrefslogtreecommitdiffhomepage
path: root/bench
Commit message (Collapse)AuthorAge
* Fix #1911: add benchmark for move semantics with fixed-size matrixGravatar Sebastien Boisvert2020-06-11
| | | | | | | | | | | $ clang++ -O3 bench/bench_move_semantics.cpp -I. -std=c++11 \ -o bench_move_semantics $ ./bench_move_semantics float copy semantics: 1755.97 ms float move semantics: 55.063 ms double copy semantics: 2457.65 ms double move semantics: 55.034 ms
* Update run instructions for benchCholeskyGravatar n0mend2020-06-01
|
* Add KLU support to spbenchsolverGravatar Mark Eberlein2020-05-11
|
* Eigen moved the `scanLauncehr` function inside the internal namespace.Gravatar mehdi-goli2020-05-11
| | | | | | | This commit applies the following changes: - Moving the `scamLauncher` specialization inside internal namespace to fix compiler crash on TensorScan for SYCL backend. - Replacing `SYCL/sycl.hpp` to `CL/sycl.hpp` in order to follow SYCL 1.2.1 standard. - minor fixes: commenting out an unused variable to avoid compiler warnings.
* Fix perf monitoring merge functionGravatar Clément Grégoire2020-04-28
|
* Make file formatting comply with POSIX and Unix standardsGravatar Aaron Franke2020-03-23
| | | | UTF-8, LF, no BOM, and newlines at the end of files
* update hg to git hashesGravatar Gael Guennebaud2019-12-05
|
* Replace calls to "hg" by calls to "git"Gravatar Gael Guennebaud2019-12-04
|
* [SYCL] Rebasing the SYCL support branch on top of the Einge upstream master ↵Gravatar Mehdi Goli2019-11-28
| | | | | | | | | | | | | | | | | | | | | | branch. * Unifying all loadLocalTile from lhs and rhs to an extract_block function. * Adding get_tensor operation which was missing in TensorContractionMapper. * Adding the -D method missing from cmake for Disable_Skinny Contraction operation. * Wrapping all the indices in TensorScanSycl into Scan parameter struct. * Fixing typo in Device SYCL * Unifying load to private register for tall/skinny no shared * Unifying load to vector tile for tensor-vector/vector-tensor operation * Removing all the LHS/RHS class for extracting data from global * Removing Outputfunction from TensorContractionSkinnyNoshared. * Combining the local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel. * Combining General Tensor-Vector and VectorTensor contraction into one kernel. * Making double buffering optional for Tensor contraction when local memory is version is used. * Modifying benchmark to accept custom Reduction Sizes * Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host * Adding Test for SYCL * Modifying SYCL CMake
* STYLE: Convert CMake-language commands to lower caseGravatar Hans Johnson2019-10-31
| | | | | Ancient CMake versions required upper-case commands. Later command names became case-insensitive. Now the preferred style is lower-case.
* STYLE: Remove CMake-language block-end command argumentsGravatar Hans Johnson2019-10-31
| | | | | | Ancient versions of CMake required else(), endif(), and similar block termination commands to have arguments matching the command starting the block. This is no longer the preferred style.
* update wrt recent changesGravatar Gael Guennebaud2019-02-21
|
* Add possibility to bench row-major lhs and rhsGravatar Gael Guennebaud2019-02-15
|
* bug #1676: workaround GCC's bug in c++17 mode.Gravatar Gael Guennebaud2019-02-07
|
* Add recent gemm related changesets and various cleanups in perf-monitoringGravatar Gael Guennebaud2019-01-29
|
* Bypass inline asm for non compatible compilers.Gravatar Gael Guennebaud2019-01-23
|
* add changesets related to matrix product perf.Gravatar Gael Guennebaud2018-12-13
|
* [PATCH 1/2] Misc. typosGravatar luz.paz"2018-09-18
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | From 68d431b4c14ad60a778ee93c1f59ecc4b931950e Mon Sep 17 00:00:00 2001 Found via `codespell -q 3 -I ../eigen-word-whitelist.txt` where the whitelists consists of: ``` als ans cas dum lastr lowd nd overfl pres preverse substraction te uint whch ``` --- CMakeLists.txt | 26 +++++++++---------- Eigen/src/Core/GenericPacketMath.h | 2 +- Eigen/src/SparseLU/SparseLU.h | 2 +- bench/bench_norm.cpp | 2 +- doc/HiPerformance.dox | 2 +- doc/QuickStartGuide.dox | 2 +- .../Eigen/CXX11/src/Tensor/TensorChipping.h | 6 ++--- .../Eigen/CXX11/src/Tensor/TensorDeviceGpu.h | 2 +- .../src/Tensor/TensorForwardDeclarations.h | 4 +-- .../src/Tensor/TensorGpuHipCudaDefines.h | 2 +- .../Eigen/CXX11/src/Tensor/TensorReduction.h | 2 +- .../CXX11/src/Tensor/TensorReductionGpu.h | 2 +- .../test/cxx11_tensor_concatenation.cpp | 2 +- unsupported/test/cxx11_tensor_executor.cpp | 2 +- 14 files changed, 29 insertions(+), 29 deletions(-)
* Add option to disable plot generationGravatar Gael Guennebaud2018-11-07
|
* Optimize the product of a householder-sequence with the identity, and ↵Gravatar Gael Guennebaud2018-07-11
| | | | optimize the evaluation of a HouseholderSequence to a dense matrix using faster blocked product.
* MIsc. source and comment typosGravatar luz.paz2018-03-11
| | | | Found using `codespell` and `grep` from downstream FreeCAD
* Fix cmake warningGravatar Gael Guennebaud2017-12-14
|
* Improve cmake scripts for Pastix and BLAS detection.Gravatar Gael Guennebaud2017-04-14
|
* Adding synchronisation to convolution kernel for sycl backend.Gravatar Mehdi Goli2017-03-13
|
* Fixing typo in sycl Benchmark.Gravatar Mehdi Goli2017-03-08
|
* Adding sycl Benchmarks.Gravatar Mehdi Goli2017-03-08
|
* Merge latest changes from upstreamGravatar Benoit Steiner2017-01-30
|\
| * Add debug outputGravatar Gael Guennebaud2017-01-03
| |
| * Fix inclusionGravatar Gael Guennebaud2017-01-03
| |
* | Fixed the sycl benchmarking codeGravatar Benoit Steiner2016-12-22
|/
* typoGravatar Gael Guennebaud2016-12-08
|
* make sure we use the right eigen versionGravatar Gael Guennebaud2016-12-08
|
* fix usage of custom compilerGravatar Gael Guennebaud2016-12-08
|
* Add missing include and use -O3Gravatar Gael Guennebaud2016-12-07
|
* Fix call to convert on linuxGravatar Gael Guennebaud2016-12-07
|
* Cleanup file structureGravatar Gael Guennebaud2016-12-07
|
* Update perf monitoring scripts to generate html/svg outputsGravatar Gael Guennebaud2016-12-07
|
* generate png file for web uploadGravatar Gael Guennebaud2016-12-06
|
* Update monitored changeset listsGravatar Gael Guennebaud2016-12-06
|
* fix compilation of BTL's blaze interfaceGravatar Gael Guennebaud2016-12-05
|
* Add performance monitoring for LLTGravatar Gael Guennebaud2016-12-05
|
* extend monitoring benchmarks with transpose matrix-vector and triangular ↵Gravatar Gael Guennebaud2016-12-05
| | | | matrix-vectors.
* Add perf monitoring for gemvGravatar Gael Guennebaud2016-12-02
|
* Re-enable A^T*A action in BTLGravatar Gael Guennebaud2016-12-02
|
* Pull the latest updates from trunkGravatar Benoit Steiner2016-10-05
|\
| * Do not manually add absolute path to boost-library.Gravatar Christoph Hertzberg2016-09-22
| | | | | | | | Also set C++ standard for blaze to C++14
* | Partial OpenCL support via SYCL compatible with ComputeCpp CE.Gravatar Luke Iwanski2016-09-19
|/
* Update and fix Cholesky mini benchmarkGravatar Gael Guennebaud2016-07-28
|
* Update benchmark for dense solver to stress least-squares pb, and to output ↵Gravatar Gael Guennebaud2016-07-21
| | | | a HTML table
* Update relevent list of changesets.Gravatar Gael Guennebaud2016-07-04
|