path: root/bench/tensors
Commit message | Author | Age
* Eigen moved the `scanLauncher` function inside the internal namespace. | mehdi-goli | 2020-05-11
    This commit applies the following changes:
    - Moving the `scanLauncher` specialization inside the internal namespace to fix a compiler crash on TensorScan for the SYCL backend.
    - Replacing `SYCL/sycl.hpp` with `CL/sycl.hpp` in order to follow the SYCL 1.2.1 standard.
    - Minor fixes: commenting out an unused variable to avoid compiler warnings.
* [SYCL] Rebasing the SYCL support branch on top of the Eigen upstream master branch. | Mehdi Goli | 2019-11-28
    - Unifying all loadLocalTile from lhs and rhs to an extract_block function.
    - Adding get_tensor operation which was missing in TensorContractionMapper.
    - Adding the -D method missing from cmake for Disable_Skinny Contraction operation.
    - Wrapping all the indices in TensorScanSycl into Scan parameter struct.
    - Fixing typo in Device SYCL.
    - Unifying load to private register for tall/skinny no shared.
    - Unifying load to vector tile for tensor-vector/vector-tensor operation.
    - Removing all the LHS/RHS class for extracting data from global.
    - Removing Outputfunction from TensorContractionSkinnyNoshared.
    - Combining the local memory version of tall/skinny and normal tensor contraction into one kernel.
    - Combining the no-local memory version of tall/skinny and normal tensor contraction into one kernel.
    - Combining General Tensor-Vector and VectorTensor contraction into one kernel.
    - Making double buffering optional for Tensor contraction when the local memory version is used.
    - Modifying benchmark to accept custom Reduction Sizes.
    - Disabling AVX optimization for SYCL backend on the host to allow SSE optimization to the host.
    - Adding Test for SYCL.
    - Modifying SYCL CMake.
* Adding synchronisation to convolution kernel for sycl backend. | Mehdi Goli | 2017-03-13
* Fixing typo in sycl Benchmark. | Mehdi Goli | 2017-03-08
* Adding sycl Benchmarks. | Mehdi Goli | 2017-03-08
* Fixed the sycl benchmarking code | Benoit Steiner | 2016-12-22
* Partial OpenCL support via SYCL compatible with ComputeCpp CE. | Luke Iwanski | 2016-09-19
* Updated the README file for the tensor benchmarks | Benoit Steiner | 2016-05-25
* Improved the performance of tensor padding | Benoit Steiner | 2016-05-25
* Added benchmarks for contraction on CPU. | Benoit Steiner | 2016-05-13
* Added a benchmark to measure the performance of full reductions of 16 bit floats | Benoit Steiner | 2016-05-05
* Use index list for the striding benchmarks | Benoit Steiner | 2016-04-21
* Enable the benchmarks for algebraic and transcendental functions on fp16. | Benoit Steiner | 2016-04-12
* Turned on the contraction benchmarks for fp16 | Benoit Steiner | 2016-04-12
* Turn on the coeffWise benchmarks on fp16 | Benoit Steiner | 2016-04-07
* Fixed the type casting benchmarks for fp16 | Benoit Steiner | 2016-04-07
* Fixed the benchmarking of fp16 coefficient wise operations | Benoit Steiner | 2016-04-07
* Updated the benchmarking code to use Eigen::half instead of half | Benoit Steiner | 2016-03-24
* Made the tensor benchmarks compile on MacOS | Benoit Steiner | 2016-03-23
* Added benchmarks for full reduction | Benoit Steiner | 2016-02-29
* Improved the README | Benoit Steiner | 2016-02-27
* Added benchmarks for type casting of float16 | Benoit Steiner | 2016-02-26
* Added benchmarks for fp16 | Benoit Steiner | 2016-02-26
* Extended the tensor benchmark suite to support types other than floats | Benoit Steiner | 2016-02-23
* Updated the tensor benchmarking code to work with compilers that don't support cxx11. | Benoit Steiner | 2016-02-23
* Added 2 benchmarks to the suite of tensor benchmarks running on GPU | Benoit Steiner | 2016-01-30
* Fixed the tensor benchmarks on apple devices | Benoit Steiner | 2016-01-28
* Fixed clang related compilation error | Benoit Steiner | 2016-01-28
* Fixed a typo | Benoit Steiner | 2016-01-28
* Made sure the number of floating point operations done by a benchmark is computed using 64 bit integers to avoid overflows. | Benoit Steiner | 2016-01-28
* Added a readme to explain how to compile the tensor benchmarks. | Benoit Steiner | 2016-01-28
* Updated the benchmarking code to print the number of flops processed instead of the number of bytes. | Benoit Steiner | 2016-01-28
* Added extra tensor benchmarks | Benoit Steiner | 2016-01-28
* bugfix | Yangqing Jia | 2016-01-28
* benchmark modifications to make it compilable in a standalone fashion. | Yangqing Jia | 2016-01-28
* Added a few benchmarks for the tensor code | Benoit Steiner | 2015-01-26