diff options
author | 2016-05-17 09:13:27 -0700 | |
---|---|---|
committer | 2016-05-17 09:13:27 -0700 | |
commit | 8d06c02ffd9eb43194311d0e21b8618d3a8f4937 (patch) | |
tree | 3536e517927555d667504b70e5b6f087697aae86 /CTestCustom.cmake.in | |
parent | a80d875916de350c1849cd97d8b2515f620911d4 (diff) |
Allow vectorized padding on GPU. This helps speed things up a little.
Before:
BM_padding/10 5000000 460 217.03 MFlops/s
BM_padding/80 5000000 460 13899.40 MFlops/s
BM_padding/640 5000000 461 888421.17 MFlops/s
BM_padding/4K 5000000 460 54316322.55 MFlops/s
After:
BM_padding/10 5000000 454 220.20 MFlops/s
BM_padding/80 5000000 455 14039.86 MFlops/s
BM_padding/640 5000000 452 904968.83 MFlops/s
BM_padding/4K 5000000 411 60750049.21 MFlops/s
Diffstat (limited to 'CTestCustom.cmake.in')
0 files changed, 0 insertions, 0 deletions