aboutsummaryrefslogtreecommitdiffhomepage
path: root/tensorflow/core/util/cuda_launch_config.h
Commit message (Collapse)AuthorAge
* [ROCm] Interface changes for StreamExecutor to support both CUDA and ROCmGravatar Wen-Heng (Jack) Chung2018-07-12
| | | | | | | | | | | | | | | | | | | 1) StreamInterface::CudaStreamMemberHack() Despite the fact that StreamExecutor and GPU common runtime are largely orthogonal, this particular routine in StreamExecutor is used in GPU common runtime and a couple of other operators. In this commit it's renamed as StreamInterface::GpuStreamMemberHack() and their call sites are also changed. 2) StreamExecutorInterface::CudaContextHack() This member is renamed to StramExecutorInterface::GpuContextHack(). Changes introduced in this commit includes: - some StreamExecutor interfaces and CUDA implementation - GPU common runtime related to interface changes in StreamExecutor - operators affected by interface changes in StreamExecutor
* Fix pontential issue with number of blocks launched for depthwise kernels: ↵Gravatar A. Unique TensorFlower2018-02-16
| | | | | | | | | | the number of work_elements was too small, which could return a block_count that is too small to cover all elements. We also have been ignoring the suggested thread_per_block, so were potentially launching more blocks than necessary to fill the GPU (which is inefficient, but functionally correct). Changing 'assert(false && ...' to LOG(FATAL) because it shouldn't be debug only. PiperOrigin-RevId: 186037306
* Roll CL 179861781 forward with fix: Wrappers for CUDA 9 warp-synchronous ↵Gravatar A. Unique TensorFlower2018-01-26
| | | | | | intrinsics. PiperOrigin-RevId: 183374082
* Automated g4 rollback of changelist 179782067Gravatar A. Unique TensorFlower2017-12-21
| | | | PiperOrigin-RevId: 179861781
* Roll CL 177989542 forward with fix: Wrappers for CUDA 9 warp-synchronous ↵Gravatar A. Unique TensorFlower2017-12-21
| | | | | | intrinsics. PiperOrigin-RevId: 179782067
* Automated g4 rollback of changelist 177799252Gravatar A. Unique TensorFlower2017-12-05
| | | | PiperOrigin-RevId: 177989542
* Wrappers for CUDA 9 warp-synchronous intrinsics.Gravatar A. Unique TensorFlower2017-12-04
PiperOrigin-RevId: 177799252