tensorflow - machine learning framework

	Commit message	Author	Age
*	[ROCm] Interface changes for StreamExecutor to support both CUDA and ROCm	Wen-Heng (Jack) Chung	2018-07-12
	1) StreamInterface::CudaStreamMemberHack()
	Although StreamExecutor and the GPU common runtime are largely orthogonal, this routine in StreamExecutor is used by the GPU common runtime and a couple of other operators. This commit renames it to StreamInterface::GpuStreamMemberHack() and updates its call sites.
	2) StreamExecutorInterface::CudaContextHack()
	This member is renamed to StreamExecutorInterface::GpuContextHack().
	Changes introduced in this commit include:
	- some StreamExecutor interfaces and the CUDA implementation
	- GPU common runtime code related to the interface changes in StreamExecutor
	- operators affected by the interface changes in StreamExecutor
*	Fix potential issue with number of blocks launched for depthwise kernels:	A. Unique TensorFlower	2018-02-16
	the number of work_elements was too small, which could return a block_count that is too small to cover all elements. We have also been ignoring the suggested thread_per_block, so we were potentially launching more blocks than necessary to fill the GPU (which is inefficient, but functionally correct). Changing 'assert(false && ...' to LOG(FATAL) because the check shouldn't be debug-only.
	PiperOrigin-RevId: 186037306
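The block-count problem described in the commit above can be illustrated with a minimal host-side sketch. It assumes a kernel in which each thread handles exactly one element (no grid-stride loop); `BlocksToCover` is a hypothetical helper written for this illustration and is not TensorFlow's launch-configuration API.

```cpp
#include <cassert>

// Hypothetical helper (not part of TensorFlow): returns the smallest number of
// blocks such that block_count * threads_per_block >= work_element_count,
// assuming one element per thread.
int BlocksToCover(int work_element_count, int threads_per_block) {
  assert(work_element_count > 0 && threads_per_block > 0);
  // Ceiling division: round up so the final, partially filled block is counted.
  return (work_element_count + threads_per_block - 1) / threads_per_block;
}
```

Under these assumptions, if the real workload is 10000 elements but the count passed in is only 2500, `BlocksToCover(2500, 256)` yields 10 blocks (2560 threads), which leaves most elements unprocessed, whereas `BlocksToCover(10000, 256)` yields 40 blocks (10240 threads), enough to cover every element.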
*	Roll CL 179861781 forward with fix: Wrappers for CUDA 9 warp-synchronous intrinsics.	A. Unique TensorFlower	2018-01-26
	PiperOrigin-RevId: 183374082
*	Automated g4 rollback of changelist 179782067	A. Unique TensorFlower	2017-12-21
	PiperOrigin-RevId: 179861781
*	Roll CL 177989542 forward with fix: Wrappers for CUDA 9 warp-synchronous intrinsics.	A. Unique TensorFlower	2017-12-21
	PiperOrigin-RevId: 179782067
*	Automated g4 rollback of changelist 177799252	A. Unique TensorFlower	2017-12-05
	PiperOrigin-RevId: 177989542
*	Wrappers for CUDA 9 warp-synchronous intrinsics.	A. Unique TensorFlower	2017-12-04
	PiperOrigin-RevId: 177799252