aboutsummaryrefslogtreecommitdiffhomepage
path: root/tensorflow/stream_executor
Commit message (Collapse)AuthorAge
...
* | Automated rollback of commit 36a66347e8e344cddee4a8d9123ccbcae40011b1Gravatar A. Unique TensorFlower2018-07-18
| | | | | | | | PiperOrigin-RevId: 205164273
* | Merge pull request #20675 from ↵Gravatar TensorFlower Gardener2018-07-18
|\ \ | | | | | | | | | | | | | | | ROCmSoftwarePlatform:upstream-staging-stream-executor PiperOrigin-RevId: 205140328
* | | Support identity activation function in Cudnn implementation of fused conv2d ↵Gravatar A. Unique TensorFlower2018-07-17
| | | | | | | | | | | | | | | | | | bias activation. PiperOrigin-RevId: 205008958
* | | Error on some documented invalid Cudnn inputs. Cudnn should haveGravatar A. Unique TensorFlower2018-07-17
| | | | | | | | | | | | | | | | | | returned errors, but crashes instead. PiperOrigin-RevId: 205000883
* | | Print kOutputYXInput.Gravatar A. Unique TensorFlower2018-07-17
| | | | | | | | | | | | PiperOrigin-RevId: 204975773
* | | Merge pull request #20706 from ↵Gravatar TensorFlower Gardener2018-07-16
|\ \ \ | | | | | | | | | | | | | | | | | | | | ROCmSoftwarePlatform:upstream-staging-stream-executor-pooling-interface PiperOrigin-RevId: 204805678
* | | | Automated rollback of commit f8044c89287b1d90510ceace4b53ec94abaffa50Gravatar A. Unique TensorFlower2018-07-12
| | | | | | | | | | | | | | | | PiperOrigin-RevId: 204327453
* | | | Automated rollback of commit 0ea6847c892497afdd20c1150fee1e532612ca17Gravatar A. Unique TensorFlower2018-07-12
| | | | | | | | | | | | | | | | PiperOrigin-RevId: 204326206
| | * | [ROCm] Interface changes for StreamExecutor to support both CUDA and ROCmGravatar Wen-Heng (Jack) Chung2018-07-12
| |/ / |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 1) StreamInterface::CudaStreamMemberHack() Despite the fact that StreamExecutor and GPU common runtime are largely orthogonal, this particular routine in StreamExecutor is used in GPU common runtime and a couple of other operators. In this commit it's renamed as StreamInterface::GpuStreamMemberHack() and their call sites are also changed. 2) StreamExecutorInterface::CudaContextHack() This member is renamed to StramExecutorInterface::GpuContextHack(). Changes introduced in this commit includes: - some StreamExecutor interfaces and CUDA implementation - GPU common runtime related to interface changes in StreamExecutor - operators affected by interface changes in StreamExecutor
| | * More cleanupGravatar Loo Rong Jie2018-07-12
| |/ |/|
| * [ROCm] Interface changes for pooling APIs in StreamExecutorGravatar Wen-Heng (Jack) Chung2018-07-11
|/ | | | | | Due to the design of MIOpen, the DNN library on ROCm platform, an instance of ScratchAllocator has to be passed into pooling routines. This commit address such interface changes and the implementation in CUDA StreamExecutor.
* [SE] Re-enable acquiring real cpu frequencyGravatar A. Unique TensorFlower2018-07-10
| | | | PiperOrigin-RevId: 203959955
* [SE,XLA] Switch to using multiple streams in xla_device_contextGravatar A. Unique TensorFlower2018-07-09
| | | | | | | | | | | Instead of having one stream for compute, host-to-device and device-to-host transfers, switch to having separate streams, just like the GPU does. Add a se::Event field to XlaTensor to allow accurate inter-stream dependencies to be created. As part of this: - Fix TransferManager::TransferLiteralFrom/ToDevice to correctly make generated substreams wait on their master stream. - Fix Stream::BlockHostUntilDone() to not block on or return substreams. This behavior is completely broken and not only nondeterministically returns substreams to the pool but causes indefinite hangs with the HostStream. PiperOrigin-RevId: 203726543
* Allow is_initialized and initializer to be called on MirroredVariables and ↵Gravatar Anjali Sridhar2018-07-06
| | | | | | TowerLocalVariables. PiperOrigin-RevId: 203520287
* Merge commit for internal changesGravatar Yifei Feng2018-07-02
|\
| * Merge changes from github.Gravatar Yifei Feng2018-07-02
| | | | | | | | PiperOrigin-RevId: 203037623
| * Workaround the cudnn 7.1.4 correctness bug, where the workspace is required ↵Gravatar A. Unique TensorFlower2018-07-02
| | | | | | | | | | | | to be zeroed. PiperOrigin-RevId: 203001311
* | Merge pull request #20147 from yongtang/06202018-mpi_collectivesGravatar Qianli Scott Zhu2018-06-29
|\ \ | | | | | | Fix MPI build issue with bazel
* | | Automated g4 rollback of changelist 202347723Gravatar A. Unique TensorFlower2018-06-28
| | | | | | | | | | | | PiperOrigin-RevId: 202392792
* | | [SE] Re-enable acquiring real cpu frequencyGravatar A. Unique TensorFlower2018-06-28
| | | | | | | | | | | | PiperOrigin-RevId: 202347723
* | | [XLA] Support asynchronous execution through XLAGravatar A. Unique TensorFlower2018-06-28
| | | | | | | | | | | | PiperOrigin-RevId: 202292422
* | | Fix Windows GPU BuildGravatar A. Unique TensorFlower2018-06-28
| | | | | | | | | | | | PiperOrigin-RevId: 202260254
| * | Add stream_executor_headers_libGravatar Yong Tang2018-06-28
|/ / | | | | | | Signed-off-by: Yong Tang <yong.tang.github@outlook.com>
| * Automated g4 rollback of changelist 202292422Gravatar A. Unique TensorFlower2018-06-28
| | | | | | | | PiperOrigin-RevId: 202551122
| * Automated g4 rollback of changelist 202347723Gravatar A. Unique TensorFlower2018-06-27
| | | | | | | | PiperOrigin-RevId: 202392792
| * [SE] Re-enable acquiring real cpu frequencyGravatar A. Unique TensorFlower2018-06-27
| | | | | | | | PiperOrigin-RevId: 202347723
| * [XLA] Support asynchronous execution through XLAGravatar A. Unique TensorFlower2018-06-27
| | | | | | | | PiperOrigin-RevId: 202292422
| * Fix Windows GPU BuildGravatar A. Unique TensorFlower2018-06-27
|/ | | | PiperOrigin-RevId: 202260254
* Moving StatusOr from XLA to stream_executor.Gravatar Michael Case2018-06-26
| | | | PiperOrigin-RevId: 202179928
* Improve filter for cuBLAS bug.Gravatar A. Unique TensorFlower2018-06-19
| | | | PiperOrigin-RevId: 201239428
* Rollback of changelist 200200356. We might want to support GPUs on MacOS ↵Gravatar A. Unique TensorFlower2018-06-19
| | | | | | again in the future. Users are interested to make it work and we don't want to be in the way. PiperOrigin-RevId: 201214857
* Reflow comments; NFCGravatar Sanjoy Das2018-06-15
| | | | PiperOrigin-RevId: 200783258
* Add return statement to end of ToVlogString(dnn::DataType data_type)Gravatar James Keeling2018-06-13
| | | | | | | | Whilst the switch statement covers all possible enum values, the compiler still complains that it reaches the end of the function without returning a value. I add an "unknown" string, mirroring the one in the function just above. PiperOrigin-RevId: 200452885
* Fix a build failure when cuda version is less than 9000.Gravatar A. Unique TensorFlower2018-06-13
| | | | PiperOrigin-RevId: 200432478
* Detect configurations that would be hitting a bug in cuBLAS and report an error.Gravatar A. Unique TensorFlower2018-06-13
| | | | PiperOrigin-RevId: 200411493
* [XLA] "Global value" tuple destructuring available via SWIG APIs.Gravatar Chris Leary2018-06-12
| | | | PiperOrigin-RevId: 200333639
* Remove OS X code from CUDA stream executor because that platform is no ↵Gravatar A. Unique TensorFlower2018-06-12
| | | | | | longer supported. PiperOrigin-RevId: 200200356
* Unify cuDNN descriptor wrapper names.Gravatar A. Unique TensorFlower2018-06-12
| | | | | | No functional changes. PiperOrigin-RevId: 200199956
* Detect configurations that would be hitting bugs in cuDNN and report an error.Gravatar A. Unique TensorFlower2018-06-08
| | | | PiperOrigin-RevId: 199780350
* Do not enable tensor ops for cuDNN RNN unless explicitly specified.Gravatar A. Unique TensorFlower2018-06-05
| | | | PiperOrigin-RevId: 199321021
* Unify error handling in CudnnSupport.Gravatar A. Unique TensorFlower2018-06-01
| | | | PiperOrigin-RevId: 198836479
* Handle FilterLayout::kOutputYXInput in FilterDescriptor::ToShortString.Gravatar Bixia Zheng2018-05-31
| | | | | | This fixes an error when running resnet50_batch128_fp16 with --v=2. PiperOrigin-RevId: 198798196
* Fix GPU build on windowsGravatar Smit Hinsu2018-05-29
| | | | PiperOrigin-RevId: 198513480
* Merge changes from github.Gravatar Yifei Feng2018-05-24
| | | | | | | Revert #18413. Too many internal test failures due to the name scope change caused by this change. Revert #18192. Cannot use re2::StringPiece internally. Need alternative for set call. Will pull and clean this up in a separate change. PiperOrigin-RevId: 197991247
* Add convolution with NHWC layout to stream executor.Gravatar A. Unique TensorFlower2018-05-22
| | | | PiperOrigin-RevId: 197650067
* Introduce an option to allocate CUDA unified memoryGravatar Smit Hinsu2018-05-21
| | | | PiperOrigin-RevId: 197490523
* Rollforward of CL 197167501, without enabling CUDNN_FFT_TILING_FORWARD ↵Gravatar A. Unique TensorFlower2018-05-20
| | | | | | because that breaks XLA tests. PiperOrigin-RevId: 197328103
* Automated g4 rollback of changelist 197118212Gravatar A. Unique TensorFlower2018-05-18
| | | | PiperOrigin-RevId: 197167501
* Dropping support for CUDA < 8.Gravatar A. Unique TensorFlower2018-05-18
| | | | PiperOrigin-RevId: 197137612
* Dropping support for cuDNN < 6.Gravatar A. Unique TensorFlower2018-05-18
| | | | | | Enable CUDNN_FFT_TILING_FORWARD for cuDNN >= 7. PiperOrigin-RevId: 197118212