aboutsummaryrefslogtreecommitdiffhomepage
path: root/tensorflow/stream_executor/cuda/cuda_blas.cc
Commit message (Expand)AuthorAge
* [XLA:GPU] Add a fast version of gemmStridedBatched for cuda 9.1Gravatar Benjamin Kramer2018-08-03
* [XLA:GPU] Use strided batched gemm instead of building pointer tables.Gravatar Benjamin Kramer2018-08-03
* [SE] Add additional log statements to DoBlasGemmWithAlgorithmImpl.Gravatar Justin Lebar2018-07-31
* [SE] Add new cublas algorithms from CUDA 9.2.Gravatar Justin Lebar2018-07-31
* [SE] Add missing cublas algorithms for cuda 9.0, CUBLAS_GEMM_ALGO{3,4}_TENSOR...Gravatar Justin Lebar2018-07-31
* Improve filter for cuBLAS bug.Gravatar A. Unique TensorFlower2018-06-19
* Fix a build failure when cuda version is less than 9000.Gravatar A. Unique TensorFlower2018-06-13
* Detect configurations that would be hitting a bug in cuBLAS and report an error.Gravatar A. Unique TensorFlower2018-06-13
* Merge changes from github.Gravatar Yifei Feng2018-05-24
* Dropping support for CUDA < 8.Gravatar A. Unique TensorFlower2018-05-18
* Use parenthesis based construction instead of brace initializationGravatar Smit Hinsu2018-05-09
* Add variants of DoBlasGemmWithAlgorithm with alpha being on device.Gravatar A. Unique TensorFlower2018-04-24
* [StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1.Gravatar Justin Lebar2018-04-17
* Support RNN profiling in StreamExecutor for CUDA GPUs.Gravatar James Qin2018-04-06
* [XLA] FP16 Dot support for the CPU and GPU backends.Gravatar Bixia Zheng2018-02-28
* Merge changes from github.Gravatar Patrick Nguyen2017-12-28
* Let GetBlasGemmAlgorithms() always return true.Gravatar Yangzihao Wang2017-07-21
* Automated g4 rollback of changelist 162423171Gravatar A. Unique TensorFlower2017-07-18
* Add autotuning code for matmul operator.Gravatar Yangzihao Wang2017-07-18
* Add support for int8 x int8 -> int32 matrix multiplication via cublasGemmEx t...Gravatar A. Unique TensorFlower2017-07-06
* [XLA] [StreamExecutor] Tune GEMMs when possible.Gravatar Justin Lebar2017-03-02
* Remove problematic SE_RETURN_STATUS_AS_BOOL macroGravatar Peter Hawkins2017-02-10
* Stop using DSO loader for CUDA SDK librariesGravatar A. Unique TensorFlower2017-01-25
* Merge changes from github.Gravatar Patrick Nguyen2016-10-20
* Merge changes from github.Gravatar A. Unique TensorFlower2016-08-25
* Merge changes from github.Gravatar Vijay Vasudevan2016-06-11
* Update copyright for 3p/tf.Gravatar A. Unique TensorFlower2016-06-02
* In the StreamExecutor, make lack of CUDA 7.5 a non-fatal error for SGEMMGravatar A. Unique TensorFlower2016-05-13
* Add fp16 cuDNN convolution support to StreamExecutor. (TensorFlow ops willGravatar A. Unique TensorFlower2016-05-11
* Add fp16 matrix multiplication (GEMM) support to StreamExecutor, gated onGravatar A. Unique TensorFlower2016-05-11
* Support ScratchAllocator in BLAS Batched GEMMGravatar A. Unique TensorFlower2016-03-18
* TensorFlow: upstream changes to git.Gravatar Vijay Vasudevan2015-12-08
* TensorFlow: Improve performance of AlexnetGravatar Manjunath Kudlur2015-11-20
* TensorFlow: Initial commit of TensorFlow library.Gravatar Manjunath Kudlur2015-11-06