index : tensorflow (master), machine learning framework
path: root/tensorflow/stream_executor/cuda/cuda_blas.cc
Commit message | Author | Age
[XLA:GPU] Add a fast version of gemmStridedBatched for cuda 9.1 | Benjamin Kramer | 2018-08-03
[XLA:GPU] Use strided batched gemm instead of building pointer tables. | Benjamin Kramer | 2018-08-03
[SE] Add additional log statements to DoBlasGemmWithAlgorithmImpl. | Justin Lebar | 2018-07-31
[SE] Add new cublas algorithms from CUDA 9.2. | Justin Lebar | 2018-07-31
[SE] Add missing cublas algorithms for cuda 9.0, CUBLAS_GEMM_ALGO{3,4}_TENSOR... | Justin Lebar | 2018-07-31
Improve filter for cuBLAS bug. | A. Unique TensorFlower | 2018-06-19
Fix a build failure when cuda version is less than 9000. | A. Unique TensorFlower | 2018-06-13
Detect configurations that would be hitting a bug in cuBLAS and report an error. | A. Unique TensorFlower | 2018-06-13
Merge changes from github. | Yifei Feng | 2018-05-24
Dropping support for CUDA < 8. | A. Unique TensorFlower | 2018-05-18
Use parenthesis based construction instead of brace initialization | Smit Hinsu | 2018-05-09
Add variants of DoBlasGemmWithAlgorithm with alpha being on device. | A. Unique TensorFlower | 2018-04-24
[StreamExecutor] Rename ::perftools::gputools -> ::stream_executor, part 1. | Justin Lebar | 2018-04-17
Support RNN profiling in StreamExecutor for CUDA GPUs. | James Qin | 2018-04-06
[XLA] FP16 Dot support for the CPU and GPU backends. | Bixia Zheng | 2018-02-28
Merge changes from github. | Patrick Nguyen | 2017-12-28
Let GetBlasGemmAlgorithms() always return true. | Yangzihao Wang | 2017-07-21
Automated g4 rollback of changelist 162423171 | A. Unique TensorFlower | 2017-07-18
Add autotuning code for matmul operator. | Yangzihao Wang | 2017-07-18
Add support for int8 x int8 -> int32 matrix multiplication via cublasGemmEx t... | A. Unique TensorFlower | 2017-07-06
[XLA] [StreamExecutor] Tune GEMMs when possible. | Justin Lebar | 2017-03-02
Remove problematic SE_RETURN_STATUS_AS_BOOL macro | Peter Hawkins | 2017-02-10
Stop using DSO loader for CUDA SDK libraries | A. Unique TensorFlower | 2017-01-25
Merge changes from github. | Patrick Nguyen | 2016-10-20
Merge changes from github. | A. Unique TensorFlower | 2016-08-25
Merge changes from github. | Vijay Vasudevan | 2016-06-11
Update copyright for 3p/tf. | A. Unique TensorFlower | 2016-06-02
In the StreamExecutor, make lack of CUDA 7.5 a non-fatal error for SGEMM | A. Unique TensorFlower | 2016-05-13
Add fp16 cuDNN convolution support to StreamExecutor. (TensorFlow ops will | A. Unique TensorFlower | 2016-05-11
Add fp16 matrix multiplication (GEMM) support to StreamExecutor, gated on | A. Unique TensorFlower | 2016-05-11
Support ScratchAllocator in BLAS Batched GEMM | A. Unique TensorFlower | 2016-03-18
TensorFlow: upstream changes to git. | Vijay Vasudevan | 2015-12-08
TensorFlow: Improve performance of Alexnet | Manjunath Kudlur | 2015-11-20
TensorFlow: Initial commit of TensorFlow library. | Manjunath Kudlur | 2015-11-06