aboutsummaryrefslogtreecommitdiffhomepage
path: root/tensorflow/compiler/xla/service/gpu/gemm_thunk.cc
Commit message (Expand)AuthorAge
* [XLA] Switch to absl::StrFormat.Gravatar Justin Lebar2018-08-27
* [XLA] Use absl string types and functions instead of the TF versions.Gravatar Justin Lebar2018-08-23
* [XLA:GPU] cuBlas supports complex floats, use gemm instead of our O(n^3) impl...Gravatar Benjamin Kramer2018-08-03
* [XLA:GPU] Use strided batched gemm instead of building pointer tables.Gravatar Benjamin Kramer2018-08-03
* [XLA:GPU] Forward batched dot to cublas instead of expanding itGravatar Benjamin Kramer2018-08-03
* [XLA:GPU] Add additional VLOG'ing to gemm_thunk.Gravatar Justin Lebar2018-07-31
* Do profiling inside while thunks and conditionals.Gravatar Adrian Kuegel2018-07-02
* Reland improve fusion logic of (a dot b) * alphaGravatar A. Unique TensorFlower2018-05-15
* [XLA] s/tensorflow::Status/Status/.Gravatar Justin Lebar2018-05-11
* Remove uses of the kTransposeDot fusionGravatar Sanjoy Das2018-05-07
* [XLA] Convert XLA to use xla::se as a namespace alias for ::stream_executor.Gravatar Justin Lebar2018-04-17
* Introduce output fusion for (A dot B) * alphaGravatar A. Unique TensorFlower2018-03-13
* [XLA] FP16 Dot support for the CPU and GPU backends.Gravatar Bixia Zheng2018-02-28
* [XLA:GPU] Fix a problem in DoGemmAutotune.Gravatar Bixia Zheng2018-02-17
* [XLA] Add format field to layoutGravatar A. Unique TensorFlower2017-12-18
* Merge changes from github.Gravatar Jonathan Hseu2017-06-09
* [XLA] [StreamExecutor] Tune GEMMs when possible.Gravatar Justin Lebar2017-03-02
* [TF:XLA] Change buffer assignment to combine temp buffers into one allocation.Gravatar A. Unique TensorFlower2017-02-07
* Initial open-source release of XLA: Accelerated Linear Algebra.Gravatar Peter Hawkins2017-01-09