path: root/tensorflow/compiler/xla/service/gpu/gpu_executable.cc
Commit message | Author | Date
* [XLA] Migrate from gtl::FlatMap to absl::flat_hash_map | Benjamin Kramer | 2018-10-01
* [XLA] Rename all (Mutable)ArraySlice to absl::Span. | Tim Shen | 2018-08-30
* [XLA] Switch to absl::StrFormat. | Justin Lebar | 2018-08-27
* [XLA] gtl::optional->absl::optional | Yunxing Dai | 2018-08-21
* [XLA] Use absl::make_unique instead of xla::MakeUnique. | Justin Lebar | 2018-08-20
* Make GemmThunkShouldHaltAllActivityBeforeRunning easier to understand; NFC | Sanjoy Das | 2018-08-09
* [XLA] Clean up clang tidy readability warnings in compiler/xla | Benjamin Kramer | 2018-08-06
* Use constant buffer allocations for XLA:CPU | Sanjoy Das | 2018-07-27
* [XLA:GPU] Only add the cubin if it is available | Benjamin Kramer | 2018-07-27
* Implement constant buffer allocation for XLA:GPU | Sanjoy Das | 2018-07-26
* Replace generic Pool with StreamPool, and discard failed streams. | Todd Wang | 2018-07-25
* Do profiling inside while thunks and conditionals. | Adrian Kuegel | 2018-07-02
* Extract HloExecutionProfiler into its own file. | Adrian Kuegel | 2018-06-14
* [XLA:GPU] Add op-tracing to XLA:GPU. | Justin Lebar | 2018-05-16
* [XLA:GPU] Load kernel thunks' kernels before running them. | Justin Lebar | 2018-05-11
* [XLA:GPU] Remove unused Thunk::ShouldBlockFutureThunks function. | Justin Lebar | 2018-05-10
* [XLA] Make XLA's memory allocator return an owning smart pointer. | Justin Lebar | 2018-05-09
* [XLA] Make Executable return a ScopedShapedBuffer. | Justin Lebar | 2018-04-22
* [XLA] De-unique_ptr-ify ShapedBuffer and ScopedShapedBuffer. | Justin Lebar | 2018-04-19
* [XLA] Convert XLA to use xla::se as a namespace alias for ::stream_executor. | Justin Lebar | 2018-04-17
* [XLA] Assert that all buffers and sub-buffers passed to XLA have an explicit ... | Justin Lebar | 2018-03-27
* [XLA:GPU] Fix HLO profiling when multiple streams are involved. | Justin Lebar | 2018-02-26
* [XLA:GPU] Don't crash when the root instruction of a computation is a multi-o... | Justin Lebar | 2018-02-13
* [TF:XLA] Replace most of HloProfilePrinter by a protocol buffer | Sanjoy Das | 2018-01-24
* [XLA] Clean up our handling of ExecutionProfile and add a test case | Sanjoy Das | 2018-01-10
* [XLA:GPU] Fix implementation of Thunk::ShouldBlockFutureThunks. | Justin Lebar | 2017-12-22
* [XLA:GPU] Make the use of scratch allocator in convolution_thunk safe. | A. Unique TensorFlower | 2017-12-20
* Merged commit includes the following changes: | A. Unique TensorFlower | 2017-12-18
* Rename Stream::BlockHostUntilDoneWithStatus to BlockHostUntilDone. | A. Unique TensorFlower | 2017-12-13
* Use BlockHostUntilDoneWithStatus in various places. | A. Unique TensorFlower | 2017-12-11
* [XLA:GPU] Don't autotune while other kernels are running. | Justin Lebar | 2017-12-05
* Place HloProfilePrinter and HloProfileIndexMap in Executable | Sanjoy Das | 2017-11-22
* GPU JIT improvements. | Artem Belevich | 2017-11-07
* Rename (Add|Get)ProfileResult to something more specific; NFC | Sanjoy Das | 2017-10-31
* Remove "hybrid" HloModuleConfig option. The option was used to generate execu... | Mark Heffernan | 2017-10-04
* Add more `const`s to xla::Executable. No functional change. | A. Unique TensorFlower | 2017-09-27
* For tuple-shaped data, change ShapedBuffer (an abstraction holding on-device ... | Mark Heffernan | 2017-09-26
* Reduce XLA compile time by ~7% for a convolutional image model: | A. Unique TensorFlower | 2017-08-18
* Allow cost estimates to differ per backend and include the estimates into the... | Bjarke Hammersholt Roune | 2017-07-31
* [XLA] Simplify Shape traversal visitors. | Mark Heffernan | 2017-06-06
* [XLA] Teach Executable to do its own profiling (patch 1/4). | A. Unique TensorFlower | 2017-05-22
* [XLA] Attach an HloModuleConfig to HloModule, obviating the need to pass them... | Eli Bendersky | 2017-05-18
* Automated g4 rollback of changelist 155305963 | Peter Hawkins | 2017-05-14
* [TF:XLA] Instead of dumping ptxas output, write to a temp file & and use to b... | A. Unique TensorFlower | 2017-05-06
* [TF:XLA:GPU] Avoid a CPU-GPU synchronization when running XLA GPU computation... | Peter Hawkins | 2017-03-08
* [TF:XLA] Remove support for client-allocated result buffers. | Peter Hawkins | 2017-03-07
* [XLA:GPU] Cache GPU substreams across executions | A. Unique TensorFlower | 2017-03-02
* [XLA] Add support for profiling multiple computations | David Majnemer | 2017-03-01
* [TF:XLA] Change buffer assignment to combine temp buffers into one allocation. | A. Unique TensorFlower | 2017-02-07
* Initial open-source release of XLA: Accelerated Linear Algebra. | Peter Hawkins | 2017-01-09