index : tensorflow (branch: master)
machine learning framework
path: root/tensorflow/compiler/xla/service/gpu/ir_emitter_unnested.cc
| Commit message | Author | Age |
|---|---|---|
| [XLA:GPU] Elide the SequentialThunk when emitting scatter with no copy | Benjamin Kramer | 2018-10-09 |
| [XLA:GPU] Add an implementation of scatter for GPU | Benjamin Kramer | 2018-10-09 |
| Simplify ir_emitter_unnested so that it doesn't take a look at conv | Tim Shen | 2018-09-19 |
| Simplify convolution_thunk's interface. | Tim Shen | 2018-09-10 |
| [XLA:GPU] Clean up init thunk handling to handle arbitrary fused init values | Benjamin Kramer | 2018-09-07 |
| [XLA:GPU] Refactor some code for fusion output handling. | Bixia Zheng | 2018-09-06 |
| Call Cudnn also for grouped convolutions. | Adrian Kuegel | 2018-09-03 |
| Change headers to directly include absl::Span, and clean up the build | Tim Shen | 2018-08-30 |
| [XLA] Rename all (Mutable)ArraySlice to absl::Span. | Tim Shen | 2018-08-30 |
| [XLA] xla::ContainersEqual -> absl::c_equal | Benjamin Kramer | 2018-08-30 |
| Use a mixin to reduce llvm::IRBuilder<> related boilerplate. | Sanjoy Das | 2018-08-27 |
| [XLA] Switch to absl::StrFormat. | Justin Lebar | 2018-08-27 |
| [XLA] Unify spelling of 'fusible' | Benjamin Kramer | 2018-08-27 |
| [XLA] Use absl string types and functions instead of the TF versions. | Justin Lebar | 2018-08-23 |
| [XLA] gtl::optional->absl::optional | Yunxing Dai | 2018-08-21 |
| Merged commit includes the following changes: | Yifei Feng | 2018-08-21 |
| [XLA] Use absl::make_unique instead of xla::MakeUnique. | Justin Lebar | 2018-08-20 |
| [XLA] Switch to absl versions of the c_foo functions. | Justin Lebar | 2018-08-20 |
| Automated rollback of commit 4a41f50648929197954d892559587cb76458d306 | A. Unique TensorFlower | 2018-08-17 |
| [XLA] Switch to absl versions of the c_foo functions. | Justin Lebar | 2018-08-17 |
| [XLA] Make sure backends that don't support variadic reduce reject it. | Michael Kuperstein | 2018-08-09 |
| [XLA:GPU] Add a generic trip count analysis based on HloEvaluator | Benjamin Kramer | 2018-08-08 |
| [XLA:GPU] Forward batched dot to cublas instead of expanding it | Benjamin Kramer | 2018-08-03 |
| [XLA:GPU] Don't emit HostToDevice copies | Sanjoy Das | 2018-08-02 |
| Allow Sort to share the buffer with the operand if it is the only user. | Adrian Kuegel | 2018-07-31 |
| Use constant buffer allocations for XLA:CPU | Sanjoy Das | 2018-07-27 |
| Implement constant buffer allocation for XLA:GPU | Sanjoy Das | 2018-07-26 |
| Support sorting of key/value pairs on GPU. | Adrian Kuegel | 2018-07-26 |
| [XLA:GPU] Remember to execute non-root outfeed instructions in nested computa... | Sanjoy Das | 2018-07-25 |
| [XLA:CPU/GPU] Implement the parallel Philox random number generation algorithm. | Bixia Zheng | 2018-07-25 |
| [XLA:GPU] Don't lie about buffer alignment to LLVM | Sanjoy Das | 2018-07-24 |
| Parallelize BitonicSort on GPU. | Adrian Kuegel | 2018-07-24 |
| [XLA:GPU] Add an operator<< to Thunk::Kind. | Bixia Zheng | 2018-07-23 |
| [XLA:GPU] Make sure that buffers for tuple() have a unique top-level allocation | Benjamin Kramer | 2018-07-23 |
| [XLA:GPU] Limit the number of shmem tiles XLA:GPU will use for 021 transposes. | Justin Lebar | 2018-07-20 |
| [XLA] s/ir_builder/b/ | Justin Lebar | 2018-07-20 |
| Support unsigned indices for in-place DynamicUpdateSlice. | Adrian Kuegel | 2018-07-18 |
| Implement BitonicSort for GPU. | Adrian Kuegel | 2018-07-18 |
| [XLA:GPU] Generalize the column reduction algorithm to handle tile widths gre... | Thomas Joerg | 2018-07-17 |
| [XLA] Use shfl.sync.down instead of shfl.sync. | Justin Lebar | 2018-07-13 |
| [XLA:GPU] s/llvm_ir::IrArray/IrArray/ in ir_emitter_unnested. | Justin Lebar | 2018-07-11 |
| [XLA:GPU] Cleanups to fused 021 transpose implementation. | Justin Lebar | 2018-07-11 |
| [XLA:GPU] Implement outfeed | Benjamin Kramer | 2018-07-10 |
| [XLA:GPU] Delete AnnotateBufferLoadStoreInstructionWithMetadata. | Justin Lebar | 2018-07-10 |
| [XLA:GPU] Enhance the tiled 0-2-1 transpose algorithm to handle fusion. | Bixia Zheng | 2018-07-04 |
| Profile SequentialThunks if they represent one HloInstruction. | Adrian Kuegel | 2018-07-04 |
| [TF:XLA] Split literal_util into {literal, literal_util}. | Kay Zhu | 2018-07-03 |
| [TF:XLA] Split select HLO into array- and tuple-select. | A. Unique TensorFlower | 2018-07-03 |
| Fix check whether there is more than one tile. | Adrian Kuegel | 2018-06-27 |
| [XLA:GPU] Make the input-fused reduce emitter work on 16-bit types | Benjamin Kramer | 2018-06-26 |