diff options
author | Benjamin Kramer <kramerb@google.com> | 2018-10-09 13:32:24 -0700 |
---|---|---|
committer | TensorFlower Gardener <gardener@tensorflow.org> | 2018-10-09 13:40:43 -0700 |
commit | 5d9a7fdf4f02c2db487a03e7ad2d520f8847c4e3 (patch) | |
tree | a77d90f9328b7e0e859a15ab3b5d765774954b5a /CODEOWNERS | |
parent | 9989788be25c846d087ac70b76cf78759a209a3e (diff) |
[XLA:GPU] Add an implementation of scatter for GPU
This simple has a kernel that runs on every element of the updates tensor,
figure out the right indices to perform the update, and applies it with an
atomic operation.
Currently we emit a CAS for plain (i.e. non-add) updates, which is inefficient.
Also TuplePointsToAnalysis doesn't know that it should alias the operand and
output buffers of a scatter, which would avoid a copy.
PiperOrigin-RevId: 216412467
Diffstat (limited to 'CODEOWNERS')
0 files changed, 0 insertions, 0 deletions