path: root/CODEOWNERS
author Benjamin Kramer <kramerb@google.com> 2018-10-09 13:32:24 -0700
committer TensorFlower Gardener <gardener@tensorflow.org> 2018-10-09 13:40:43 -0700
commit 5d9a7fdf4f02c2db487a03e7ad2d520f8847c4e3
tree a77d90f9328b7e0e859a15ab3b5d765774954b5a /CODEOWNERS
parent 9989788be25c846d087ac70b76cf78759a209a3e
[XLA:GPU] Add an implementation of scatter for GPU
This simple implementation has a kernel that runs on every element of the updates tensor, figures out the right indices to perform the update, and applies it with an atomic operation.

Currently we emit a CAS for plain (i.e. non-add) updates, which is inefficient. Also, TuplePointsToAnalysis doesn't know that it should alias the operand and output buffers of a scatter, which would avoid a copy.

PiperOrigin-RevId: 216412467
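
The per-update-element strategy described in the message can be sketched roughly as follows. This is a sequential Python illustration, not the emitted GPU kernel: the name scatter_1d, the 1-D operand, the scalar indices, and the skipping of out-of-bounds indices are all assumptions made for the example. Each loop iteration stands in for one GPU thread, and the read-combine-write step is the part that would have to be performed atomically on real hardware.

```python
from typing import Callable, List, Sequence


def scatter_1d(
    operand: Sequence[float],
    indices: Sequence[int],
    updates: Sequence[float],
    combine: Callable[[float, float], float],
) -> List[float]:
    """Hypothetical 1-D scatter: one logical 'thread' per update element."""
    if len(indices) != len(updates):
        raise ValueError("indices and updates must have the same length")

    # Copy the operand into the output buffer. This is the copy the commit
    # message says could be avoided if operand and output were aliased.
    out = list(operand)

    for i, update in enumerate(updates):  # on a GPU: one thread per i
        dst = indices[i]  # figure out where this update lands
        if not 0 <= dst < len(out):
            continue  # assumption: out-of-bounds updates are skipped
        # On a GPU this read-combine-write must be atomic, e.g. a native
        # atomic add for addition, or a compare-and-swap loop otherwise.
        out[dst] = combine(out[dst], update)

    return out


added = scatter_1d([0, 0, 0, 0], [1, 3, 1], [10, 20, 30], lambda old, new: old + new)
print(added)  # [0, 40, 0, 20]
```

With an addition combiner, colliding updates (index 1 above) accumulate regardless of the order in which threads run. With a plain overwrite combiner, the result at a colliding index depends on which update is applied last; the sequential loop here hides that nondeterminism.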
Diffstat (limited to 'CODEOWNERS')
0 files changed, 0 insertions, 0 deletions