[XLA:GPU] Add an implementation of scatter for GPU - tensorflow


author	Benjamin Kramer <kramerb@google.com>	2018-10-09 13:32:24 -0700
committer	TensorFlower Gardener <gardener@tensorflow.org>	2018-10-09 13:40:43 -0700
commit	5d9a7fdf4f02c2db487a03e7ad2d520f8847c4e3 (patch)
tree	a77d90f9328b7e0e859a15ab3b5d765774954b5a /tensorflow/go
parent	9989788be25c846d087ac70b76cf78759a209a3e (diff)

[XLA:GPU] Add an implementation of scatter for GPU

This simply has a kernel that runs on every element of the updates tensor, figures out the right indices to perform the update, and applies it with an atomic operation.

Currently we emit a CAS for plain (i.e. non-add) updates, which is inefficient. Also TuplePointsToAnalysis doesn't know that it should alias the operand and output buffers of a scatter, which would avoid a copy.

PiperOrigin-RevId: 216412467
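The approach in the commit message (one kernel invocation per element of the updates tensor, applied with an atomic operation) can be illustrated with a small host-side simulation. This is only a sketch under stated assumptions, not the XLA implementation: it handles a 1-D operand only, and `AtomicCell`, `apply_update`, and `scatter_1d` are hypothetical names invented for this example. A lock stands in for the hardware atomic, and out-of-bounds indices are skipped, which is an assumption of this sketch.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class AtomicCell:
    """Hypothetical stand-in for one element of GPU memory with atomic access."""

    def __init__(self, value):
        self._value = value
        self._lock = threading.Lock()

    def load(self):
        with self._lock:
            return self._value

    def compare_and_swap(self, expected, desired):
        """Store `desired` only if the cell still holds `expected`."""
        with self._lock:
            if self._value == expected:
                self._value = desired
                return True
            return False


def apply_update(cell, update, combiner):
    """CAS loop: retry until no other thread changed the cell in between."""
    while True:
        old = cell.load()
        if cell.compare_and_swap(old, combiner(old, update)):
            return


def scatter_1d(operand, indices, updates, combiner):
    """Run one logical thread per update element (1-D case only)."""
    # The output starts as a copy of the operand; the commit message notes
    # that aliasing the two buffers would avoid this copy.
    output = [AtomicCell(v) for v in operand]

    def kernel(i):
        target = indices[i]
        # Assumption of this sketch: out-of-bounds updates are skipped.
        if 0 <= target < len(output):
            apply_update(output[target], updates[i], combiner)

    with ThreadPoolExecutor(max_workers=4) as pool:
        list(pool.map(kernel, range(len(updates))))
    return [cell.load() for cell in output]
```

With an addition combiner the result does not depend on thread ordering, because addition of integers is commutative and associative. A plain overwrite combiner such as `lambda old, new: new` also works through the same CAS loop, which mirrors the inefficiency the commit message points out: the retry loop is unnecessary when the old value is simply discarded, and with duplicate indices the winning update depends on scheduling.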

Diffstat (limited to 'tensorflow/go')

0 files changed, 0 insertions, 0 deletions

