[XLA:GPU] Add an implementation of scatter for GPU

This simple has a kernel that runs on every element of the updates tensor, figure out the right indices to perform the update, and applies it with an atomic operation. Currently we emit a CAS for plain (i.e. non-add) updates, which is inefficient. Also TuplePointsToAnalysis doesn't know that it should alias the operand and output buffers of a scatter, which would avoid a copy. PiperOrigin-RevId: 216412467
author: Benjamin Kramer <kramerb@google.com> 2018-10-09 13:32:24 -0700
committer: TensorFlower Gardener <gardener@tensorflow.org> 2018-10-09 13:40:43 -0700
commit: 5d9a7fdf4f02c2db487a03e7ad2d520f8847c4e3 (patch)
tree: a77d90f9328b7e0e859a15ab3b5d765774954b5a /tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc
parent: 9989788be25c846d087ac70b76cf78759a209a3e (diff)
1 files changed, 0 insertions, 3 deletions
diff --git a/tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc b/tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc
index ac6c2c5565..5409f65589 100644
--- a/tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc
+++ b/tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc
@@ -75,7 +75,6 @@ limitations under the License.
 #include "tensorflow/compiler/xla/service/llvm_ir/llvm_util.h"
 #include "tensorflow/compiler/xla/service/reduce_precision_insertion.h"
 #include "tensorflow/compiler/xla/service/reshape_mover.h"
-#include "tensorflow/compiler/xla/service/scatter_expander.h"
 #include "tensorflow/compiler/xla/service/transpose_folding.h"
 #include "tensorflow/compiler/xla/service/tuple_simplifier.h"
 #include "tensorflow/compiler/xla/service/while_loop_constant_sinking.h"
@@ -176,8 +175,6 @@ Status OptimizeHloModule(HloModule* hlo_module, se::StreamExecutor* stream_exec,
       // elimination has to come after that pass.
       pipeline.AddPass<ZeroSizedHloElimination>();
 
-      pipeline.AddPass<ScatterExpander>();
-
       pass.AddPass<AlgebraicSimplifier>(
           /*is_layout_sensitive=*/false,
           [](const Shape&, const Shape&) { return false; });
author	Benjamin Kramer <kramerb@google.com>	2018-10-09 13:32:24 -0700
committer	TensorFlower Gardener <gardener@tensorflow.org>	2018-10-09 13:40:43 -0700
commit	5d9a7fdf4f02c2db487a03e7ad2d520f8847c4e3 (patch)
tree	a77d90f9328b7e0e859a15ab3b5d765774954b5a /tensorflow/compiler/xla/service/gpu/nvptx_compiler.cc
parent	9989788be25c846d087ac70b76cf78759a209a3e (diff)