path: root/tensorflow/core/util/command_line_flags.cc
author: A. Unique TensorFlower <gardener@tensorflow.org> 2017-07-17 12:11:12 -0700
committer: TensorFlower Gardener <gardener@tensorflow.org> 2017-07-17 12:15:56 -0700
commit: 0c144afecef6800589d255dd990a9a88e9f94b23
tree: 732fedd9bd96080d10e093a49cdb35ea5aa4aad9 /tensorflow/core/util/command_line_flags.cc
parent: 09e9b157787f2d03ae864569de33e62424657450
CPU IR Emitter: Lower slice operations with memcpy.
This changes the way the CPU backend emits slice operations so that the maximal contiguous blocks get copied with memcpy. Without this change, LLVM's optimizers prefer to unroll the elementwise loops emitted for doing slice. This leads to increased code size. With this change, the inliner only inlines the memcpy when it is small enough.

This change also adds a test that exercises the logic to find maximal contiguous blocks.

PiperOrigin-RevId: 162249449
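
The idea of copying maximal contiguous blocks can be illustrated with a minimal sketch. This is not the code from this commit: it runs on the host rather than emitting LLVM IR, and it assumes a dense row-major layout, unit slice strides, and `float` elements. The names `ContiguousBlock`, `FindMaximalContiguousBlock`, and `SliceWithMemcpy` are hypothetical and chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical result type: one memcpy copies `elements` values, and the
// first `outer_rank` dimensions are still walked index by index.
struct ContiguousBlock {
  std::size_t elements;
  std::size_t outer_rank;
};

// Walk dimensions from innermost to outermost (row-major assumed). Every
// fully covered dimension extends the contiguous run; the first partially
// covered dimension is included and then ends it.
ContiguousBlock FindMaximalContiguousBlock(
    const std::vector<std::size_t>& dims,
    const std::vector<std::size_t>& sizes) {
  assert(dims.size() == sizes.size());
  std::size_t elements = 1;
  std::size_t d = dims.size();
  while (d > 0) {
    --d;
    elements *= sizes[d];
    if (sizes[d] != dims[d]) break;
  }
  return {elements, d};
}

// Copy the slice [starts, starts + sizes) out of `src`, one memcpy per
// maximal contiguous block instead of one assignment per element.
std::vector<float> SliceWithMemcpy(const std::vector<float>& src,
                                   const std::vector<std::size_t>& dims,
                                   const std::vector<std::size_t>& starts,
                                   const std::vector<std::size_t>& sizes) {
  assert(dims.size() == starts.size());
  const std::size_t rank = dims.size();
  const ContiguousBlock block = FindMaximalContiguousBlock(dims, sizes);

  // Row-major strides, measured in elements.
  std::vector<std::size_t> strides(rank, 1);
  for (std::size_t d = rank; d > 1; --d) {
    strides[d - 2] = strides[d - 1] * dims[d - 1];
  }

  std::size_t total = 1;
  for (std::size_t s : sizes) total *= s;
  std::vector<float> dst(total);
  if (total == 0) return dst;

  // Offset contributed by the dimensions that live inside one block.
  std::size_t inner_offset = 0;
  for (std::size_t d = block.outer_rank; d < rank; ++d) {
    inner_offset += starts[d] * strides[d];
  }

  std::vector<std::size_t> index(block.outer_rank, 0);
  std::size_t written = 0;
  while (written < total) {
    std::size_t src_offset = inner_offset;
    for (std::size_t d = 0; d < block.outer_rank; ++d) {
      src_offset += (starts[d] + index[d]) * strides[d];
    }
    std::memcpy(dst.data() + written, src.data() + src_offset,
                block.elements * sizeof(float));
    written += block.elements;
    // Advance the outer multi-index like an odometer.
    for (std::size_t d = block.outer_rank; d > 0; --d) {
      if (++index[d - 1] < sizes[d - 1]) break;
      index[d - 1] = 0;
    }
  }
  return dst;
}
```

For a 3x4 source, slicing two full rows yields a single 8-element block and one memcpy, while slicing a 2x2 window yields 2-element blocks and two memcpy calls. Handing the copy to memcpy, rather than spelling out an elementwise loop, is what lets the optimizer decide whether a small copy is worth expanding inline.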
Diffstat (limited to 'tensorflow/core/util/command_line_flags.cc')
0 files changed, 0 insertions, 0 deletions