diff options
author | A. Unique TensorFlower <gardener@tensorflow.org> | 2017-07-17 12:11:12 -0700 |
---|---|---|
committer | TensorFlower Gardener <gardener@tensorflow.org> | 2017-07-17 12:15:56 -0700 |
commit | 0c144afecef6800589d255dd990a9a88e9f94b23 (patch) | |
tree | 732fedd9bd96080d10e093a49cdb35ea5aa4aad9 /tensorflow/core/util/command_line_flags.cc | |
parent | 09e9b157787f2d03ae864569de33e62424657450 (diff) |
CPU IR Emitter: Lower slice operations with memcpy.
This changes the way the CPU backend emits slice operations
so that the maximal contiguous blocks get copied with memcpy.
Without this change, LLVM's optimizers prefer to unroll the
elementwise loops emitted for doing slice. This leads to increased
code size. With this change, the inliner only inlines the memcpy
when it is small enough.
This change also adds a test that exercises the logic to find
maximal contiguous blocks.
PiperOrigin-RevId: 162249449
Diffstat (limited to 'tensorflow/core/util/command_line_flags.cc')
0 files changed, 0 insertions, 0 deletions