path: root/third_party/clang_toolchain
author: A. Unique TensorFlower <gardener@tensorflow.org> 2018-06-07 02:59:00 -0700
committer: TensorFlower Gardener <gardener@tensorflow.org> 2018-06-07 03:01:30 -0700
commit: 3ddc925c8559f2989f3904f271f2d4175c2f3302
tree: bda00285dce010a0928f4e1517e776acb5641be4 /third_party/clang_toolchain
parent: c70b7128bfb9f0283c60bbec8fd7b0c12f741d95
Improve performance of HloComputation::MakeInstructionPostOrder
Previously it used the same infrastructure as HloInstruction::Accept, which caused high overhead for large models for two reasons: the extra work needed to support modifying the graph during iteration, and the lack of caching on graphs with multiple sinks. The new code is a very simple implementation of an iterative DFS-based topological sort.
PiperOrigin-RevId: 199606688
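As an illustration of the technique the message names, here is a minimal sketch of an iterative DFS-based post-order over an operand graph. It is not the code from this commit: `Node`, `VisitState`, and `MakePostOrder` are hypothetical stand-ins for the real HLO types, and the graph is assumed to be acyclic. The one detail it shares with the description above is that a single visited map is reused across all roots, so a subgraph reachable from several sinks is walked only once.

```cpp
#include <cstddef>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for an instruction: a node that lists its operands.
struct Node {
  int id;
  std::vector<const Node*> operands;
};

enum class VisitState { kVisiting, kVisited };

// Returns every node reachable from `roots`, ordered so that each node
// appears after all of its operands. Assumes the graph has no cycles.
std::vector<const Node*> MakePostOrder(const std::vector<const Node*>& roots) {
  std::vector<const Node*> post_order;
  // One map shared by all roots, so work is not repeated per sink.
  std::unordered_map<const Node*, VisitState> state;
  std::vector<const Node*> stack;  // explicit stack instead of recursion

  for (const Node* root : roots) {
    stack.push_back(root);
    while (!stack.empty()) {
      const Node* current = stack.back();
      auto it = state.find(current);
      if (it != state.end()) {
        // Already seen: either its operands are now done (emit it), or it
        // is a duplicate stack entry for a finished node (just drop it).
        stack.pop_back();
        if (it->second == VisitState::kVisiting) {
          it->second = VisitState::kVisited;
          post_order.push_back(current);
        }
        continue;
      }
      // First visit: leave the node on the stack and push unseen operands.
      state.emplace(current, VisitState::kVisiting);
      for (const Node* operand : current->operands) {
        if (state.find(operand) == state.end()) {
          stack.push_back(operand);
        }
      }
    }
  }
  return post_order;
}
```

Calling `MakePostOrder` with several sink nodes as roots yields one list in which every operand precedes its users. Each node is expanded once and each edge is examined once, so the cost is linear in the size of the graph.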
Diffstat (limited to 'third_party/clang_toolchain')
0 files changed, 0 insertions, 0 deletions