diff options
author | Derek Murray <mrry@google.com> | 2018-09-24 14:34:57 -0700 |
---|---|---|
committer | TensorFlower Gardener <gardener@tensorflow.org> | 2018-09-24 14:39:31 -0700 |
commit | aab3c53e1484404a70565324d1231c4e6ead7425 (patch) | |
tree | ec04217c08795c7a30ee5bf7af86d7f2b81ec53c /tensorflow/core/BUILD | |
parent | 567a7196494a79988337bcb726c0c5d453298c99 (diff) |
Inline kernel tracing logic into `ExecutorState::Process()`.
All devices implement the same tracing logic in an override of `Device::Compute()`. However, that logic does not have access to the cached `NodeItem::kernel_is_expensive` bit for the kernel, so it must make a virtual call to `OpKernel::IsExpensive()`. By inlining the logic into `ExecutorState::Process()`, we avoid making an unnecessary virtual call on each kernel invocation (when a trace controller is attached).
PiperOrigin-RevId: 214332492
Diffstat (limited to 'tensorflow/core/BUILD')
-rw-r--r-- | tensorflow/core/BUILD | 1 |
1 files changed, 0 insertions, 1 deletions
diff --git a/tensorflow/core/BUILD b/tensorflow/core/BUILD index d914fdb96c..59b7dd04e9 100644 --- a/tensorflow/core/BUILD +++ b/tensorflow/core/BUILD @@ -2803,7 +2803,6 @@ CORE_CPU_LIB_HEADERS = CORE_CPU_BASE_HDRS + [ "common_runtime/stats_publisher_interface.h", "common_runtime/step_stats_collector.h", "common_runtime/threadpool_device.h", - "common_runtime/tracing_device.h", "common_runtime/process_state.h", "common_runtime/pool_allocator.h", "graph/gradients.h", |