author Derek Murray <mrry@google.com> 2018-09-24 14:34:57 -0700
committer TensorFlower Gardener <gardener@tensorflow.org> 2018-09-24 14:39:31 -0700
commit aab3c53e1484404a70565324d1231c4e6ead7425
tree ec04217c08795c7a30ee5bf7af86d7f2b81ec53c /tensorflow/core/BUILD
parent 567a7196494a79988337bcb726c0c5d453298c99
Inline kernel tracing logic into `ExecutorState::Process()`.
All devices implement the same tracing logic in an override of `Device::Compute()`. However, that logic does not have access to the cached `NodeItem::kernel_is_expensive` bit for the kernel, so it must make a virtual call to `OpKernel::IsExpensive()`. By inlining the logic into `ExecutorState::Process()`, we avoid making an unnecessary virtual call on each kernel invocation (when a trace controller is attached).

PiperOrigin-RevId: 214332492
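The trade-off described in the message can be illustrated with a minimal, self-contained sketch. This is not the TensorFlow code: `Kernel`, `ExpensiveKernel`, `NodeItem`, `ExpensiveFlagViaDevice` and `ExpensiveFlagInExecutor` are hypothetical stand-ins, and the call counter exists only to make the saved virtual calls visible. It assumes the flag is computed once when the node item is built and merely read afterwards.

```cpp
#include <cassert>

// Hypothetical stand-in for OpKernel: the expense query is virtual.
struct Kernel {
  virtual ~Kernel() = default;
  virtual bool IsExpensive() const = 0;
  // Illustration only: counts how often the virtual query ran.
  mutable int is_expensive_calls = 0;
};

struct ExpensiveKernel : Kernel {
  bool IsExpensive() const override {
    ++is_expensive_calls;
    return true;
  }
};

// Hypothetical stand-in for NodeItem: caches the answer once at setup time.
struct NodeItem {
  explicit NodeItem(const Kernel* k) : kernel(k), kernel_is_expensive(false) {
    assert(k != nullptr);
    kernel_is_expensive = k->IsExpensive();  // the only virtual call
  }
  const Kernel* kernel;
  bool kernel_is_expensive;
};

// "Before": device-side tracing sees only the kernel, so every invocation
// pays for a virtual call to obtain the flag.
bool ExpensiveFlagViaDevice(const Kernel& kernel) {
  return kernel.IsExpensive();
}

// "After": executor-side tracing has the node item and reads the cached bit.
bool ExpensiveFlagInExecutor(const NodeItem& item) {
  return item.kernel_is_expensive;
}
```

In this sketch, repeated calls to `ExpensiveFlagViaDevice` keep incrementing the counter, while `ExpensiveFlagInExecutor` leaves it unchanged after the node item has been constructed.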
Diffstat (limited to 'tensorflow/core/BUILD')
-rw-r--r-- tensorflow/core/BUILD | 1
1 files changed, 0 insertions, 1 deletions
diff --git a/tensorflow/core/BUILD b/tensorflow/core/BUILD
index d914fdb96c..59b7dd04e9 100644
--- a/tensorflow/core/BUILD
+++ b/tensorflow/core/BUILD
@@ -2803,7 +2803,6 @@ CORE_CPU_LIB_HEADERS = CORE_CPU_BASE_HDRS + [
"common_runtime/stats_publisher_interface.h",
"common_runtime/step_stats_collector.h",
"common_runtime/threadpool_device.h",
- "common_runtime/tracing_device.h",
"common_runtime/process_state.h",
"common_runtime/pool_allocator.h",
"graph/gradients.h",