| Commit message (Collapse) | Author | Age |
|
|
|
|
|
|
|
|
| |
the duration of a single RunInternal() call from RunHandlerPool. It is used for
running inter-op closures with a global scheduler (which in the future) to
improve both median and tail latency (for use-cases like CPU inference).
In the case that global pools aren't used, this change should be a no-op.
PiperOrigin-RevId: 214992852
|
|
|
|
| |
PiperOrigin-RevId: 214853846
|
|
|
|
|
|
|
|
|
| |
the duration of a single RunInternal() call from RunHandlerPool.
We want to leverage this abstraction for improving the cross-session inter-op
parallelism for lower latency inference in the future.
In the case that global pools aren't used, this change should be a no-op.
PiperOrigin-RevId: 214818187
|
|
PiperOrigin-RevId: 207781405
|