aboutsummaryrefslogtreecommitdiffhomepage
path: root/tensorflow/core/distributed_runtime/rpc_collective_executor_mgr.cc
Commit message (Collapse)AuthorAge
* Add distributed model GetStepSequenceAsync implementation toGravatar A. Unique TensorFlower2018-07-03
| | | | | | | | | | distributed_runtiume/RpcCollectiveExecutorMgr. In a distributed environment WorkerInterface is going to call this method at the group leader when fielding a GetStepSequence request from one of the other workers. PiperOrigin-RevId: 203196543
* Collective Ops Part 8Gravatar A. Unique TensorFlower2018-06-08
Enable collective op execution in distibuted mode: Pass collective_graph_key into graph building and step execution contexts (MasterSession) where it triggers allocation of an RpcCollectiveExecutorMgr that becomes accessible via the WorkerEnv and MasterEnv. The collective_graph_key is used to synchronize step_ids (which are otherwise random) between otherwise independent graph executions that contain collective ops that need to rendezvous. All APIs for using collectives are still non-public and experimental. PiperOrigin-RevId: 199879087