aboutsummaryrefslogtreecommitdiffhomepage
path: root/tensorflow/python/training/server_lib.py
Commit message (Collapse)AuthorAge
* Remove usage of magic-api-link syntax from source files.Gravatar Mark Daoust2018-08-09
| | | | | | | | | | | | | | | | | | | | Back-ticks are now converted to links in the api_docs generator. With the new docs repo we're moving to simplify the docs pipeline, and make everything more readable. By doing this we no longer get test failures for symbols that don't exist (`tf.does_not_exist` will not get a link). There is also no way, not to set custom link text. That's okay. This is the result of the following regex replacement (+ a couple of manual edits.): re: @\{([^$].*?)(\$.+?)?} sub: `\1` Which does the following replacements: "@{tf.symbol}" --> "`tf.symbol`" "@{tf.symbol$link_text}" --> "`tf.symbol`" PiperOrigin-RevId: 208042358
* Make protocol used in estimator customizable.Gravatar Li Liangbin2018-07-14
| | | | | | | | | | Example code as follow: config = tf.estimator.RunConfig(protocol='grpc+verbs') nn = tf.estimator.Estimator(model_fn=model_fn, model_dir=model_dir, params=params, config=config)
* Adding tf_export decorators/calls to TensorFlow functions and constants.Gravatar Anna R2018-01-31
| | | | PiperOrigin-RevId: 183936100
* Added __str__ to ClusterSpec. This will improve the logs users are attaching ↵Gravatar Mustafa Ispir2017-12-07
| | | | | | | | to github issues. For example : #14942 PiperOrigin-RevId: 178296636
* Implement ClusterSpec Propagation in TF MasterGravatar Brennan Saeta2017-05-04
| | | | | | | | | | | ClusterSpec propagation is a capability upgrade for TensorFlow that should make it much easier to (1) build distributed TensorFlow clusters, and (2) handle node failures. The ClusterSpec propagation capability allows TensorFlow workers to be booted independently of each other, and with no knowledge about others. The client can then construct a ClusterDef (ClusterSpec), and then send it to the TF master at session creation. The master in turn then propagates the ClusterDef along to all of the workers. Change: 155159972
* Fix links according to new convention.Gravatar Patrick Nguyen2017-02-14
| | | | Change: 147499520
* TASK 1. Python module doc stringsGravatar A. Unique TensorFlower2017-02-13
| | | | | TASK 2. //training/... Python class doc strings Change: 147434408
* Add syntax highlighting and fix some lists in docs.Gravatar Patrick Nguyen2016-09-28
| | | | Change: 134626795
* Remove unnecessary imports in some python files.Gravatar A. Unique TensorFlower2016-09-19
| | | | Change: 133602571
* Support sparse jobs for TensorFlow gRPC servers.Gravatar Derek Murray2016-08-27
| | | | | | | | | | | | | | | | | | | | | | | | | | A TensorFlow server (tf.train.Server) is configured with a list of jobs, where each job includes the addresses of the tasks in that job. At present, the tasks are provided as a dense list, and a server must be configured with the addresses of all tasks in every job, even when that server might never contact a particular task. This CL adds support for configuring individual jobs with a sparse mapping from task index to network address. The net effect is that a server in (e.g.) a worker job need not know the addresses of the other worker tasks. This reduces the amount of configuration needed in two ways: (i) the cluster specification for an individual server contains only the server with which it makes contact, and (ii) there is no need to specify a device filter to prevent the server pinging all known tasks on session creation (which can lead to unavailability when unrelated tasks fail). This CL also cleans up the code in grpc_channel.{cc,h} in three ways: 1. Move unnecessarily public methods into an anonymous namespace. 2. Shorten some of the unwieldy function and class names. 3. Use std::move() where appropriate to avoid copying vectors and maps of strings. Change: 131490850
* Adding magic methods __bool__, __eq__, and __ne__ to ClusterSpec to ↵Gravatar A. Unique TensorFlower2016-08-23
| | | | | | facilitate comparisons. Change: 131096794
* Merge changes from github.Gravatar Benoit Steiner2016-08-18
| | | | Change: 130695673
* Add a simple way to set the `default_session_config` on a `tf.train.Server`.Gravatar Derek Murray2016-06-27
| | | | | | | | | This makes it easier to set properties such as the `gpu_options.per_process_gpu_memory_fraction`, which have to be set on the server, rather than individual serssions. Fixes #3057. Change: 126009942
* Update copyright for 3p/tf/python.Gravatar A. Unique TensorFlower2016-06-02
| | | | Change: 123900456
* Next pass at converting all return statuses to errors.OpError exceptionsGravatar A. Unique TensorFlower2016-05-05
| | | | | | | | | in Python. - Add a tf_status util file to convert between Status and TF_Status. - Use TF_Status for the swigged APIs in session, checkpoint_reader, and server_lib. - Converts all these tf_status to exceptions in Python with the new context handler. - Remove the old StatusNotOK exception. Change: 121644982
* Fixed bug in `tf.train.ClusterSpec` constructor.Gravatar Derek Murray2016-04-15
| | | | | | | | Creating a `tf.train.ClusterSpec` from another ClusterSpec was broken, which in turn broke creating a `tf.train.Server` from a ClusterSpec. Fixes #1961. Change: 119954117
* Clarifies some documentation in server_lib.pyGravatar A. Unique TensorFlower2016-04-11
| | | | Change: 119533248
* Fix segfault in grpc_server_lib when an invalid hostname-port pair is passed.Gravatar Derek Murray2016-04-08
| | | | | | Previously, if the port was undefined, an out-of-bounds access would be made. This change adds the appropriate checks. Change: 119424297
* Clean up the interface to the distributed runtime from PythonGravatar Derek Murray2016-04-07
This is a breaking change! The following classes have been renamed: tf.GrpcServer -> tf.train.Server tf.ClusterSpec -> tf.train.ClusterSpec tf.ServerDef -> tf.train.ServerDef tf.JobDef -> tf.train.JobDef tf.ClusterDef -> tf.train.ClusterDef The constructor for tf.train.Server is more permissive and now accepts tf.train.ClusterSpec, tf.train.ClusterDef, and dictionary inputs for specifying the cluster part of the server. For consistency, the server library moves from python/client to python/training. Change: 119335624