Updated speech commands example to use new dataset

PiperOrigin-RevId: 196557132
author: Pete Warden <petewarden@google.com> 2018-05-14 13:24:58 -0700
committer: TensorFlower Gardener <gardener@tensorflow.org> 2018-05-14 13:27:42 -0700
commit: 52cb1594172691bd6ea9048358652585f0ea1920 (patch)
tree: 33f89cc1fd85a3844bfd9c0e314d336f085c8c76 /tensorflow
parent: e528e5ab82fafe1cf8f5d69f9b18426af1b51d09 (diff)
2 files changed, 10 insertions, 8 deletions
diff --git a/tensorflow/docs_src/tutorials/audio_recognition.md b/tensorflow/docs_src/tutorials/audio_recognition.md
index 372ab47df7..d7a8da6f96 100644
--- a/tensorflow/docs_src/tutorials/audio_recognition.md
+++ b/tensorflow/docs_src/tutorials/audio_recognition.md
@@ -25,13 +25,15 @@ python tensorflow/examples/speech_commands/train.py
 ```
 
 The script will start off by downloading the [Speech Commands
-dataset](https://storage.cloud.google.com/download.tensorflow.org/data/speech_commands_v0.01.tar.gz),
-which consists of 65,000 WAVE audio files of people saying thirty different
-words. This data was collected by Google and released under a CC BY license, and
-you can help improve it by [contributing five minutes of your own
+dataset](https://storage.cloud.google.com/download.tensorflow.org/data/speech_commands_v0.02.tar.gz),
+which consists of over 105,000 WAVE audio files of people saying thirty
+different words. This data was collected by Google and released under a CC BY
+license, and you can help improve it by [contributing five minutes of your own
 voice](https://aiyprojects.withgoogle.com/open_speech_recording). The archive is
-over 1GB, so this part may take a while, but you should see progress logs, and
-once it's been downloaded once you won't need to do this step again.
+over 2GB, so this part may take a while, but you should see progress logs, and
+once it's been downloaded once you won't need to do this step again. You can
+find more information about this dataset in this
+[Speech Commands paper](https://arxiv.org/abs/1804.03209).
 
 Once the downloading has completed, you'll see logging information that looks
 like this:
@@ -229,7 +231,7 @@ You can also build this application yourself, since it's open source and
 [available as part of the TensorFlow repository on
 github](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android#building-in-android-studio-using-the-tensorflow-aar-from-jcenter).
 By default it downloads [a pretrained model from
-tensorflow.org](http://download.tensorflow.org/models/speech_commands_v0.01.zip),
+tensorflow.org](http://download.tensorflow.org/models/speech_commands_v0.02.zip),
 but you can easily [replace it with a model you've trained
 yourself](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android#install-model-files-optional).
 If you do this, you'll need to make sure that the constants in [the main
diff --git a/tensorflow/examples/speech_commands/train.py b/tensorflow/examples/speech_commands/train.py
index f084931215..fc28eb0631 100644
--- a/tensorflow/examples/speech_commands/train.py
+++ b/tensorflow/examples/speech_commands/train.py
@@ -288,7 +288,7 @@ if __name__ == '__main__':
       '--data_url',
       type=str,
       # pylint: disable=line-too-long
-      default='http://download.tensorflow.org/data/speech_commands_v0.01.tar.gz',
+      default='http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz',
       # pylint: enable=line-too-long
       help='Location of speech training data archive on the web.')
   parser.add_argument(
author	Pete Warden <petewarden@google.com>	2018-05-14 13:24:58 -0700
committer	TensorFlower Gardener <gardener@tensorflow.org>	2018-05-14 13:27:42 -0700
commit	52cb1594172691bd6ea9048358652585f0ea1920 (patch)
tree	33f89cc1fd85a3844bfd9c0e314d336f085c8c76 /tensorflow
parent	e528e5ab82fafe1cf8f5d69f9b18426af1b51d09 (diff)