author | Pete Warden <petewarden@google.com> | 2018-05-14 13:24:58 -0700 |
---|---|---|
committer | TensorFlower Gardener <gardener@tensorflow.org> | 2018-05-14 13:27:42 -0700 |
commit | 52cb1594172691bd6ea9048358652585f0ea1920 (patch) | |
tree | 33f89cc1fd85a3844bfd9c0e314d336f085c8c76 /tensorflow | |
parent | e528e5ab82fafe1cf8f5d69f9b18426af1b51d09 (diff) |
Updated speech commands example to use new dataset
PiperOrigin-RevId: 196557132
Diffstat (limited to 'tensorflow')
-rw-r--r-- | tensorflow/docs_src/tutorials/audio_recognition.md | 16 |
-rw-r--r-- | tensorflow/examples/speech_commands/train.py | 2 |
2 files changed, 10 insertions, 8 deletions
diff --git a/tensorflow/docs_src/tutorials/audio_recognition.md b/tensorflow/docs_src/tutorials/audio_recognition.md
index 372ab47df7..d7a8da6f96 100644
--- a/tensorflow/docs_src/tutorials/audio_recognition.md
+++ b/tensorflow/docs_src/tutorials/audio_recognition.md
@@ -25,13 +25,15 @@ python tensorflow/examples/speech_commands/train.py
 ```
 
 The script will start off by downloading the [Speech Commands
-dataset](https://storage.cloud.google.com/download.tensorflow.org/data/speech_commands_v0.01.tar.gz),
-which consists of 65,000 WAVE audio files of people saying thirty different
-words. This data was collected by Google and released under a CC BY license, and
-you can help improve it by [contributing five minutes of your own
+dataset](https://storage.cloud.google.com/download.tensorflow.org/data/speech_commands_v0.02.tar.gz),
+which consists of over 105,000 WAVE audio files of people saying thirty
+different words. This data was collected by Google and released under a CC BY
+license, and you can help improve it by [contributing five minutes of your own
 voice](https://aiyprojects.withgoogle.com/open_speech_recording). The archive is
-over 1GB, so this part may take a while, but you should see progress logs, and
-once it's been downloaded once you won't need to do this step again.
+over 2GB, so this part may take a while, but you should see progress logs, and
+once it's been downloaded once you won't need to do this step again. You can
+find more information about this dataset in this
+[Speech Commands paper](https://arxiv.org/abs/1804.03209).
 
 Once the downloading has completed, you'll see logging information that looks
 like this:
@@ -229,7 +231,7 @@ You can also build this application yourself, since it's open source and
 [available as part of the TensorFlow repository on
 github](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android#building-in-android-studio-using-the-tensorflow-aar-from-jcenter).
 By default it downloads [a pretrained model from
-tensorflow.org](http://download.tensorflow.org/models/speech_commands_v0.01.zip),
+tensorflow.org](http://download.tensorflow.org/models/speech_commands_v0.02.zip),
 but you can easily [replace it with a model you've trained
 yourself](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android#install-model-files-optional).
 If you do this, you'll need to make sure that the constants in [the main
diff --git a/tensorflow/examples/speech_commands/train.py b/tensorflow/examples/speech_commands/train.py
index f084931215..fc28eb0631 100644
--- a/tensorflow/examples/speech_commands/train.py
+++ b/tensorflow/examples/speech_commands/train.py
@@ -288,7 +288,7 @@ if __name__ == '__main__':
       '--data_url',
       type=str,
       # pylint: disable=line-too-long
-      default='http://download.tensorflow.org/data/speech_commands_v0.01.tar.gz',
+      default='http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz',
       # pylint: enable=line-too-long
       help='Location of speech training data archive on the web.')
   parser.add_argument(
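
The `train.py` change only alters the default value of the `--data_url` argparse flag, so a caller can presumably still supply a different archive URL explicitly. A minimal sketch, assuming a standalone parser that mirrors the flag shown in the diff; the `build_parser` helper and the two constants are hypothetical names for illustration and are not part of `train.py`:

```python
import argparse

# Default archive URL after this commit, copied from the diff.
NEW_DATA_URL = 'http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz'
# Default archive URL before this commit, also copied from the diff.
OLD_DATA_URL = 'http://download.tensorflow.org/data/speech_commands_v0.01.tar.gz'


def build_parser():
  """Hypothetical helper: a parser that defines only the --data_url flag."""
  parser = argparse.ArgumentParser()
  parser.add_argument(
      '--data_url',
      type=str,
      default=NEW_DATA_URL,
      help='Location of speech training data archive on the web.')
  return parser


# No flag given: the new v0.02 default applies.
default_args = build_parser().parse_args([])
# Flag given: the caller's URL takes precedence over the default.
pinned_args = build_parser().parse_args(['--data_url', OLD_DATA_URL])
```

On the command line this would correspond to passing `--data_url` to `train.py`. Whether the v0.01 archive remains hosted is not stated in the commit.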