The Google Cloud Speech API lets you convert audio to text by applying powerful neural network models through an easy-to-use API.
Speech methods:
sync_recognize(), async_recognize(), streaming_recognize()
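
The three methods suit different kinds of audio: a blocking call for short clips, a long-running operation for longer recordings, and a stream for live input. As a rough sketch of how one might choose between them, the helper below maps a clip's duration to a method name. The helper `choose_speech_method` and the constant `SYNC_LIMIT_SECONDS` are hypothetical and not part of the library. The 60-second cutoff is an assumption based on the limit commonly documented for synchronous requests, so check the current service quotas before relying on it.

```python
# Hypothetical helper, not part of google-cloud-python: it only picks which
# of the three Speech methods is likely to suit a given piece of audio.

# Assumed cutoff for synchronous requests; verify against current quotas.
SYNC_LIMIT_SECONDS = 60


def choose_speech_method(duration_seconds=None, live=False):
    """Return the name of the Speech method suited to the audio.

    duration_seconds: clip length in seconds, or None if unknown.
    live: True when audio arrives in real time (e.g. from a microphone).
    """
    # Live audio, or audio of unknown length, is treated as a stream.
    if live or duration_seconds is None:
        return "streaming_recognize"
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    # Short clips can be handled by a single blocking request.
    if duration_seconds <= SYNC_LIMIT_SECONDS:
        return "sync_recognize"
    # Longer recordings go through the asynchronous operation.
    return "async_recognize"


if __name__ == "__main__":
    for seconds in (15, 60, 600):
        print(seconds, choose_speech_method(seconds))
    print("live", choose_speech_method(live=True))
```

The returned string is only a label; the actual calls and their arguments depend on the installed library version, so consult the usage guide for the exact signatures.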
For more examples see: https://googlecloudplatform.github.io/google-cloud-python/stable/speech-usage.html