In Google Cloud Console:
- Select a project, or create a new one.
- Go to "Transcriptions" via the search bar, and enable the Speech-to-Text API.
- Go to "Credentials" via the search bar. Create a new service account and grant it the "Storage Admin" and "Cloud Speech Client" roles.
- If you haven't already, install the Google Cloud SDK:
curl https://sdk.cloud.google.com | bash
- Create an authorization key file:
gcloud config set account USER@PROJECT.iam.gserviceaccount.com
gcloud auth login
gcloud iam service-accounts keys create ~/google-cloud-sdk/AUTH.json --iam-account USER@PROJECT.iam.gserviceaccount.com
- If you haven't already, install Go:
  - macOS:
    brew install go
  - Linux (Debian/Ubuntu):
    sudo apt-get install golang-go
- Install the transcribe command line tool:
go install github.com/jamesmurdza/transcribe/cmd/transcribe@HEAD
- Run these commands to prepare your environment:
export GOOGLE_APPLICATION_CREDENTIALS=~/google-cloud-sdk/AUTH.json
export PATH=$PATH:~/go/bin
- Run the command as follows:
transcribe --project=PROJECT ./file.wav
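
Before the first run, it can help to confirm that the setup steps above took effect. The following is a minimal sketch, not part of the transcribe tool: the function names `check_tools` and `check_credentials` are hypothetical, and the script only assumes that `gcloud`, `go`, and `transcribe` should be on your `PATH` and that `GOOGLE_APPLICATION_CREDENTIALS` should point at the key file created earlier.

```shell
#!/usr/bin/env bash
# Preflight check for the setup steps above (illustrative sketch only).

# check_tools prints one line per command that is not on PATH
# and returns the number of missing commands (0 means all found).
check_tools() {
  local missing=0
  local tool
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "missing: $tool"
      missing=$((missing + 1))
    fi
  done
  return "$missing"
}

# check_credentials verifies that GOOGLE_APPLICATION_CREDENTIALS
# is set and names a readable file.
check_credentials() {
  local key="${GOOGLE_APPLICATION_CREDENTIALS:-}"
  if [ -z "$key" ]; then
    echo "GOOGLE_APPLICATION_CREDENTIALS is not set"
    return 1
  fi
  if [ ! -r "$key" ]; then
    echo "key file not readable: $key"
    return 1
  fi
  return 0
}

# Report problems without aborting, so every issue is listed in one pass.
check_tools gcloud go transcribe || true
check_credentials || true
```

If the script prints nothing, the tools and the key file were found. A `missing: transcribe` line usually means `~/go/bin` has not been added to `PATH` in the current shell.
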

| Option | Pricing and features |
|---|---|
| Google Cloud Speech-to-Text | 1 hour/month free forever (subject to change), then $0.024/minute |
| AWS Transcribe | 1 hour/month free for the first 12 months |
| Otter AI | 6,000 minutes included in the $13/month subscription; extra features, including collaborative editing |
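
To make the Google Cloud figures concrete, here is a rough sketch of the monthly cost arithmetic, assuming the prices in the table still hold (60 free minutes, then 2.4 cents, i.e. $0.024, per minute). The function name `google_stt_cost` is hypothetical, and real billing may round or meter usage differently.

```shell
#!/usr/bin/env bash
# Estimate a monthly Google Cloud Speech-to-Text bill from the table's figures.
# Assumption: the first 60 minutes are free and each further minute costs $0.024.
google_stt_cost() {
  awk -v minutes="$1" 'BEGIN {
    billable = minutes - 60          # subtract the free hour
    if (billable < 0) billable = 0   # usage inside the free tier costs nothing
    printf "%.2f\n", billable * 0.024
  }'
}

google_stt_cost 45    # prints 0.00 (within the free hour)
google_stt_cost 300   # prints 5.76 (240 billable minutes)
```

Under these assumptions, the bill reaches the $13 price of the Otter AI subscription at roughly 600 minutes of audio per month (about 542 billable minutes plus the free hour).
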