Batch transcribe long audio files using Google Cloud

Installation
Usage
Alternatives

Installation

Set up a Google Cloud project and service account

In Google Cloud Console:

Select a project in Google Cloud Console, or create a new one.
Go to "Transcriptions" via the search bar, and enable the Speech-to-Text API.
Go to "Credentials" via the search bar. Create a new service account. Grant it access to the "Storage Admin" and "Cloud Speech Client" roles.

Install the Google Cloud SDK

If you haven't, install the Google Cloud SDK:

curl https://sdk.cloud.google.com | bash

Create an authorization key file:

gcloud config set account USER@PROJECT.iam.gserviceaccount.com
gcloud auth login
gcloud iam service-accounts keys create ~/google-cloud-sdk/AUTH.json --iam-account USER@PROJECT.iam.gserviceaccount.com

Install the CLI

If you haven't already, install go:

macOS: brew install go
Linux: apt-get install go

Install the transcribe command line tool:

go install github.com/jamesmurdza/transcribe/cmd/transcribe@HEAD

Usage

Run these commands to prepare your environment:

export GOOGLE_APPLICATION_CREDENTIALS=~/google-cloud-sdk/AUTH.json
export PATH=$PATH:~/go/bin

Run the command as follows:

transcribe --project=PROJECT ./file.wav

Alternatives

Option	Features
Google Cloud Speech-to-Text	- 1 hour/month free forever (subject to change) then 2.4c/minute
AWS Transcribe	- 1 hour/month free for the first 12 months
Otter AI	- 6000 minutes included in $13/month subscription - Extra features such as including collaborative editing

jamesmurdza/Batch-Transcribe-with-Google-Cloud.md

Batch transcribe long audio files using Google Cloud

Contents

Installation

Set up a Google Cloud project and service account

Install the Google Cloud SDK

Install the CLI

Usage

Alternatives