alotaiba/google_speech2text.md

Created February 3, 2012 13:20

Google Speech To Text API

Base URL: https://www.google.com/speech-api/v1/recognize
It accepts POST requests with a voice file encoded in FLAC format, plus query parameters that control the recognition.

Query Parameters

client
The name of the client you're connecting from. For spoofing purposes, let's use chromium.

lang
Speech language, for example ar-QA for Qatari Arabic or en-US for U.S. English.

maxresults
Maximum number of results to return for the utterance.

POST

body
Should contain the FLAC-encoded voice data as binary.

HTTP Header

Content-Type
Should be audio/x-flac; rate=16000; which states the MIME type and the sample rate of the FLAC file.

User-Agent
Can be the client's user agent string. For spoofing purposes, we'll use Chrome's.

Examples

These examples assume you have a voice file encoded in FLAC called alsalam-alikum.flac.

wget

This saves the JSON response in a file called recognized.json.

    wget --post-file='alsalam-alikum.flac' \
        --user-agent='Mozilla/5.0 (Macintosh; Intel Mac OS X 10_6_8) AppleWebKit/535.7 (KHTML, like Gecko) Chrome/16.0.912.77 Safari/535.7' \
        --header='Content-Type: audio/x-flac; rate=16000;' \
        -O 'recognized.json' \
        'https://www.google.com/speech-api/v1/recognize?client=chromium&lang=ar-QA&maxresults=10'

curl

    curl -X POST \
        --data-binary @alsalam-alikum.flac \
        --user-agent 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_6_8) AppleWebKit/535.7 (KHTML, like Gecko) Chrome/16.0.912.77 Safari/535.7' \
        --header 'Content-Type: audio/x-flac; rate=16000;' \
        'https://www.google.com/speech-api/v1/recognize?client=chromium&lang=ar-QA&maxresults=10'
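
The same request can also be assembled from a script. The following is a minimal Python sketch using only the standard library; it mirrors the query parameters and headers described above. The names build_request, BASE_URL, and USER_AGENT are hypothetical helpers chosen for illustration, and the function only builds the request without sending it.

```python
import urllib.parse
import urllib.request

# Endpoint and user agent copied from the wget/curl examples.
BASE_URL = "https://www.google.com/speech-api/v1/recognize"
USER_AGENT = (
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_6_8) AppleWebKit/535.7 "
    "(KHTML, like Gecko) Chrome/16.0.912.77 Safari/535.7"
)


def build_request(flac_bytes, lang="ar-QA", maxresults=10, rate=16000):
    """Build (but do not send) a POST request carrying FLAC audio.

    Hypothetical helper: the parameter names follow the gist, the
    function name and defaults are assumptions for illustration.
    """
    query = urllib.parse.urlencode(
        {"client": "chromium", "lang": lang, "maxresults": maxresults}
    )
    headers = {
        # MIME type plus the sample rate of the FLAC file.
        "Content-Type": f"audio/x-flac; rate={rate};",
        "User-Agent": USER_AGENT,
    }
    return urllib.request.Request(
        f"{BASE_URL}?{query}", data=flac_bytes, headers=headers, method="POST"
    )
```

To send it, read alsalam-alikum.flac in binary mode, pass the bytes to build_request, and hand the result to urllib.request.urlopen. Whether the server still answers is not something this sketch can guarantee.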

amsehili commented Oct 27, 2015

Hello everybody,

Some time ago I created this shell script which packages everything you need to use the API v2 (record data for a given duration or use a file, specify language, filter out results, etc.): https://github.com/amsehili/gspeech-rec

For more details about the reverse engineering being used, check out this article: https://aminesehili.wordpress.com/2015/02/08/on-the-use-of-googles-speech-recognition-api-version-2/

Cheers!

rogo21 commented Feb 4, 2016

How can I divide audio into 15-second frames using Java code or the command line?

ghost commented Jul 21, 2016

Hi, https://www.google.com/speech-api/v2/recognize returns "400. That’s an error."
I am in Canada.

Swethamr402 commented Jul 29, 2016

Hi, I am sending a POST request to this URL: https://www.google.com/speech-api/v2/recognize?output=json&lang=en-us&key=AIzaSyBOti4mM-6x9WDnZIjIeyEU21OpBXqWBgw&results=6&pfilter=2
Response Code: 200
{"result":[]}

I get the response above, but no text for the recorded file. How do I get the transcript?

IvanZhao commented Sep 5, 2016

Hi everybody,
When I POST to https://speech.googleapis.com/v1beta1/speech:asyncrecognize, I receive an error message like this:

    {
      "code": 403,
      "errors": [
        {
          "domain": "global",
          "message": "Requests from this Android client application are blocked.",
          "reason": "forbidden"
        }
      ],
      "message": "Requests from this Android client application are blocked.",
      "status": "PERMISSION_DENIED"
    }

Does anybody know how to resolve this problem?
Thanks.

indrabayu commented Nov 24, 2016 (edited)

    using System;
    using System.IO;
    using System.Net.Http;
    using System.Net.Http.Headers;
    using System.Threading.Tasks;
    using Newtonsoft.Json;

    // Posts FLAC audio (16 kHz) to the v2 endpoint and returns the top transcript.
    public static async Task<string> RequestGoogleSpeechAPIAsync(byte[] byteArray)
    {
        // Content-Type: audio/x-flac; rate=16000
        var mediaType = new MediaTypeHeaderValue("audio/x-flac");
        mediaType.Parameters.Add(new NameValueHeaderValue("rate", "16000"));

        var url = "https://www.google.com/speech-api/v2/recognize?output=json&lang=en-US&key=";
        var apiKey = "AIzaSyBOti4mM-6x9WDnZIjIeyEU21OpBXqWBgw";
        var uri = new Uri(url + apiKey);

        using (var httpClient = new HttpClient())
        using (var ms = new MemoryStream(byteArray, 0, byteArray.Length))
        {
            var content = new StreamContent(ms);
            content.Headers.ContentType = mediaType;

            var result = await httpClient.PostAsync(uri, content);
            var responseFromServer = await result.Content.ReadAsStringAsync();

            // The response is split into lines and the second line is parsed as the result.
            // Resuls is the model class from this project (name kept as in the original).
            var responseArray = responseFromServer.Split('\n');
            var responseJson = JsonConvert.DeserializeObject<SpajamHonsen.Models.GoogleSpeechAPIResponseModel.Resuls>(responseArray[1]);

            return responseJson.result[0].alternative[0].transcript;
        }
    }
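
The snippet above splits the response on newlines and reads the second line, which suggests the v2 endpoint can return several JSON objects, one per line, with an empty {"result":[]} among them. As a rough Python sketch of a more defensive parse, the hypothetical helper below scans every line and returns the first transcript it finds. The response shape (result, alternative, transcript) is an assumption taken from the C# code in this comment, not from official documentation.

```python
import json


def first_transcript(response_text):
    """Return the first transcript in a newline-delimited JSON response, or None.

    Hypothetical helper; the field names are assumed from the C# snippet.
    """
    for line in response_text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between JSON objects
        payload = json.loads(line)
        # An empty "result" list simply yields nothing for this line.
        for result in payload.get("result", []):
            for alternative in result.get("alternative", []):
                if "transcript" in alternative:
                    return alternative["transcript"]
    return None
```

A return value of None means that every line carried an empty result list, which matches the blank output some commenters report.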

fj4870 commented Dec 1, 2016

@akifnaeem21
Could you please share your speech-to-text C# code?

Nalinh commented Apr 4, 2017

I can't get this to run :(
The result is blank. Can someone tell me why, please?

Reejesh-PK commented Oct 27, 2022

Here is the updated API: https://cloud.google.com/speech-to-text/docs/reference/rest/v1/speech/recognize

Stack Overflow reference: https://stackoverflow.com/questions/50760057/how-to-use-googles-cloud-speech-to-text-rest-api-to-transcribe-a-video
