titu1994/compute_asr_normalization_statistics.ipynb

Last active February 2, 2023 10:31

Star (3) You must be signed in to star a gist
Fork (1) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/titu1994/8c8d478a917cf62a6acd0f40af779f77.js"></script>
Save titu1994/8c8d478a917cf62a6acd0f40af779f77 to your computer and use it in GitHub Desktop.

Download ZIP

Raw

compute_asr_normalization_statistics.ipynb

Sorry, something went wrong. Reload?

Sorry, we cannot display this file.

Sorry, this file is invalid so it cannot be displayed.

ridasaleem0 commented Jul 27, 2021

Hey i've been exploring your notebook to compute normalization statistics for Citrinet model, can you please probably clear how we can use manifest path, do we need to download the datasets or what?

Author

titu1994 commented Jul 27, 2021

Yes you will need to have the dataset + it's manifest file in order to calculate the dataset statistics.

ridasaleem0 commented Jul 27, 2021

Okay, I am basically looking to calculate dataset statistics for "stt_zh_citrinet_1024_gamma_0_25_1.0.0" model, since it has been trained on Multilingual LibriSpeech English corpus (pre-training) and Aishell-2 corpus (fine-tuning), i am not sure where to get the manifest file for it.

Author

titu1994 commented Jul 27, 2021

For such models, if you don't have the datasets it might be valuable to simply run https://github.com/NVIDIA/NeMo/blob/main/tutorials/asr/Streaming_ASR.ipynb with large buffer sizes (it won't require pre-calculation of the dataset statistics then)

ridasaleem0 commented Jul 27, 2021

Can this be used for real-time asr with microphone? I am specifically looking for offline microphone asr solution.

Author

titu1994 commented Jul 27, 2021

Not for realtime. Jarvis would be a proper production toolkit for streaming (real time) ASR. In Nemo we have buffered audio (the notebook above) and streaming audio is not perfect support

ridasaleem0 commented Jul 27, 2021

I have Jetson xavier and nano, as far as I know Jarvis is not compatible with Jetson for now.

Author

titu1994 commented Jul 27, 2021

It is not compatible for now. I don't think Nemo supports ASR on Jetson either

ridasaleem0 commented Jul 27, 2021

Ah seems like a pickle, anyways thank you so much for your assistance.

titu1994/compute_asr_normalization_statistics.ipynb

ridasaleem0 commented Jul 27, 2021

Uh oh!

titu1994 commented Jul 27, 2021

Uh oh!

ridasaleem0 commented Jul 27, 2021

Uh oh!

titu1994 commented Jul 27, 2021

Uh oh!

ridasaleem0 commented Jul 27, 2021

Uh oh!

titu1994 commented Jul 27, 2021

Uh oh!

ridasaleem0 commented Jul 27, 2021

Uh oh!

titu1994 commented Jul 27, 2021

Uh oh!

ridasaleem0 commented Jul 27, 2021

Uh oh!