Open Source AI Scribe / Auto-Transcriber / Speech-to-text Transcriptions / Captions & Subtitles Exporter / Interactive Transcripts / Alternative to Otter.ai, Descript, Sonix.ai

orkunzozturk commented Mar 16, 2025

I would love to see this come to life too

Author

candideu commented Mar 16, 2025

@orkunzozturk I commented on the original post, but there, thankfully, quite a few options these days:

Subtitle Edit

It's FOSS and recently added a speech-to-text function using both Vosk and Open AI's Whisper.

Here's a tutorial demo: https://youtu.be/InsNe0KjFhg

The downside: it's only available for Windows.

HyperAudio

HyperAudio now offers speech-to-text using Deepgram and Whisper.

Check out the browser-based editor here: https://hyperaudio.github.io/hyperaudio-lite-editor/index.html

Memo.Ai

Not FOSS, but pretty powerful transcription app. I've been using the free preview for a while now.

Other Whisper-based options

There has been an explosion of free, multilingual speech-to-text tools thanks to Whisper. You can check out the show case here: https://github.com/openai/whisper/discussions/categories/show-and-tell

mikeydiamonds commented Jul 10, 2025

@candideu Scriberr looks promising. rishikanthc is working on v1.0.0.

It can

transcribe dictation, audio, video, or a YouTube URL
use open source model for speaker identification (diarization)
summarize if connected to OpenAI or Ollama

There are no subtitle features but if there are skilled devs reading there's no reason it couldn't get added.

Give it a look.

candideu/Open Source AI Scribe, Auto-Transcriber, Speech-to-text Transcriptions, Captions & Subtitles Exporter, Interactive Transcripts, Alternative to Otter.ai, Descript, Sonix.ai.md

Project description

The idea

Inspiration, and the "Why"

Issues, and what's missing in existing tools

Relevant Technology

Speech-to-text

Vosk Browser

ideasman42/nerd-dictation

saharmor/realtime-transcription-playground

STTWebApp

Clickable, Interactive Transcript

AblePlayer

Subtitle + Transcript Editors + Previewers

oTranscribe

Hyperaudio

All arounders

Kdenlive

Video Transcriber

Complexity and required time

Complexity

Required time (ETA)

Categories

orkunzozturk commented Mar 16, 2025

Uh oh!

candideu commented Mar 16, 2025

Uh oh!

mikeydiamonds commented Jul 10, 2025

Uh oh!