FOSS dictation and transcription software

archer@lemmy.ml · 1 year ago

FOSS dictation and transcription software

Leraje · 1 year ago

I can vouch for whisper.cpp . It’s not 100% perfect but it’s good enough to transcribe a half hour podcast with numerous speakers and which requires pretty minimal fixing afterwards.

Infinite@lemmy.dbzer0.com · 1 year ago

Agreed.

OP, this is the best Speech-to-Text solution, IMO. I’ve used Whisper on Windows (link to GitHub) successfully to transcribe graduate-level class recordings with very minimal manual fixing, mostly only certain last names.

𝒍𝒆𝒎𝒂𝒏𝒏@lemmy.dbzer0.com · 1 year ago

Not FOSS as it’s under another license, but there’s “FUTO Voice Input” if you’re looking for a local alternative to Google’s voice dictation on Android

https://gitlab.futo.org/alex/voiceinput

The repo has a list of supported and unsupported Android keyboards. Under the hood it uses OpenAI Whisper

Substance_P@lemmy.world · 1 year ago

This one of my most used apps at the moment, it works 100% on your device and is great for filling in search terms, for AI prompts , messages etc. The only downside is that it seems to have a character limit so it may not be what OP is looking for.

AnEilifintChorcra@sopuli.xyz · 1 year ago

Maybe not exactly what you’re looking for but I found this a few weeks ago https://github.com/k2-fsa/sherpa-onnx and I haven’t really seen anyone talk about it

I’ve been using the tts on android for navigation and its way better than rhvoice and espeak.

I did try stt on android and it worked great but I’ve never used stt before so I don’t know how good it is compared to other stt

punkcoder@lemmy.world · 1 year ago

Falling into the not sure how open source it is because AI is a mess. But it works category…

https://github.com/manzolo/openai-whisper-docker

FlatFootFox@lemmy.world · 1 year ago

It’s still surreal to see OpenAI’s need for training data be so vast that they casually developed and open sourced a generational leap in transcription technology just so that they could scrape online videos better.

makingStuffForFun@lemmy.ml · 1 year ago

There is Talon Voice. It’s not FLOSS, nor even Open Source in any way. But it’s good.

I’ll watch this thread as I’m keen to know myself.

Gravitywell@sh.itjust.works · 1 year ago

Depends on your use case but I found a plugin to use openai for dictation in obsidian

archer@lemmy.ml · 1 year ago

I’d prefer to keep both the recordings and the transcription local if possible, either on device or self hosted.

moreeni@lemm.ee · 1 year ago

https://github.com/pluja/awesome-privacy?tab=readme-ov-file#speech-to-text

suoko@feddit.it · 1 year ago

https://www.omglinux.com/speech-note-transcribe-voice-to-text-on-linux/

spinning_your_wheels@thelemmy.club · 1 year ago

https://github.com/openai/whisper

they have MIT license