Exciting news! The free API you were using is no more free!

Moonrise2473@feddit.it · edit-2 1 year ago

Exciting news! The free API you were using is no more free!

Slurpey@lemmy.world · 1 year ago

I mean to be realistic, Whisper (the audio to text AI ) linked with chatGPT can subtitle anything in real time, translated in any language, in very high quality…

gato@feddit.de · 1 year ago

You just need a GPU running in realtime along side your video playback to analyse what is being played instead of a single text file with timecodes.

Progress!

ashok36@lemmy.world · 1 year ago

Could probably do it with something like a Google coral. You can get one for $60 these days. A lot cheaper than a GPU and less power hungry too.

gato@feddit.de · 1 year ago

That’s still another thing that needs to be bought, installed, and fed with power.
My low power would likely melt trying to run Whisper.

ashok36@lemmy.world · 1 year ago

A USB coral uses barely any power and if you have a hard time installing USB devices…

Besides, a lot of people are already using them for frigate. I am.

gato@feddit.de · 1 year ago

I was only aware of the m.2 variants.

Still, it’s a thing to be bought which I have not had to do for years for my media solution.

dan@upvote.au · 1 year ago

You can get one for $60 these days.

I think they’re $25ish from an official supplier. $60 is scalper pricing. Don’t pay a scalper as it just encourages them to do it more.

ashok36@lemmy.world · 1 year ago

$25 for the m.2 version at least. It’s $60 for the usb version which I assume most people would prefer.

dan@upvote.au · edit-2 1 year ago

Ah, I didn’t realise the USB one cost that much more. I’m not sure most people would prefer the USB version though. It’s convenient to move around and you can use it with mini PCs, but cooling isn’t as good compared to something that sits in a case with good airflow (so it’s more likely to thermally throttle while in use), and having dedicated PCIe lanes as you’d get with an M.2 is way more efficient than using a shared bus like USB. Google have always advertised the USB version for “prototyping” while the M.2 versions are for “production”.

For $40, you can get an M.2 version that has two Coral TPUs on a single board. https://coral.ai/products/m2-accelerator-dual-edgetpu. I’ve got this one with a PCIe adapter, but currently only use one of the TPUs.

Stephen304@lemmy.ml · 1 year ago

It doesn’t need to be realtime since you can pre generate an srt with time codes beforehand using something like bazarr. Whisper also runs faster than realtime in most model sizes, up to 32x realtime so it can really be worth it to add auto subtitles to media in your collection that’s missing subtitles as a one time job.

gato@feddit.de · 1 year ago

It’s an interesting idea to patch the holes when absolutely no srt files are available.
But why not have an open repository where already present srt files could be shared by people.
We could call it libre-subs or something like that.

nicetriangle@kbin.social · 1 year ago

Seems like a huge waste of electricity

ARNiM@lemmy.world · 1 year ago

For something like movies or shows you would only just need to run it once and store it in .srt

nicetriangle@kbin.social · 1 year ago

Yeah but then multiply that by every video file in every Plex library in the world that doesn’t have SRTs already.

LiveLM@lemmy.zip · 1 year ago

Yeah, and we could have a website so people that already did the process for a piece of media could share the result with others!

…oh wait

ARNiM@lemmy.world · 1 year ago

Full circle.

yeehaw@lemmy.ca · 1 year ago

Does this have a plugin or something for Plex?

abbadon420@lemm.ee · 1 year ago

Asking the important questions here

VaultBoyNewVegas@lemmy.world · 1 year ago

Kodi would be question.

Stephen304@lemmy.ml · edit-2 1 year ago

You can use bazarr to batch generate whisper subtitles for your Plex/jellyfin/kodi library: https://wiki.bazarr.media/Additional-Configuration/Whisper-Provider/

ashok36@lemmy.world · 1 year ago

This is super cool.