VLC player demos real-time AI subtitling for videos

Otter@lemmy.ca · edit-2 2 months ago

VLC player demos real-time AI subtitling for videos

TJA!@sh.itjust.works · 2 months ago

Problem ist that now people will say that they don’t get to create accurate subtitles because VLC is doing the job for them.

Accessibility might suffer from that, because all subtitles are now just “good enough”

Railcar8095@lemm.ee · 2 months ago

Or they can get OK ones with this tool, and fix the errors. Might save a lot of time

snooggums@lemmy.world · 2 months ago

Regular old live broadcast closed captioning is pretty much ‘good enough’ and that is the standard I’m comparing to.

Actual subtitles created ahead of time should be perfect because they have the time to double check.

LandedGentry@lemmy.zip · edit-2 2 months ago

deleted by creator

TheMachineStops@discuss.tchncs.de · edit-2 2 months ago

From experience AI translation is still garbage, specially for languages like Chinese, Japanese, and Korean , but if it only subtitles in the actual language such creating English subtitles for English then it is probably fine.

catloaf@lemm.ee · 2 months ago

That’s probably more due to lack of training than anything else. Existing models are mostly made by American companies and trained on English-language material. Naturally, the further you get from the model, the worse the result.

TheMachineStops@discuss.tchncs.de · 2 months ago

It is not the lack of training material that is the issue, it doesn’t understand context and cultural references. Someone commented here that crunchyroll AI subtitles translated Asura Hall a name to asshole.

Petter1@lemm.ee · 2 months ago

It would be able to behave like it understands context and cultural references it if it had the appropriate training data, no problem.

TheMachineStops@discuss.tchncs.de · edit-2 2 months ago

I highly doubt that it will be as good as human translation anytime soon, maybe around 10 years or so. Also they have profanity filters and they also hallucinate a lot. https://www.businessinsider.com/ai-peak-data-google-deepmind-researchers-solution-test-time-compute-2025-1

Petter1@lemm.ee · 2 months ago

Never said that…

TheMachineStops@discuss.tchncs.de · 2 months ago

You said that with training data it will be able to understand. I mean that even with training data it will take years and it also has other problems like hallucinations. I admit, I didn’t word it correctly.

LandedGentry@lemmy.zip · edit-2 2 months ago

deleted by creator

shyguyblue@lemmy.world · edit-2 2 months ago

I imagine it would be not-exactly-simple-but-not- complicated to add a “threshold” feature. If Ai is less than X% certain, it can request human clarification.

Edit: Derp. I forgot about the “real time” part. Still, as others have said, even a single botched word would still work well enough with context.

snooggums@lemmy.world · edit-2 2 months ago

That defeats the purpose of doing it in real time as it would introduce a delay.

shyguyblue@lemmy.world · 2 months ago

Derp. You’re right, I’ve added an edit to my comment.