Consider adding Vosk backend #91

Erudition · 2022-03-20T00:50:53Z

This project has a libre, offline-capable speech recognition engine that's over 90% accurate - perfect for autoedit!

Vosk is a speech recognition toolkit. The best things in Vosk are:

Supports 20+ languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi. More to come.
Works offline, even on lightweight devices - Raspberry Pi, Android, iOS
Installs with simple pip3 install vosk
Portable per-language models are only 50Mb each, but there are much bigger server models available.
Provides streaming API for the best user experience (unlike popular speech-recognition python packages)
There are bindings for different programming languages, too - java/csharp/javascript etc.
Allows quick reconfiguration of vocabulary for best accuracy.
Supports speaker identification beside simple speech recognition.

The text was updated successfully, but these errors were encountered:

pietrop · 2023-09-01T01:17:35Z

I'd welcome a PR if there's interest in adding this, and could provide some guidance on where to
extend the code if there's stil interest in this.

Erudition added the enhancement New feature or request label Mar 20, 2022

Erudition assigned pietrop Mar 20, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Consider adding Vosk backend #91

Consider adding Vosk backend #91

Erudition commented Mar 20, 2022 •

edited

Loading

pietrop commented Sep 1, 2023

Consider adding Vosk backend #91

Consider adding Vosk backend #91

Comments

Erudition commented Mar 20, 2022 • edited Loading

pietrop commented Sep 1, 2023

Erudition commented Mar 20, 2022 •

edited

Loading