This project is a distribution of the following speech recognition applications:
- speech-recognition-open-api - Converts speech into text using an API powered by deep learning neural network algorithms for automatic speech recognition (ASR). To know more, Click Here
- speech-recognition-open-api-proxy - A proxy that provides real-time streaming support for the Open Speech API from browsers or any other clients that don't support gRPC bi-directional streaming. Refer to the real-time streaming documentation for architecture details here
- clone the repo.
git clone git@github.com:amankrayush/speech-assistant-distro.git
- download the models for the languages you need (a model is required to recognize speech and convert it into text for a particular language)
sh ./scripts/download_models.sh "english Hindi"
- update the network name in the docker-compose.yml file as per your requirement.
networks:
  default:
    name: bahmni-docker_default
    external: true
Note:
- speech-assistant-distro is integrated with Bahmni, which is why the network name is 'bahmni-docker_default'.
- remove the networks field if you don't need an external network. Otherwise, set it to the name of your external network, which must already exist in Docker when you bring the distro applications up.
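Compose does not create a network that is declared as external, so it may help to check for the network before starting the stack. The following is a minimal sketch, not part of this repo: the `ensure_network` helper and the `DOCKER` override variable are hypothetical names introduced for illustration, and the network name is the one from the snippet above.

```shell
#!/bin/sh
# Hypothetical helper: create the external network only if it is missing.
# DOCKER can be overridden to point at a compatible CLI; it defaults to docker.
DOCKER="${DOCKER:-docker}"

ensure_network() {
    network_name="$1"
    # `docker network inspect` exits non-zero when the network does not exist.
    if "$DOCKER" network inspect "$network_name" >/dev/null 2>&1; then
        echo "network $network_name already exists"
    else
        "$DOCKER" network create "$network_name" >/dev/null &&
            echo "network $network_name created"
    fi
}

# Usage (assumes the network name from docker-compose.yml):
#   ensure_network "bahmni-docker_default"
```

If Bahmni is already running on the same host, its network should normally be present and the helper only reports that it exists.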
- bring the applications up (make sure the models are available, as mentioned in step 2)
docker-compose up -d
- speech streaming functionality is served by the proxy server, which is exposed on port 9009.
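To confirm that the proxy is reachable after the containers start, a simple TCP check can be used. This is a rough sketch under stated assumptions: `wait_for_port` is a hypothetical helper, `nc` (netcat) with the `-z` flag is assumed to be installed, and port 9009 is assumed to be published on the Docker host. It only verifies that the port accepts connections, not that streaming works end to end.

```shell
#!/bin/sh
# Hypothetical helper: poll a TCP port until it accepts connections.
# Assumes `nc` (netcat) with the -z flag is available on the host.
wait_for_port() {
    host="$1"
    port="$2"
    retries="${3:-30}"   # number of one-second attempts before giving up
    while [ "$retries" -gt 0 ]; do
        if nc -z "$host" "$port" >/dev/null 2>&1; then
            echo "$host:$port is accepting connections"
            return 0
        fi
        retries=$((retries - 1))
        sleep 1
    done
    echo "$host:$port did not open in time" >&2
    return 1
}

# Usage (assumes the proxy port is published on the Docker host):
#   wait_for_port localhost 9009
```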