Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Additional Whisper Services (Focus on OpenAI Whisper API) #137

Closed
5 tasks
vadanrod14 opened this issue Jan 9, 2025 · 0 comments
Closed
5 tasks

Comments

@vadanrod14
Copy link
Contributor

Title: Add Support for OpenAI Whisper API Integration

Description:

Currently, the Telegram bot uses AWS Whisper to transcribe voice messages. While this works well, supporting alternative Whisper services could enhance flexibility, performance, and cost-effectiveness, depending on user needs. This issue focuses on integrating the OpenAI Whisper API as an additional transcription backend.

Proposed Solution:

  1. Abstraction Layer: Create a modular architecture that allows switching between transcription services easily. This can involve:

    • Defining a common interface for transcription services.
    • Implementing service-specific modules.
  2. OpenAI Whisper API Integration:

    • Implement a new module for the OpenAI Whisper API.
    • Include necessary configuration options (e.g., API keys, endpoints).
    • Ensure compatibility with current Telegram bot features.
  3. Fallback/Preference Mechanism:

    • Allow users to set a preferred transcription service.
    • Implement fallback logic in case the primary service is unavailable.

Tasks:

  • Refactor the existing code to introduce a service abstraction layer.
  • Research and document the OpenAI Whisper API (authentication, pricing, limits, etc.).
  • Implement and test the OpenAI Whisper API integration.
  • Add configuration options for switching between transcription services.
  • Update documentation to include instructions for using the OpenAI Whisper API.

Additional Context:

OpenAI Whisper API is known for its accuracy and robustness in handling diverse languages and accents. Including it as an option could provide better transcription quality in some cases compared to AWS Whisper.

Acceptance Criteria:

  • The bot can transcribe messages using both AWS Whisper and OpenAI Whisper.
  • Users can configure their preferred transcription service.
  • Comprehensive documentation and tests are in place.

Feel free to modify or prioritize tasks based on project needs!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant