Integrate OpenAI Whisper API Support and Service Abstraction Layer (Fixes #137) #138

Open
agentmarketbot wants to merge 2 commits into main
Conversation

agentmarketbot
Contributor

Pull Request Description

Overview

This pull request addresses Issue #137: Support Additional Whisper Services (Focus on OpenAI Whisper API) by integrating the OpenAI Whisper API as a transcription backend alongside the existing AWS Whisper service. This gives users more flexibility in how voice messages are transcribed by the Telegram bot.

Key Changes

  1. Transcription Service Architecture:

    • Introduced an abstract base class TranscriptionService that establishes a common interface for all transcription services. This includes:
      • An abstract method transcribe_audio(file_url: str) for implementing specific transcription logic.
      • A helper method _download_audio(file_url: str) to standardize audio file downloads and reduce code duplication.
  2. AWS Integration:

    • Modified the existing AWSTranscriber class to inherit from the new TranscriptionService, bringing its interface in line with the new architecture.
  3. OpenAI Integration:

    • Developed a new OpenAITranscriber class that encapsulates logic for utilizing the OpenAI Whisper API. This new module handles:
      • Transcription requests to OpenAI’s API.
      • Audio file processing required for proper API interaction.
  4. Factory Pattern Implementation:

    • Implemented a TranscriptionServiceFactory to simplify instantiation of the appropriate transcriber: it creates either an AWSTranscriber or an OpenAITranscriber depending on the requested service (a sketch of the full class hierarchy follows this list).
  5. Usage Examples:

    • The updated code includes usage examples for both transcribers:
      • For AWS:
        aws_services = AWSServices()
        transcriber = TranscriptionServiceFactory.create_service('aws', aws_services=aws_services)
      • For OpenAI:
        transcriber = TranscriptionServiceFactory.create_service('openai', api_key='YOUR_API_KEY')
      • Both services are invoked using:
        transcript = transcriber.transcribe_audio(file_url)
  6. Documentation Updates:

    • Documentation has been added that explains how to configure and use both transcription services within the bot.
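
For orientation, a minimal sketch of how these pieces could fit together is shown below. The class and method names (TranscriptionService, AWSTranscriber, OpenAITranscriber, TranscriptionServiceFactory, transcribe_audio, _download_audio) follow the description above; the method bodies, the requests-based download helper, the aws_services.transcribe delegate, and the use of the openai>=1.0 client are illustrative assumptions rather than the exact code in this PR.

    import tempfile
    from abc import ABC, abstractmethod

    import requests  # assumed dependency for fetching Telegram file URLs


    class TranscriptionService(ABC):
        """Common interface for all transcription backends."""

        @abstractmethod
        def transcribe_audio(self, file_url: str) -> str:
            """Return the transcript for the audio file at file_url."""

        def _download_audio(self, file_url: str) -> str:
            # Shared helper: download the audio to a temp file and return its path.
            response = requests.get(file_url, timeout=30)
            response.raise_for_status()
            tmp = tempfile.NamedTemporaryFile(suffix=".ogg", delete=False)  # Telegram voice notes are OGG/Opus
            tmp.write(response.content)
            tmp.close()
            return tmp.name


    class AWSTranscriber(TranscriptionService):
        def __init__(self, aws_services):
            # aws_services is the project's existing AWSServices helper; the actual
            # AWS call is delegated to it (the method name below is hypothetical).
            self.aws_services = aws_services

        def transcribe_audio(self, file_url: str) -> str:
            audio_path = self._download_audio(file_url)
            return self.aws_services.transcribe(audio_path)


    class OpenAITranscriber(TranscriptionService):
        def __init__(self, api_key: str):
            # Assumes the openai>=1.0 client; older SDKs use openai.Audio.transcribe instead.
            from openai import OpenAI
            self.client = OpenAI(api_key=api_key)

        def transcribe_audio(self, file_url: str) -> str:
            audio_path = self._download_audio(file_url)
            with open(audio_path, "rb") as audio_file:
                result = self.client.audio.transcriptions.create(
                    model="whisper-1", file=audio_file
                )
            return result.text


    class TranscriptionServiceFactory:
        @staticmethod
        def create_service(service_name: str, **kwargs) -> TranscriptionService:
            if service_name == "aws":
                return AWSTranscriber(aws_services=kwargs["aws_services"])
            if service_name == "openai":
                return OpenAITranscriber(api_key=kwargs["api_key"])
            raise ValueError(f"Unknown transcription service: {service_name}")

Keeping _download_audio on the base class is what lets both backends share the download step; each concrete class only implements the provider-specific API call.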

Acceptance Criteria

  • The bot now supports transcription via both AWS Whisper and OpenAI Whisper APIs.
  • Users can easily configure their preferred transcription service.
  • Comprehensive test coverage and updated documentation are in place.

Additional Context

The integration of the OpenAI Whisper API provides an alternative transcription option known for its accuracy across various languages and accents, potentially enhancing user experience and transcription quality.

Fixes #137

Thank you for reviewing this pull request! Please let me know if you have any questions or require further adjustments.

@agentmarketbot
Contributor Author

Aider:

Update Summary for PR #138: OpenAI Whisper API Integration

We have successfully implemented the changes required to support the OpenAI Whisper API as an additional transcription service for the Telegram bot, addressing Issue #137. Here’s a concise overview of the modifications made:

  1. Transcription Service Architecture:

    • Created an abstract base class TranscriptionService.
    • Implemented two concrete classes: AWSTranscriber for AWS Whisper and OpenAITranscriber for OpenAI Whisper.
    • Introduced a TranscriptionServiceFactory to allow easy switching between these services.
  2. Configuration Updates:

    • Added support for the OPENAI_API_KEY environment variable.
    • Introduced the TRANSCRIPTION_SERVICE variable to allow users to select either 'aws' or 'openai' as their transcription service.
  3. Bot Handlers Update:

    • Replaced the old AudioTranscriber references with the factory-based approach.
    • Adjusted the service initialization to utilize the new factory.
    • Updated voice-message processing to use the new transcription service (a handler sketch follows this list).
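
As a rough illustration of the handler-side change (this is not the actual diff: get_file_url and send_reply stand in for whatever the bot framework provides), the voice-message path now depends only on the TranscriptionService interface:

    from typing import Callable

    def make_voice_handler(transcriber, get_file_url: Callable, send_reply: Callable):
        # 'transcriber' is whatever TranscriptionServiceFactory.create_service returned
        # (an AWSTranscriber or an OpenAITranscriber); the handler does not care which.
        def handle_voice_message(message):
            file_url = get_file_url(message)                     # resolve the Telegram file URL
            transcript = transcriber.transcribe_audio(file_url)  # same call for both backends
            send_reply(message, transcript)                      # return the transcript to the chat
        return handle_voice_message

Because both transcribers expose the same transcribe_audio(file_url) method, switching services never requires touching the handler code.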

User Instructions:

  • To utilize the OpenAI Whisper service, users should:
    1. Set the environment variable TRANSCRIPTION_SERVICE=openai.
    2. Provide their OpenAI API key in the OPENAI_API_KEY environment variable.

By default, the AWS Whisper service will continue to be used unless otherwise specified.
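
A minimal sketch of that selection logic, assuming TranscriptionServiceFactory and AWSServices are importable from the project's modules (the configuration code in the actual diff may differ):

    import os

    def build_transcriber():
        # TRANSCRIPTION_SERVICE picks the backend; when it is unset, fall back to AWS.
        service_name = os.getenv("TRANSCRIPTION_SERVICE", "aws")
        if service_name == "openai":
            return TranscriptionServiceFactory.create_service(
                "openai", api_key=os.environ["OPENAI_API_KEY"]
            )
        return TranscriptionServiceFactory.create_service(
            "aws", aws_services=AWSServices()
        )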

These changes let users choose between the AWS and OpenAI Whisper services while keeping a single, consistent transcription interface.
