This repository contains the code for a Video and Audio QA (Question Answering) System built with Streamlit. The system is designed to provide an interactive interface for users to ask questions about videos and receive answers.
The project is organized into the following files:
video_qa_system.ipynb
: A Jupyter notebook that provides a comprehensive overview of the AI pipeline, including video processing, transcription with Whisper, and Q&A with Gemini.video_qa_app.py
: The main application file, built with Streamlit. This file contains the web app that users interact with.requirements.txt
: Lists the Python package dependencies required for the project. To install these dependencies, runpip install -r requirements.txt
.packages.txt
: Lists system-level dependencies required for deployment on Streamlit Cloud.
To get started with the project, follow these steps:
- Clone the repository:
git clone https://github.com/AtaUllahB/Python-Video-Audio-QA-System.git
- Navigate to the project directory:
cd Python-Video-Audio-QA-System
- Install the Python dependencies:
pip install -r requirements.txt
- Create a
.streamlit
directory in the project root if it doesn't already exist. - Inside the
.streamlit
directory, create asecrets.toml
file. - Obtain your Gemini API key by following the instructions provided by Gemini. If you haven't already, you'll need to sign up for an account and generate an API key.
- Add your Gemini API key to the
secrets.toml
file in the following format: GEMINI_API_KEY = "add_key_here" - Navigate back to the project root directory
cd..
- Run the Streamlit app:
streamlit run video_qa_app.py
The application is designed to be deployed on Streamlit Cloud.
Streamlit for providing an easy-to-use framework for building web applications.