Skip to content

harvard-edge/Wake_Vision_Challenge_Data_Centric_Track

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

🚀 Data-Centric Track

Welcome to the Data-Centric Track of the Wake Vision Challenge! 🎉

The goal of this track is to push the boundaries of tiny computer vision by enhancing the data quality of the Wake Vision Dataset.

🔗 Learn More: Wake Vision Challenge Details


🌟 Challenge Overview

Participants are invited to:

  1. Enhance the provided dataset to improve person detection accuracy.
  2. Train the MCUNet-VWW2 model, a state-of-the-art person detection model, on the enhanced dataset.
  3. Assess quality improvements on the public test set.

You can modify the dataset however you like, but the model architecture must remain unchanged. 🛠️


🛠️ Getting Started

Step 1: Install Docker Engine 🐋

First, install Docker on your machine:


💻 Running Without a GPU

Run the following command inside the directory where you cloned this repository:

sudo docker run -it --rm -v $PWD:/tmp -w /tmp andregara/wake_vision_challenge:cpu python data_centric_track.py
  • This trains the MCUNet-VWW2 model on the original dataset.
  • Modify the dataset to improve the model's test accuracy by correcting labels or augmenting data.

💡 Note: The first execution may take several hours as it downloads the full dataset (~365 GB).


Running With a GPU

  1. Install the NVIDIA Container Toolkit.
  2. Verify your GPU drivers.

Run the following command inside the directory where you cloned this repository:

sudo docker run --gpus all -it --rm -v $PWD:/tmp -w /tmp andregara/wake_vision_challenge:gpu python data_centric_track.py
  • This trains the MCUNet-VWW2 model on the original dataset.
  • Modify the dataset to enhance test accuracy while keeping the model architecture unchanged.

💡 Note: The first execution may take several hours as it downloads the full dataset (~365 GB).


🎯 Tips for Success

  • Focus on Data Quality: Explore label correction, data augmentation, and other preprocessing techniques.
  • Stay Efficient: The dataset is large, so plan your modifications carefully.
  • Collaborate: Join the community discussion on Discord to share ideas and tips!

📚 Resources


📞 Contact Us

Have questions or need help? Reach out on Discord.


🌟 Happy Innovating and Good Luck! 🌟

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages