Final project -- Object recognition
The aim of this project is to investigate a joint representation for image and text in order to automatize the text description of an image. In particular, we will study the Deep Sementic Matching (Deep SM) and the canonical correlation analysis (CCA) in order to create a multimodal retrieval : Image-to-Image search, Tag-to-Image search, and Image-to-Tag search. We will experiment those methods on the Pascal Sentence Dataset http://vision.cs.uiuc.edu/pascal-sentences/.
conda env create -f environment.yml
git clone https://github.com/rupy/PascalSentenceDataset.git
python pascal_sentence_dataset.py
python traintestsplit.py
python cnn_features.py
python text_LDA.py
python text_features.py
python text2images.py