This is the age of OTT. Streaming is the new way of consumption. We want to build a model to provide our members with personalized suggestions to reduce the amount of time and frustration to find something great content to watch.
I have analyzed info from Netflix, Amazon Prime, Hulu and other platforms. I have recommendations for the newbies in the streaming game.
Using web scraping to get the movie dataset from public websites:
• data-world-movie-dataset dataset
Build the recommendation model using:
-
Content Based Filtering (Item Based) (IMDB Dataset - 1000 movies)- IMDB Movie rating, genres, actors, populatity, etc. Content-based filtering uses item features to recommend other items similar to what the user likes, based on their previous actions or explicit feedback
-
Collaborative Filtering (User Based) (Kaggle Dataset - 9700 movies)
Collaborative filtering filters information by using the interactions and data collected by the system from other users. It's based on the idea that people who agreed in their evaluation of certain items are likely to agree again in the future.
Implemented Content based filtering using Cosine Similarity Matrix. Used Movie features - Director, Actors, Genres.
For a User liking the movie 'The Imitation Game', based on my recommendation engine, here's 10 movies that user might like
- Google spreadsheets - initial Exploratory Data Analysis
- Tableau - interactive dashboards
- Python packages - building the model
- Scikit-learn – build Linear Regression model
- Pandas – manipulating and working on data frames
- NumPy – working with arrays
4 Key Recommendations to focus on:
- Powerful Recommendation Engine
- Platform Originals
- Buy rights for iconic cash machines, basically all time hits
- Content dubbing - Global Reach and ‘Localisation’