movie_recommender_hybrid

This project produces a two recommender systems using both recommendation approaches: Collaborative Filtering & Content Based Approach.

Dataset: IMDB and Movielens: 28 million ratings, 280,000 reviewers, 54,000 movies, 45,000 movies tagged.

Recommend.ipynb: Base models (item based and model based apprach in CF recomentations) - Pearson Cor and SVD.
Preprocessing: Merging 3 datasets: ratings, movies, tags through movie ID, and it also cleans ratings per person, taking out outliers in ratings per person.
Shortening: Ratings are weighted through percentiles based on total ratings per movie. This was done to scale movies with low number of reviews. Reviews went down from 28m to 14m.
Train Embedding: Trains 32 embedding weights using Neural Networks. The weights are updated at each iteration such that the dot product of the two inputs (raterID and MovieID) approach the output (actual - mean rating).
Visualize Embedding: Similarity metric used on the embeddings to predict movie recommendations.
content_genre_tags: Content Based recommender using genres, tags, and review datasets. Genres and tags are a list of words for each movie. Text preprocessing, bigram, and 1% top uncommon words are filtered out. TFIDF was used to create a BOW for the movies. Identity Matrix and Cosine Similarity were used to find recommendations, and results are sorted by similarity and weighted rating.

There is also a very nice Flask App with a lot of features and attractive design.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
Flask app		Flask app
data		data
model		model
processed_data		processed_data
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
README.txt		README.txt
movie_recommender 9.45.15 AM.pptx		movie_recommender 9.45.15 AM.pptx

Provide feedback