Collaborative Filtering

Introduction

Here, I explore Collaborative Filtering, a technique used in recommender systems.

I focus on two types of collaborative filtering, user-based and item-based, and provide a memory-based implementation of each.
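To make the distinction concrete, here is a minimal sketch of memory-based filtering on a toy rating matrix. The data, the function names (`cosine_similarity`, `predict_user_based`), and the choice of cosine similarity with missing ratings treated as 0 are all assumptions for illustration, not necessarily what the notebooks do.

```python
import numpy as np
import pandas as pd

# Toy user x item rating matrix; NaN marks an unrated item (made-up data).
ratings = pd.DataFrame(
    {
        "A": [5, 4, np.nan, 1],
        "B": [4, 5, 1, np.nan],
        "C": [1, np.nan, 5, 4],
        "D": [np.nan, 2, 4, 5],
    },
    index=["u1", "u2", "u3", "u4"],
)


def cosine_similarity(matrix):
    """Cosine similarity between rows, treating missing ratings as 0.

    Assumes every row has at least one rating (no zero-norm rows).
    """
    filled = matrix.fillna(0).to_numpy(dtype=float)
    norms = np.linalg.norm(filled, axis=1)
    sim = filled @ filled.T / np.outer(norms, norms)
    return pd.DataFrame(sim, index=matrix.index, columns=matrix.index)


# User-based compares rows (users); item-based compares columns (items).
user_sim = cosine_similarity(ratings)
item_sim = cosine_similarity(ratings.T)


def predict_user_based(user, item):
    """Similarity-weighted average of the other users' ratings for `item`."""
    rated = ratings[item].dropna().drop(user, errors="ignore")
    weights = user_sim.loc[user, rated.index]
    return float((weights * rated).sum() / weights.sum())


prediction = predict_user_based("u1", "D")
print(prediction)
```

Because `u1` is most similar to `u2`, who rated item `D` low, the prediction is pulled towards the lower end of the neighbours' ratings. The item-based variant would use `item_sim` in the same way, weighting the user's own ratings of similar items.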

Data

The MovieLens 100K dataset is used for building the recommender systems.

Download the dataset and unzip it into a folder.
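As a rough sketch, the ratings can be read with Pandas as shown below. In MovieLens 100K the ratings file `u.data` is tab-separated without a header row. The column names, the `load_ratings` helper, and the example path in the docstring are assumptions made here for illustration.

```python
import io

import pandas as pd

# u.data holds one rating per line: user id, item id, rating, timestamp.
RATING_COLUMNS = ["user_id", "item_id", "rating", "timestamp"]


def load_ratings(source):
    """Read a MovieLens 100K style ratings file into a DataFrame.

    `source` may be a path such as "ml-100k/u.data" (hypothetical location)
    or any file-like object.
    """
    return pd.read_csv(source, sep="\t", names=RATING_COLUMNS, header=None)


# Small in-memory sample in the same format, so the sketch runs
# without the download (illustrative rows only).
sample = io.StringIO(
    "196\t242\t3\t881250949\n"
    "186\t302\t3\t891717742\n"
    "22\t377\t1\t878887116\n"
)
ratings = load_ratings(sample)
print(ratings.head())
```

With the real dataset, pass the path to the unzipped `u.data` file instead of the in-memory sample.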

Implementation

The collaborative filtering algorithms are implemented using Pandas.

For the item-based collaborative filtering algorithm, I based my implementation on the excellent Udemy course Taming Big Data with Apache Spark and Python - Hands On! by Frank Kane. The motivation for implementing the algorithm with Pandas is to compare it with an implementation built on a distributed computing library such as Spark. For the user-based approach, I did not follow a specific recipe.
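One join-oriented way to compute item-item similarities in Pandas is sketched below: self-join the ratings on the user, keep each item pair once, and aggregate the products needed for cosine similarity. This is an illustration under assumptions (made-up data, a hypothetical `item_pair_similarities` helper, and a hypothetical `min_co_raters` threshold) and may differ from the code in the notebooks.

```python
import numpy as np
import pandas as pd

# Made-up ratings in long format (user_id, item_id, rating).
ratings = pd.DataFrame(
    {
        "user_id": [1, 1, 1, 2, 2, 2, 3, 3],
        "item_id": [10, 20, 30, 10, 20, 30, 10, 20],
        "rating": [5, 4, 1, 4, 5, 2, 1, 2],
    }
)


def item_pair_similarities(ratings, min_co_raters=2):
    """Cosine similarity for every item pair rated by the same users."""
    # Self-join on user_id: every pair of items rated by the same user.
    pairs = ratings.merge(ratings, on="user_id", suffixes=("_a", "_b"))
    # Keep each unordered pair once and drop self-pairs.
    pairs = pairs[pairs["item_id_a"] < pairs["item_id_b"]]
    # Per-row products needed for the cosine formula.
    pairs = pairs.assign(
        xy=pairs["rating_a"] * pairs["rating_b"],
        xx=pairs["rating_a"] ** 2,
        yy=pairs["rating_b"] ** 2,
    )
    grouped = pairs.groupby(["item_id_a", "item_id_b"]).agg(
        xy=("xy", "sum"),
        xx=("xx", "sum"),
        yy=("yy", "sum"),
        co_raters=("user_id", "count"),
    )
    grouped["similarity"] = grouped["xy"] / np.sqrt(grouped["xx"] * grouped["yy"])
    # Discard pairs supported by too few common raters.
    result = grouped[grouped["co_raters"] >= min_co_raters]
    return result[["similarity", "co_raters"]].reset_index()


sims = item_pair_similarities(ratings)
print(sims.sort_values("similarity", ascending=False))
```

The `min_co_raters` filter reflects a common design choice: a similarity computed from only one or two shared raters is noisy, so such pairs are usually dropped before making recommendations.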

Jupyter notebooks (here and here) explain the methodology and the steps followed. I also develop a strategy to measure the quality of the recommendations (here and here).
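The evaluation strategy itself is described in the notebooks. As a generic illustration only, one common way to measure prediction quality is to hold out part of the ratings and compute the root-mean-square error (RMSE) on them. Everything below (the helper names, the 25% split, the global-mean baseline, and the data) is an assumption for the sketch, not the repository's method.

```python
import numpy as np
import pandas as pd


def train_test_split_ratings(ratings, test_fraction=0.2, seed=0):
    """Randomly hold out a fraction of the rating rows for testing."""
    test = ratings.sample(frac=test_fraction, random_state=seed)
    train = ratings.drop(test.index)
    return train, test


def rmse(actual, predicted):
    """Root-mean-square error between two equally long sequences."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))


def evaluate(predict, test):
    """Score a `predict(user_id, item_id)` function on held-out ratings."""
    predicted = [
        predict(row.user_id, row.item_id) for row in test.itertuples(index=False)
    ]
    return rmse(test["rating"], predicted)


# Made-up ratings in long format.
ratings = pd.DataFrame(
    {
        "user_id": [1, 1, 1, 2, 2, 2, 3, 3],
        "item_id": [10, 20, 30, 10, 20, 30, 10, 20],
        "rating": [5, 4, 1, 4, 5, 2, 1, 2],
    }
)

train, test = train_test_split_ratings(ratings, test_fraction=0.25)
global_mean = train["rating"].mean()


def baseline_predict(user_id, item_id):
    """Trivial baseline: always predict the mean training rating."""
    return global_mean


score = evaluate(baseline_predict, test)
print(score)
```

Any recommender that exposes a `predict(user_id, item_id)` function can be passed to `evaluate` in place of the baseline, which makes it straightforward to compare the user-based and item-based variants on the same held-out ratings.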
