
Music Recommendation Using Deep Learning

Music Recommendation using latent feature vectors obtained from a network trained on the Free Music Archive dataset.

Overview

The basic idea of this project is to recommend music using computer vision via a convolutional neural network. The network is first trained as a classifier on the dataset's 8 genre labels. The trained network is then modified by discarding the softmax layer, which yields a new model that works as an encoder. This encoder takes slices of a spectrogram as input, one at a time, and outputs a 32-dimensional latent representation of each slice, so a spectrogram produces as many latent vectors as it has slices. These vectors are averaged to obtain a single latent representation per spectrogram. Cosine similarity is then used to score one anchor song against every other song in the test set, and the two songs with the highest similarity scores are output as the recommendations.

project_architecture
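As a rough sketch of the encoder idea (not the repository's exact code; the model path and layer indexing are assumptions), the softmax layer can be dropped in Keras like this:

import numpy as np
from keras.models import Model, load_model

# Load the trained genre classifier (path assumed for illustration).
classifier = load_model("Saved_Model/Model.h5")

# Re-wire the model to end at the 32-dimensional layer just before
# the softmax; layers[-2] assumes the softmax is the final layer.
encoder = Model(inputs=classifier.input, outputs=classifier.layers[-2].output)

def song_latent_vector(slices):
    # slices: array of shape (n_slices, 128, 128, 1) for one spectrogram.
    latents = encoder.predict(slices)   # shape: (n_slices, 32)
    return latents.mean(axis=0)         # one 32-dimensional vector per song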

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

To use this project, you need to install Keras, Scikit-learn, Pillow (PIL), Librosa, OpenCV, and Pandas.

pip install keras
pip install scikit-learn
pip install pillow
pip install librosa
pip install opencv-python
pip install pandas

Dataset

The fma_small dataset consists of 8000 mp3 files from the Free Music Archive.

Each file in fma_small is a 30-second clip of music. The dataset is balanced across 8 genres (Hip-Hop, International, Electronic, Folk, Experimental, Rock, Pop, and Instrumental).

The dataset is stored in the Dataset folder as fma_small (the file is too large to upload to GitHub).

For testing the recommendation system, I used 30 songs from my iTunes library. I manually converted the songs into 30-second clips and then ran the code in test mode.

30 Seconds To Mars - Night of the hunter (Acoustic)
Afrojack - The spark
Alesso - Heros
Awolnation - Sail
Boyce Avenue - Wonderwall
Bruno Mars - Just the way you are
Bruno Mars - Locked out of heaven
Calvin Harris - Summer
Calvin Harris - Sweet Nothing
Coldplay - Magic
Coldplay - Paradise
Coldplay - Viva La Vida
Coldplay - The Scientist
Daft Punk - Instant crush
Daft Punk - Lose yourself to dance
Don Omar - Danza Kuduro
Enrique Iglesias - Bailando
Imagine Dragons - Demons
Imagine Dragons - It's Time
Jennifer Lopez - On the floor 
John Mayer - Say
Kanye West - Stronger
Katy Perry - Dark Horse
Katy Perry - Fireworks
Khalid - Location
Lana Del Rey - Young and Beautiful
Maroon5 - Moves Like Jagger
Passenger - Let Her Go
Wiz Khalifa - Black and Yellow
Wiz Khalifa - Young, Wild and Free

Training

Run the script train.py in the terminal as follows.

python train.py

Data Preprocessing

The train.py script runs import_data.py, slice_spectrogram.py, and load_data.py behind the scenes.

import_data.py

Train Mode - In training mode, the script converts the files from fma_small into mel-spectrograms and stores them into a folder called Train_Spectrogram_Images.

Test Mode - In testing mode, the script converts the songs from DLMusicTest_30 into mel-spectrograms and stores them into a folder called Test_Spectrogram_Images.
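A minimal sketch of the spectrogram step, assuming Librosa defaults (the actual parameters and file paths in import_data.py may differ):

import os
import numpy as np
import librosa
import librosa.display
import matplotlib.pyplot as plt

# Load one 30-second clip (path is illustrative).
y, sr = librosa.load("Dataset/fma_small/000/000002.mp3", duration=30.0)

# Compute a mel-spectrogram and convert power to decibels.
mel = librosa.feature.melspectrogram(y=y, sr=sr)
mel_db = librosa.power_to_db(mel, ref=np.max)

# Save the spectrogram as an image for the Train_Spectrogram_Images folder.
os.makedirs("Train_Spectrogram_Images", exist_ok=True)
plt.figure(figsize=(10, 4))
librosa.display.specshow(mel_db, sr=sr)
plt.axis("off")
plt.savefig("Train_Spectrogram_Images/000002.jpg", bbox_inches="tight", pad_inches=0)
plt.close()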

Example of a Pop Song Spectrogram

pop_Spectrogram

slice_spectrogram.py

Train Mode - In training mode, the script slices the spectrograms from the Train_Spectrogram_Images folder into 128x128 slices and stores them into the Train_Sliced_Images folder.

Test Mode - In testing mode, the script slices the spectrograms from the Test_Spectrogram_Images folder into 128x128 slices and stores them into the Test_Sliced_Images folder.
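The slicing itself is straightforward with PIL; a sketch (the function name and output naming here are mine, not the script's):

import os
from PIL import Image

def slice_spectrogram(path, out_dir, size=128):
    # Cut one spectrogram image into consecutive 128x128 windows,
    # moving left to right along the time axis.
    img = Image.open(path)
    width, _ = img.size
    name = os.path.splitext(os.path.basename(path))[0]
    for i in range(width // size):
        box = (i * size, 0, (i + 1) * size, size)  # (left, top, right, bottom)
        img.crop(box).save(os.path.join(out_dir, "%s_%d.jpg" % (name, i)))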

Example of a Spectrogram Slice from Kanye West's Stronger

Kanye_Stronger_Slice

load_data.py

Train Mode - In training mode, the script imports images from Train_Sliced_Images, converts them to grayscale, and exports them as numpy matrices for training and testing. These are saved as train_x.npy, train_y.npy, test_x.npy, and test_y.npy in the Training_Data folder.

Test Mode - In testing mode, the script imports images from Test_Sliced_Images, converts them to grayscale, and returns them as images and labels.
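A sketch of the loading step (the filename-to-label convention is an assumption):

import os
import numpy as np
from PIL import Image
from sklearn.model_selection import train_test_split

def load_slices(folder):
    # Read each slice, convert to grayscale, and scale pixels to [0, 1].
    images, labels = [], []
    for fname in sorted(os.listdir(folder)):
        img = Image.open(os.path.join(folder, fname)).convert("L")
        images.append(np.asarray(img, dtype=np.float32) / 255.0)
        labels.append(fname.split("_")[0])  # assumes "<genre>_<index>.jpg" names
    x = np.stack(images)[..., np.newaxis]   # shape: (n, 128, 128, 1)
    return x, np.array(labels)

x, y = load_slices("Train_Sliced_Images")
train_x, test_x, train_y, test_y = train_test_split(x, y, test_size=0.1)
np.save("Training_Data/train_x.npy", train_x)  # likewise for the other three arrays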

Neural Network Architecture

The convolutional neural network used for this recommendation system is shown below.

model_architecture
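The exact architecture is defined in train.py; the stand-in below is only a plausible Keras sketch consistent with the description (grayscale 128x128 inputs and a 32-dimensional layer feeding an 8-way softmax):

from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense

model = Sequential([
    Conv2D(32, (3, 3), activation="relu", input_shape=(128, 128, 1)),
    MaxPooling2D((2, 2)),
    Conv2D(64, (3, 3), activation="relu"),
    MaxPooling2D((2, 2)),
    Conv2D(128, (3, 3), activation="relu"),
    MaxPooling2D((2, 2)),
    Flatten(),
    Dense(32, activation="relu"),    # 32-dimensional latent layer
    Dense(8, activation="softmax"),  # one output per genre
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",  # assumes integer genre labels
              metrics=["accuracy"])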

Model and History

The trained network is saved as Model.h5 and its history as training_history.csv in the Saved_Model folder.
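Saving both artifacts can be sketched as follows (the epoch count and variable names are assumed):

import pandas as pd

history = model.fit(train_x, train_y,
                    validation_data=(test_x, test_y),
                    epochs=10, batch_size=32)

model.save("Saved_Model/Model.h5")
pd.DataFrame(history.history).to_csv("Saved_Model/training_history.csv", index=False)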

Training Performance

Final Training Accuracy = 77.85%
Final Validation Accuracy = 66.11%

Prediction On Test Set

(This test set is a small part of the fma_small dataset that the network hasn't been trained on) training_prediction

Accuracy Graph

accuracy_graph

Loss Graph

loss_graph

Confusion Matrix

confusion_matrix

Recommendation

Testing

Run the script recommendation.py in the terminal as follows.

python recommendation.py

This will print the list of songs available in the test set.

['Bailando' 'BlackandYellow' 'DanzaKuduro' 'DarkHorse' 'Demons'
'Fireworks' 'Heros' 'InstantCrush' 'ItsTime' 'JustTheWayYouAre'
'LetHerGo' 'Location' 'LockedOutOfHeaven' 'LoseYourselfToDance' 'Magic'
'MovesLikeJagger' 'NightOfTheHunter' 'OnTheFloor' 'Paradise' 'Sail' 'Say'
'Spark' 'Stronger' 'Summer' 'SweetNothing' 'VivaLaVida' 'Wonderwall'
'YoungAndBeautiful' 'YoungWildAndFree']

Enter an anchor song for which you want similar recommendations (choose one from the list above).

Enter a Song Name:
TheScientist
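Internally, the ranking step boils down to cosine similarity between the anchor's averaged latent vector and every other song's; a sketch with scikit-learn (the latents matrix and names list are assumed to be precomputed by the encoder):

import numpy as np
from sklearn.metrics.pairwise import cosine_similarity

def recommend(anchor, names, latents, k=2):
    # latents: (n_songs, 32) averaged latent vectors; names: song names.
    idx = names.index(anchor)
    scores = cosine_similarity(latents[idx:idx + 1], latents)[0]
    scores[idx] = -1.0                   # exclude the anchor itself
    best = np.argsort(scores)[::-1][:k]  # indices of the top-k matches
    return [(names[i], scores[i]) for i in best]

# e.g. recommend("TheScientist", names, latents) returns the two closest songs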

Results

The code generates two recommendations for the song The Scientist by Coldplay: 1) Let Her Go by Passenger and 2) Say by John Mayer.

Output_1

More Results

Output_2 Output_3

Built With

Authors

Acknowledgments
