Applied Deep Learning Spring 2020 Final Project -- Singing Transcription

Report

Demo Video

Ground truth file structure

{
  "song_id": [[start_time, end_time, pitch], ...]
  ...
}

For example:

{
  "1": [[0.0, 0.5, 60], ...],
  ...
  "123": [[0.0, 0.5, 58], ...]
}

Scripts usage

Convert music file to midi

This script is used to run efficientnet (TODO: run RNN with pretrained model).

It does svs first (using spleeter), and then run efficientnet. Finally, it generates a midi file.

python do_everything.py $input_wav_file $output_path

input_wav_file: Path to the input file.
output_path: Output midi path.

The default model is efficientnet-b3 and the model path is "sing_voice_transcription/efficientnet/b4_e_6600"

If you want to specify the model path, then you can add another argument:

python do_everything.py $input_wav_file $output_path -mp $model_path

model_path: Path to the model file.

Evaluation

This script is used to calculate the evaluation measures.

python evaluate.py $gt_file $predicted_file

gt_file: Ground truth file with .json format.
predicted_file: Predicted file with .json format.

Generate dataset instance to a file:

This script will read from a data directory and generate custom dataset class instance into a binary file.

python generate_dataset.py $data_dir $output_dir --for-rnn

data_dir: Directory of the songs.
output_dir: Path to the directory to save the dataset.pkl.
--for-rnn: Flag for generating dataset for RNN training.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
sing_voice_transcription		sing_voice_transcription
.gitignore		.gitignore
README.md		README.md
do_everything.py		do_everything.py
evaluate.py		evaluate.py
generate_dataset.py		generate_dataset.py
json2mid.py		json2mid.py
ntu_adl_final_report.pdf		ntu_adl_final_report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Applied Deep Learning Spring 2020 Final Project -- Singing Transcription

Report

Demo Video

Ground truth file structure

Scripts usage

Convert music file to midi

Evaluation

Generate dataset instance to a file:

About

Releases

Packages

Contributors 2

Languages

SongRongLee/adl-singing-transcription

Folders and files

Latest commit

History

Repository files navigation

Applied Deep Learning Spring 2020 Final Project -- Singing Transcription

Report

Demo Video

Ground truth file structure

Scripts usage

Convert music file to midi

Evaluation

Generate dataset instance to a file:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages