Implementation of State-of-the-arts in Audio Event Classification, Sound Event Detection (SED) and even more...in Pytorch
- Build a conv1d model using directly raw waves from audio files
- Using log-mel spectrogram as features
- CNN-2D as core model, based on image classification task
The benchmark audio dataset was obtained from Kaggle
TODO