Skip to content

Using Fastai library to classify Twitter jokes in Spanish

Notifications You must be signed in to change notification settings

bfarzin/haha_2019

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

98 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

haha_2019

Using Fastai library to classify Twitter jokes in Spanish

Requires install of Fast.ai library for the Language Model and classification tasks

  • Data is installed in the same directory in ./data/ directory (but not checked into this repo.)
  • Start with the NBSVM as a baseline for classification. It has been run over several random splits and there is a plot showing the predicted values depending on the split. Mean accuracy is about 84% (but you could get as good as nearly 85% if you are lucky!)

Deep Learning Models

  • Using Sentencepiece for sub-word units and better vocab coverage.
  • SP ouputs .model and .vocab files in the current directory (I added to .gitignore)

About

Using Fastai library to classify Twitter jokes in Spanish

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published