Using Fastai library to classify Twitter jokes in Spanish
Code assocaited with 3rd place finish in F1 score.
- Install conda with
conda env create -f environment.yml
- place twitter data in
./data/all_file.txt
source activate fastaiv1_dev
jupter notebook
in the home directory, go to theLM Train in Notebook
and run- Put
haha_2019_train.csv
andhaha_2019_test.csv
in./data/
directory - Run
Finetune LM
notebook $cd ./prod/' run
$./mult_seed_run_fwd_finetune.sh | tee --append out_fwd_1.txt`- run run
$./mult_seed_regr_finetune.sh | tee --append out_reg_1.txt
Generate Submission entry: - Run the
Ensemble 20 Seeds select best F1 0610.ipynb
Notebook for the classification - Run the
Ensemble 20 modesl select best MSE 0610.ipynb
Notebook for the regression outputs on the test set
Note:
- Data is installed in the same directory in
./data/
directory (but not checked into this repo.)