Machine learning methods to predict the positivity/sentiment of an IMDb movie review.
- Clone the repository:
git clone https://github.com/RylanSteinkey/imdbSentimentAnalysis.git
- Change directories into the project folder:
cd imdbSentimentAnalysis
- Install Anaconda or Miniconda (Python 3.7), following the official installation instructions
- Install dependencies: run
conda env create -f envi.yaml
- Run
snakemake
- Check results.txt for accuracy and a ranked list of important words
After the above has run successfully, you can run the other models by passing a model name to models.py:
python models.py XGB
-- XGBoost (68.2% accuracy with 1000 samples)
python models.py SVM
-- Support Vector Machine (68.4% accuracy with 1000 samples)
python models.py MNB
-- Multinomial Naive Bayes (68.4% accuracy with 1000 samples)
python models.py ANN
-- Artificial Neural Network (78.8% accuracy with 1000 samples)
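
For readers unfamiliar with this kind of model, here is a minimal sketch of a bag-of-words sentiment classifier with a ranked word list, roughly the kind of output described for results.txt. It is not the code in models.py: the toy reviews, the variable names, and the choice of scikit-learn's `CountVectorizer` and `MultinomialNB` are all assumptions made for illustration.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Toy reviews standing in for the IMDb data (hypothetical, not from the repo).
reviews = [
    "a wonderful heartfelt film with great acting",
    "great story and wonderful cast",
    "terrible plot and awful acting",
    "awful boring film with a terrible script",
]
labels = [1, 1, 0, 0]  # 1 = positive, 0 = negative

# Count word occurrences, then fit a Multinomial Naive Bayes classifier.
model = make_pipeline(CountVectorizer(), MultinomialNB())
model.fit(reviews, labels)

predictions = model.predict(["wonderful great film", "boring awful plot"])
print(predictions)  # [1 0]

# Rank words by log P(word | positive) - log P(word | negative).
# This scoring rule is an assumption; the repo may rank words differently.
vectorizer = model.named_steps["countvectorizer"]
classifier = model.named_steps["multinomialnb"]
scores = classifier.feature_log_prob_[1] - classifier.feature_log_prob_[0]
ranked = sorted(
    vectorizer.vocabulary_,
    key=lambda word: scores[vectorizer.vocabulary_[word]],
    reverse=True,
)
print("Most positive words:", ranked[:2])
print("Most negative words:", ranked[-2:])
```

On real data you would hold out a test set before reporting accuracy; with only four training sentences the numbers here mean nothing beyond showing the mechanics.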