BERT is a state-of-the-art natural language processing model from Google. Using its latent representations, it can be repurposed for various NLP tasks, such as sentiment analysis.
This simple wrapper, built on the Transformers library (for managing the BERT model) and PyTorch, achieves 92% accuracy at classifying IMDB reviews as positive or negative.
First, you need to prepare the IMDB data, which is publicly available. The format used here is one review per line, with the first 12500 lines being positive, followed by 12500 negative lines. Alternatively, you can simply download the dataset from my Google Drive here. The default folder read by the script is data/.
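As a sketch of this layout, loading the file might look like the following (the `load_imdb` helper and its arguments are illustrative, not part of the actual script):

```python
# Minimal sketch of reading the IMDB file described above: one review per
# line, the first 12500 lines labelled positive (1), the rest negative (0).
# Function name and parameters are hypothetical -- the script may differ.
def load_imdb(path, n_positive=12500):
    reviews, labels = [], []
    with open(path, encoding="utf-8") as f:
        for i, line in enumerate(f):
            reviews.append(line.strip())
            labels.append(1 if i < n_positive else 0)
    return reviews, labels
```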
Training with default parameters can be performed simply by:
python script.py --train
Optionally, you can change the output directory for weights or the input directory for the dataset.
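The command-line interface could be wired up roughly like this with argparse. Only `--train`, `--evaluate`, and `--predict` appear in this README; the directory flag names and defaults below are my guesses, so check the script's `--help` for the real ones:

```python
import argparse

# Rough sketch of the script's CLI. The directory flags (--output-dir,
# --data-dir) and the weights default are hypothetical assumptions; only
# the data/ default folder is stated in this README.
def build_parser():
    parser = argparse.ArgumentParser(description="BERT sentiment wrapper")
    mode = parser.add_mutually_exclusive_group(required=True)
    mode.add_argument("--train", action="store_true")
    mode.add_argument("--evaluate", action="store_true")
    mode.add_argument("--predict", metavar="TEXT")
    parser.add_argument("--output-dir", default="weights/")  # hypothetical
    parser.add_argument("--data-dir", default="data/")       # default from README
    return parser
```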
You can find out how great you are (until your grandma gets her hands on BERT as well) simply by running:
python script.py --evaluate
Of course, you need to train the model first or get the weights from my drive.
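Evaluation boils down to comparing predicted labels against the known ones; a figure like the 92% above comes from a plain accuracy computation of this kind (a sketch, independent of the actual script's internals):

```python
# Plain accuracy: fraction of predictions that match the true labels,
# e.g. over the IMDB test split. A sketch, not the script's own code.
def accuracy(predicted, actual):
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)
```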
python script.py --predict "It was a truly amazing experience."
or
python script.py --predict "It was as terrible and disgusting as coffee topped with ketchup."
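Under the hood, prediction reduces the model's two output logits to a human-readable label; the last step might look like this sketch (the label ordering is an assumption, the actual script may use the opposite convention):

```python
# Map a pair of classifier logits to a sentiment label.
# Assumes index 0 = negative, index 1 = positive -- an assumption,
# not confirmed by this README.
def label_from_logits(logits):
    return "positive" if logits[1] > logits[0] else "negative"
```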