Description. This dataset is a collection of English tweets scraped using Twint (https://github.com/twintproject/twint) and was processed, cleaned, and organized. The processed tweets was then put individually into the Linguistic Inquiry and Word Count or LIWC (http://liwc.wpengine.com/) software. There are 94 hashtags used as queries to scrape tweets containing those hashtags.
Getting Started.
git clone https://github.com/stressosaurus/twitter-hashtag-94-data.git
pip install --user -r requirements.txt
Jupyter Notebook.
jupyter notebook INSTRUCTIONS.ipynb