GitHub - stressosaurus/processed-data-twitter-hashtag94: Twitter data scraped and preprocessed within the time frame from January 2013 to December 2020.

Twitter Hashtag 94 Data

Alex John Quijano

Description. This dataset is a collection of English tweets scraped with Twint (https://github.com/twintproject/twint), then processed, cleaned, and organized. Each processed tweet was then run individually through the Linguistic Inquiry and Word Count (LIWC, http://liwc.wpengine.com/) software. The tweets were scraped using 94 hashtags as queries, each query returning tweets that contain that hashtag.
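As a rough illustration of what "tweets containing those hashtags" means in practice, the sketch below pulls hashtags out of a tweet's text and checks them against a query list. It is not the pipeline used to build this dataset: the names `QUERY_HASHTAGS`, `extract_hashtags`, and `matches_query` are hypothetical, and the two placeholder hashtags stand in for the actual 94 queries, which are not listed here.

```python
import re

# Placeholder queries (assumption): the real 94 hashtags are not listed in this README.
QUERY_HASHTAGS = {"#example1", "#example2"}

# A hashtag is approximated here as '#' followed by letters, digits, or underscores.
HASHTAG_PATTERN = re.compile(r"#\w+")


def extract_hashtags(text):
    """Return the set of lowercased hashtags found in a tweet's text."""
    return {tag.lower() for tag in HASHTAG_PATTERN.findall(text)}


def matches_query(text, queries=QUERY_HASHTAGS):
    """Return True if the tweet contains at least one query hashtag."""
    return bool(extract_hashtags(text) & queries)
```

Matching is case-insensitive in this sketch, so `#Example1` and `#example1` count as the same hashtag; whether the original scraping treated case this way is an assumption.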

Getting Started.

git clone https://github.com/stressosaurus/twitter-hashtag-94-data.git
pip install --user -r requirements.txt

Jupyter Notebook.

jupyter notebook INSTRUCTIONS.ipynb

Files.

.gitignore
INSTRUCTIONS.ipynb
LICENSE.md
README.md
index.html
requirements.txt
tweet-processing-pipeline-8.png
