This project for UC Berkeley's undergraduate Data Science Honor's Program seeks to find characteristics of U.S. politicians (ex. age, gender, political party, regional median income, etc.) which most lead to the respective politicians' toxicity levels on Twitter. This repository is modified from the following: https://github.com/ls88-openscienceconnector/final-project.
This repo contains a few subfolders which contain the elements of this project.
Folder | Description |
---|---|
data |
data from the original analysis in CSV, TSV, and pip-separated format |
images |
image files used in the Jupyter Notebooks |
notebooks |
Jupyter Notebook files, including reproduction analysis using both pandas and datascience , as well as the Pre-Analysis Plan and a "blank" notebook |
output |
output data (same as original, for demo purposes) in CSV and TSV format |