Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added stuff to readme #12

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
12 changes: 9 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
# Energy News Roundup Scraping

Chi Hack Night scraping project for energy news information
We scraped daily email digests from energynews.us ranging back all the way to 2013

## Setup
First install the base package requirements
Expand All @@ -14,8 +15,13 @@ Finally, download the required core english web for `spacy`

## Running the script
To run the script run the following in the project directory:
```bash
`
python3 scrape.py
```
`
## Output
The library of digests will be converted into a [csv file](digestItems.csv)
The library of digests will be converted into a [csv file](digestItems.csv)

## Topic Modeling
We also ran all the blurbs through a topic modeling program.
This helps determine how much certain words are used with each other.
Check it out in the [jupyter notebook](topic_modeling.ipynb)