Web Content Downloader

A simple, quick script that lists all of the links on a given webpage and lets you choose which of the linked pages to download.

A good example is a candidate's "press releases" page, which links to individual press releases. The script lets you select those links and download their contents into a single .txt file for archiving and analysis.

Features

  • Lists all links found on an input webpage (see the link-listing sketch after this list)
  • Supports selecting multiple pages using ranges (e.g., "1-5") and individual numbers (e.g., "1,3,7")
  • Handles dynamically loaded JavaScript content
  • Saves all downloaded content to a single file with clear separators
  • Includes polite delays between requests
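
A minimal sketch of the link-listing step (not the script itself), assuming requests and beautifulsoup4 are installed; the start URL below is a placeholder:

    import requests
    from bs4 import BeautifulSoup
    from urllib.parse import urljoin

    start_url = "https://example.com/press-releases"  # placeholder start page
    html = requests.get(start_url, timeout=30).text
    soup = BeautifulSoup(html, "html.parser")

    # Collect every <a href>, resolving relative URLs against the start page
    links = [urljoin(start_url, a["href"]) for a in soup.find_all("a", href=True)]

    # Print a numbered list the user can choose from
    for i, link in enumerate(links, start=1):
        print(f"{i}. {link}")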

Requirements

  • Python 3.6+
  • Google Chrome browser
  • ChromeDriver (matching your Chrome version); a quick setup check is sketched after this list
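
To confirm that Chrome and a matching ChromeDriver are wired up before running the script, a check like the following can help (a minimal sketch assuming Selenium 4; the headless flag may differ on older Chrome versions):

    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options

    options = Options()
    options.add_argument("--headless=new")  # run Chrome without opening a window
    driver = webdriver.Chrome(options=options)
    print("Chrome version:", driver.capabilities["browserVersion"])
    driver.quit()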

Installation

  1. Create and activate a virtual environment (recommended):

    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
  2. Install required packages:

    pip install requests beautifulsoup4 selenium

Usage

  1. Run the script:

    python scraper.py
  2. Enter the URL of the start page when prompted

  3. Select which links to download using:

    • Ranges: "1-6"
    • Individual numbers: "8,12"
    • Combinations: "1-6,8,12,15-20" (a selection-parsing sketch follows these steps)
  4. The script will create downloaded_content.txt with the results
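
The exact parsing in scraper.py may differ, but a selection string like "1-6,8,12,15-20" can be expanded into link numbers roughly like this (a minimal sketch; parse_selection is a hypothetical helper, not necessarily the script's own function):

    def parse_selection(selection):
        """Expand input like "1-6,8,12,15-20" into a sorted list of link numbers."""
        indices = set()
        for part in selection.split(","):
            part = part.strip()
            if "-" in part:
                start, end = part.split("-", 1)
                indices.update(range(int(start), int(end) + 1))  # ranges are inclusive
            elif part:
                indices.add(int(part))
        return sorted(indices)

    print(parse_selection("1-6,8,12,15-20"))
    # [1, 2, 3, 4, 5, 6, 8, 12, 15, 16, 17, 18, 19, 20]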

Notes

  • Respects server load with 2-second delays between requests (see the download-loop sketch after this list)
  • Handles JavaScript-rendered content
  • Works with relative and absolute URLs
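
Taken together, the download step described above might look roughly like the following (a minimal sketch, not the script itself; the URL list and the "=" separator format are placeholder assumptions, while downloaded_content.txt is the script's actual output file):

    import time
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from bs4 import BeautifulSoup

    urls = [  # placeholder list of selected links
        "https://example.com/press-release-1",
        "https://example.com/press-release-2",
    ]

    options = Options()
    options.add_argument("--headless=new")
    driver = webdriver.Chrome(options=options)

    with open("downloaded_content.txt", "w", encoding="utf-8") as out:
        for url in urls:
            driver.get(url)  # lets JavaScript-rendered content load
            time.sleep(2)    # polite delay between requests
            text = BeautifulSoup(driver.page_source, "html.parser").get_text(separator="\n")
            out.write("=" * 80 + "\n" + url + "\n" + "=" * 80 + "\n")  # clear separator
            out.write(text.strip() + "\n\n")

    driver.quit()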
