NewsScrapperPy is a Streamlit web application that allows users to scrape and summarize news articles from the BBC website. The application utilizes Selenium for web scraping and Hugging Face's Transformers library for text summarization.
- Choose from multiple news categories including Sports, Business, Travel, Innovation, and Culture.
- Scrape a random news article from the selected category.
- Summarize the content of the news article.
- Display the title of the article and its summarized content.
-
Clone the repository:
git clone https://github.com/your_username/NewsScrapperPy.git
-
Install the required dependencies:
pip install -r requirements.txt
-
Run the Streamlit application:
streamlit run news_scraper.py
or
Run the EXE made with Pyinstaller:
news_scaraper.exe
-
Access the application in your web browser at
http://localhost:8501
.
- Select a news category from the dropdown menu.
- Click the "Get Summary" button to scrape and summarize a random news article from the selected category.
- The application will display the title of the article and its summarized content.
- Python
- Streamlit
- Selenium
- Hugging Face Transformers
- pyinstaller
This project uses the following open-source libraries:
- Streamlit: https://www.streamlit.io/
- Selenium: https://www.selenium.dev/
- Hugging Face Transformers: https://huggingface.co/transformers/
- BBC Website: https://www.bbc.com/
This project is licensed under the MIT License - see the LICENSE file for details.