A SIMPLE BBC NEWS SCRAPING TOOL USING STREAMLIT AND SELENIUM IN PYTHON

K-GOKULAPPADURAI/NewsScrapperPy

NewsScrapperPy

NewsScrapperPy is a Streamlit web application that allows users to scrape and summarize news articles from the BBC website. The application utilizes Selenium for web scraping and Hugging Face's Transformers library for text summarization.

Features

  • Choose from multiple news categories including Sports, Business, Travel, Innovation, and Culture.
  • Scrape a random news article from the selected category.
  • Summarize the content of the news article.
  • Display the title of the article and its summarized content.
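The category selection and random-article features above could be sketched roughly as follows. This is an illustration only, not the project's actual code: the section paths in `CATEGORY_PATHS`, the `/article` link heuristic, and all function names are assumptions, and BBC's page structure may differ or change.

```python
import random
from urllib.parse import urljoin

# Assumed mapping from category names to BBC section paths (not taken from the project).
BBC_BASE_URL = "https://www.bbc.com"
CATEGORY_PATHS = {
    "Sports": "/sport",
    "Business": "/business",
    "Travel": "/travel",
    "Innovation": "/innovation",
    "Culture": "/culture",
}


def category_url(category):
    """Return the section URL for a category; raises KeyError if unknown."""
    return urljoin(BBC_BASE_URL, CATEGORY_PATHS[category])


def filter_article_links(hrefs, marker="/article"):
    """Keep unique links containing the marker (an assumed heuristic for article pages)."""
    unique = []
    for href in hrefs:
        if href and marker in href and href not in unique:
            unique.append(href)
    return unique


def pick_random_article(links, rng=random):
    """Choose one article URL at random; return None when the list is empty."""
    return rng.choice(links) if links else None


def collect_links(section_url):
    """Load the section page in headless Chrome and return every anchor href."""
    # Imported lazily so the pure helpers above work without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.common.by import By

    options = Options()
    options.add_argument("--headless=new")
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(section_url)
        anchors = driver.find_elements(By.CSS_SELECTOR, "a[href]")
        return [anchor.get_attribute("href") for anchor in anchors]
    finally:
        # Always close the browser, even if the page fails to load.
        driver.quit()
```

A call such as `pick_random_article(filter_article_links(collect_links(category_url("Travel"))))` would then yield one candidate article URL, or `None` if no matching links were found.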

Getting Started

  1. Clone the repository:

    git clone https://github.com/your_username/NewsScrapperPy.git
    
  2. Install the required dependencies:

    pip install -r requirements.txt

  3. Run the Streamlit application:

    streamlit run news_scraper.py

    Alternatively, run the executable built with PyInstaller:

    news_scaraper.exe
    
  4. Access the application in your web browser at http://localhost:8501.

Usage

  • Select a news category from the dropdown menu.
  • Click the "Get Summary" button to scrape and summarize a random news article from the selected category.
  • The application will display the title of the article and its summarized content.
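The flow described above might be wired together along these lines. This is a minimal sketch under stated assumptions rather than the repository's implementation: `scrape_article` is a hypothetical callable returning `(title, text)`, the 400-word chunk size and the length limits are arbitrary choices, and the summarizer is the Transformers summarization pipeline with whatever default model it selects.

```python
def chunk_words(text, max_words=400):
    """Split text into pieces of at most max_words words each."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def summarize_article(text, summarizer, max_words=400):
    """Summarize each chunk with the given callable and join the partial summaries."""
    parts = []
    for chunk in chunk_words(text, max_words):
        # The pipeline returns a list with one dict per input text.
        result = summarizer(chunk, max_length=130, min_length=30, do_sample=False)
        parts.append(result[0]["summary_text"].strip())
    return " ".join(parts)


def load_summarizer():
    """Create a Transformers summarization pipeline using its default model."""
    # Imported lazily because loading the library and model is slow.
    from transformers import pipeline

    return pipeline("summarization")


def main(scrape_article):
    """Render the page; scrape_article(category) -> (title, text) is hypothetical."""
    import streamlit as st

    st.title("NewsScrapperPy")
    category = st.selectbox(
        "Category", ["Sports", "Business", "Travel", "Innovation", "Culture"]
    )
    if st.button("Get Summary"):
        with st.spinner("Scraping and summarizing..."):
            title, text = scrape_article(category)
            summary = summarize_article(text, load_summarizer())
        st.subheader(title)
        st.write(summary)
```

Passing the summarizer in as an argument keeps `summarize_article` easy to test with a stub. In a real script, `main` would be called at module level with an actual scraping function so that `streamlit run` executes it.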

Technologies Used

  • Python
  • Streamlit
  • Selenium
  • Hugging Face Transformers
  • PyInstaller

Credits

This project is built on the open-source libraries listed under Technologies Used.

License

This project is licensed under the MIT License - see the LICENSE file for details.
