This is the repository for The Natural Language Processing Workshop, published by Packt. It contains all the supporting project files necessary to work through the course from start to finish.
To get started with the project files, you'll need to:
The Natural Language Processing Workshop takes you through fundamental NLP techniques, such as preparing datasets, collecting text, extracting text, and sentiment analysis. As you progress, you’ll get to grips with creating your own chatbots and dynamic models.
- Obtain, verify, clean, and transform text data into a format suitable for analysis
- Use methods such as tokenization and stemming for text extraction
- Develop a classifier to categorize comments in Wikipedia articles
- Collect data from open websites with the help of web scraping
- Train a model to detect topics in a set of documents using topic modeling
- Discover techniques to represent text as word and document vectors
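
To give a feel for the tokenization, stemming, and vector-representation topics listed above, here is a minimal sketch using only the Python standard library. Python is an assumption here, and the function names (`tokenize`, `naive_stem`, `bag_of_words`) are hypothetical, not taken from the course material. The stemmer is a deliberately crude suffix stripper, not the Porter algorithm or any library implementation.

```python
import re
from collections import Counter

# Hypothetical suffix list for illustration; real stemmers use far richer rules.
SUFFIXES = ("ing", "ed", "es", "s")


def tokenize(text):
    """Lowercase the text and return its alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def naive_stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def bag_of_words(text):
    """Represent a document as a count of its stemmed tokens."""
    return Counter(naive_stem(token) for token in tokenize(text))


if __name__ == "__main__":
    print(tokenize("Cats are running!"))
    print(bag_of_words("Cats chased the cat"))
```

Because the stemmer only trims suffixes, it can produce non-words (for example, it does not undo doubled consonants), which is one reason dedicated NLP libraries are normally preferred for this step.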
If you've found this repository useful, you might want to check out some of our other workshop titles: