The code here processes text for use in our models. The only file of note is the HTML parser.
The HTML parser is a class that pulls text from NGO websites. Given a URL or a list of URLs, it extracts the text of the corresponding website(s) using the BeautifulSoup library.
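As a rough illustration of the idea, the sketch below shows one way text could be pulled from pages with BeautifulSoup. It is not the parser class itself: the function names `extract_text` and `fetch_text`, the choice to drop `script`, `style`, and `noscript` elements, the `User-Agent` header, and the timeout are all assumptions made for this example.

```python
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup


def extract_text(html):
    """Return the readable text of an HTML document as one string (hypothetical helper)."""
    soup = BeautifulSoup(html, "html.parser")
    # Assumption: script, style, and noscript contents are not wanted as model input.
    for tag in soup(["script", "style", "noscript"]):
        tag.decompose()
    # Strip each text fragment and join the non-empty ones with single spaces.
    return soup.get_text(separator=" ", strip=True)


def fetch_text(urls, timeout=10):
    """Map each URL to its page text; accepts one URL or a list (hypothetical helper)."""
    if isinstance(urls, str):
        urls = [urls]
    texts = {}
    for url in urls:
        # Some sites reject requests that carry no User-Agent header.
        request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
        with urlopen(request, timeout=timeout) as response:
            texts[url] = extract_text(response.read())
    return texts
```

Under these assumptions, calling `fetch_text` with a single URL or with a list of URLs would return a dictionary keyed by URL, so both input forms described above share one code path. Error handling for unreachable sites is left out to keep the sketch short.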