I consider RSS a foundation of free (as in freedom) internet. I am in pain, because many websites either do not support RSS at all, or do it very poorly, eg. without proper timestamps, icons, or uuids.
This repository is a bunch of PHP files, that generate RSS using PHP's DOMDocument, CSS accessors and SimpleXMLElement. The basic idea is to load the page, parse HTML, access proper elements, convert them to the RSS specs, and put into a RSS-compliant XML.
I am not interested in creating a generic engine, because every site has its quirks, and those PHP files are simple as you know what. Why PHP? Because 99.999% of public hosting services support PHP, so using this magic boils down to copying the file on your server, and adding the URL into your favorite reader.
My favorite reader is NetNewsWire.
Project uses PHP Composer to setup dependencies.
brew install --formula php
brew install --formula composer
composer install
https://github.com/ivopetkov/html5-dom-document-php - proper DOM Document access for HTML5 websites https://github.com/gajus/dindent - HTML formatter
https://github.com/guzzle/guzzle/ - HTTP client (not yet used)
- creating base classes for separating tasks and streamlining processes
- using Atom instead of RSS 1.0