-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switch to osm2pgsql expiry and osm2pgsql-replication #987
Comments
I think we also need to include |
That's correct. All arguments that are only for the pgsql output still need to be given manually to osm2pgsql-replication. |
This has now been deployed on balerion and comparison with bowser shows that it has increased the amount of rendering which is expected now we're dirtying more tiles. |
balerion's CPU usage is about 2x that of bowser, with more high peaks when some relations get touched. Not a huge impact on disk utilization, and IO pressure is approximately double, it's also <2% in both cases so the disks are nowhere maxed out. I think all the servers have sufficient capacity to take the increased load. |
This has now been rolled out across all eight servers. |
One of the common rendering complaints is due to not dirtying tiles when relations change. Switching to osm2pgsql expiry will fix this, as well as allow us to use osm2pgsql-replication, significantly simplifying tile replication scripts. I was setting up a client machine and I was astonished how simple it is these days.
osm2pgsql expiry for the pgsql backend runs in
hybrid
mode which expires all tiles in a multipolygon belowfull_area_limit
and only the boundary above. The default value forfull_area_limit
is 20000 but this can be changed with--expire-bbox-size
. This means if someone makes a tag edit to the US boundary all the tiles along the edge will be expired, but not the whole of the US.osm2pgsql comes with osm2pgsql-replication, which stores all state in the database.
The setup command is just
osm2pgsql-replication init -d gis
and it will determine the date from the data in the database. On a new import it would beosm2pgsql-replication init -d gis --osm-file planet-latest.osm.pbf
.We need a script to take the tile list and touch the relevant files to indicate they need re-rendering. Switch2OSM documents this. Adapting their example slightly, the following would be
/usr/local/bin/expire-tiles
The key is
--touch-from
. When expiring tiles above that zoom, it touches them to indicate they are stale.We should consider adjusting these later.
Ignoring error handling and logging, the command that needs to be run is
osm2pgsql-replication update -d gis --post-processing /usr/local/bin/expire-tiles -- --log-progress=false --number-processes=1 --expire-tiles=13-19 --expire-output=/var/lib/renderd/dirty_tiles.txt
. Settings like hstore, multi-geometry, flat nodes, and style are all stored in the DB by osm2pgsql and are not required when running with--append
.This will download up to 500MB (or otherwise if
--max-diff-size
is set), apply it to the DB, store the tiles list, and run post-processing.To run this on a regular schedule, Switch2OSM recommends a cron job, but osm2pgsql recommends a systemd service. systemd services are better for this
For that we'd create
/etc/systemd/system/osm2pgsql-update.service
And a timer in
/etc/systemd/system/osm2pgsql-update.timer
Note for anyone doing this themselves, they'd also have to enable the timer and start it the first time - see osm2pgsql docs.
This will eliminate replicate.erb, expire-tiles.erb, expire.rb, and expire-tiles-single
Possible issues and notes
--expire-bbox-size
--touch-from=13 --min-zoom=12
, but let's get the basics going firstThe text was updated successfully, but these errors were encountered: