Planet import taking too long on placex data loading #3335
-
During this stage of the import, seeing all CPU cores at maximum for several days is expected. What are the last lines of the import log? Which rank is it processing?
-
This sounds really slow. I would point to the "2 system" scenario or to generic slowness in either storage or network (is there traffic?). Does syslog (on both machines) or the postgres log show anything? Can you test with the import and DB on one VM to narrow this down?
-
The stage you are in is really just a copy from one (huuge) table to another. There is an INSERT trigger involved, but it shouldn't be doing very complicated things. If it has been hanging for 5 days though, I suspect that something in the trigger is going wrong. This is really tricky to debug. First of all, can you check which triggers are installed on the
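Listing the triggers on a table can be done from the PostgreSQL catalog. What follows is a minimal sketch in Python, not something posted in this thread. It assumes the table meant is placex (the comment above is cut off before naming it), a database called nominatim, and psql available on the PATH. The function names are hypothetical.

```python
import subprocess

# Assumption: the table of interest is "placex"; the comment is truncated
# before it names the table.
TABLE = "placex"
# Assumption: the Nominatim database is called "nominatim".
DATABASE = "nominatim"


def trigger_query(table: str) -> str:
    """Build a catalog query listing the user-defined triggers on `table`."""
    # Only accept plain identifiers, since the name is pasted into the SQL text.
    if not table.replace("_", "").isalnum():
        raise ValueError(f"unexpected table name: {table!r}")
    return (
        "SELECT tgname, tgenabled FROM pg_trigger "
        f"WHERE tgrelid = '{table}'::regclass AND NOT tgisinternal "
        "ORDER BY tgname;"
    )


def psql_command(database: str, sql: str) -> list:
    """Return the argument list for running one SQL statement through psql."""
    return ["psql", "-d", database, "-c", sql]


def list_triggers(database: str = DATABASE, table: str = TABLE) -> str:
    """Run the query via psql and return its textual output."""
    result = subprocess.run(
        psql_command(database, trigger_query(table)),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling list_triggers() on the database host should print one row per trigger. The tgenabled column shows whether each trigger is currently enabled.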
-
Good morning folks, after a few hiccups (some infra-related issues causing my VMs to reboot), everything went well.
For the next trial, I believe it would be a good idea to give more CPUs to the machine used for the loading (and to reduce the database side as well). Everything was done with a parallelism of 8. The database side was definitely underused the whole time. Thank you for the help! :)
-
Hi Team, yesterday I kicked off the import process and found myself in the same situation: it is taking too long, already more than 6 hours at "Load data into placex table...." (osm2pgsql took 48419s (13h 26m 59s) overall.)
In my case, though, I have a brand new cloud VM and am installing for the first time. Based on the conversation in this thread, I notice that I'm not using the machine's resources properly. It would be great if someone could review my machine details and suggest the best way to fully utilize the resources to get the Nominatim installation done as early as possible (especially the postgres parameters, the import command parameters, and how to clean up the database to start a fresh import).
Environment: I have installed both the postgres database and Nominatim on the same cloud VM, with the postgres data directory pointing to another mounted drive with 5 TB of space. 370 GB RAM.
Postgres database parameters:
Commands executed:
/usr/local/bin/nominatim import -j 6 --osm-file /srv/nominatim/build/planet-latest.osm.pbf 2>&1 | tee setup.log
-Prem
-
Hi folks,
I'm new to Nominatim and I'm currently trying to do a full Planet import. However, the import process has been at the placex table loading stage for 5 days already. I can see a few cores completely maxed out, while the actual disk writes are on the order of kB/s.
At this point I'm out of ideas, and given that each attempt takes multiple days, trying things at random is very unproductive :(
Let me share with you my setup and where the process is currently at.
Do you have any ideas on how to debug or speed up this process? Any hints would be much appreciated :)
Thank you!
Environment
The whole process is running on GCP VMs: one for the postgres database, another for the Nominatim import process.
Postgres VM:
Postgres details:
Nominatim import VM:
Process
I've used the following Nominatim options:
In terms of extra data I'm including these in the project directory:
And, finally, I'm running this command:
Status
From the log of nominatim import I can see the following:
The VM running nominatim import is currently completely idle, just waiting for postgres.
On the postgres side I can see it is running 7 queries that look like:
7 CPUs of this VM are at 100%. In terms of IO the writes are at about 100 kB/s.
Currently the data disk has 412 GB occupied.
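For anyone who wants to repeat this kind of check, the queries currently running on the postgres side can be read from the pg_stat_activity view. The sketch below is one way to do that from Python and is not the command used in this post. The database name nominatim, the availability of psql on the PATH, and the function names are all assumptions.

```python
import subprocess

# Lists active backends, longest-running first. The wait_event columns help
# distinguish a backend that is burning CPU (no wait event) from one that is
# blocked on a lock or on IO.
ACTIVE_QUERIES_SQL = (
    "SELECT pid, state, wait_event_type, wait_event, "
    "now() - query_start AS runtime, left(query, 80) AS query "
    "FROM pg_stat_activity "
    "WHERE state = 'active' AND pid <> pg_backend_pid() "
    "ORDER BY runtime DESC;"
)


def psql_command(database: str, sql: str) -> list:
    """Return the argument list for running one SQL statement through psql."""
    return ["psql", "-d", database, "-c", sql]


def show_active_queries(database: str = "nominatim") -> str:
    """Run the activity query via psql and return its textual output."""
    result = subprocess.run(
        psql_command(database, ACTIVE_QUERIES_SQL),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Running show_active_queries() a few minutes apart shows whether the same statements are still executing and how long each one has been running.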