-
Notifications
You must be signed in to change notification settings - Fork 166
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Memory leakage after mapnik map.render
?
#1004
Comments
@JaylanChen - thanks for testing! To try to narrow down the issue, could you try replacing im.encode("png", function (err, buffer) {
fs.writeFileSync("map.png", buffer);
}); with something like im.save("map.png", function (err) {
if (err) throw err;
}); and let me know if memory leak persists ? |
test code map.load(xmlPath, function (err, map) {
console.log(err);
map.zoomAll();
const im = new mapnik.Image(256, 256);
map.render(im, function (err, im) {
if (err) throw err;
const imagePath = path.resolve("test/map.png");
im.save(imagePath, function (err) {
if (err) throw err;
});
});
}); test again memory leak persists. |
test gdal with natural_earth.tif <Map srs="+proj=longlat +ellps=GRS80 +no_defs +type=crs">
<Style name="raster">
<Rule>
<RasterSymbolizer/>
</Rule>
</Style>
<Layer name="layer" srs="+proj=longlat +ellps=GRS80 +no_defs">
<StyleName>raster</StyleName>
<Datasource>
<Parameter name="type">gdal</Parameter>
<Parameter name="file">data/natural_earth.tif</Parameter>
</Datasource>
</Layer>
</Map> http://localhost:3000/gengdal memory leak persists. also test shp memory is not leaked or not obvious. test code is updated (with tif and shp). |
@JaylanChen - Thanks for great self-contained test cases 👍. I had a chance to run your app. I'm not, yet, convinced there is a memory leak, though. I'm seeing high but stable memory usage with So far I only tested on macOS. I'll try running on Linux as well. But I noticed your code can (should!) be improved. You're creating new |
Thank you for your advice. Actually I used Pool to cache an instance of mapnik. this code only for test. |
I also tested it in the Linux ARM64 environment. The initial memory is about 67 mb. (Wait 2 minutes after startup) curl http://localhost:3000/gengdal curl http://localhost:3000/gengdal again. |
@JaylanChen - I did some testing on both macOS and Linux (Ubuntu 24.04) and I'm seeing stable memory usage over time. I did some modification to your app.js and settings, see below. I'm using OSM data and XML file (mapnik.xml) generated as per For load testing I'm using https://www.artillery.io/ e.g DEBUG=http artillery run mapnik-load-test.yml
config:
target: http://localhost:3000
phases:
- duration: 60
arrivalRate: 1
rampTo: 5
maxVusers: 20
name: Warm up
- duration: 30m
arrivalRate: 5
maxVusers: 50
name: Ramp up load
processor: "./location_generator.js"
scenarios:
- flow:
- get:
url: "/generate_map?easting={{ easting }}&northing={{ northing }}"
beforeRequest: getRandomLocation
const fs = require('fs');
const mapnik = require("@mapnik/mapnik");
const json = JSON.parse(fs.readFileSync('./data/Trees.geojson', 'utf8'));
const num_features = Object.keys(json.features).length;
const tr = new mapnik.ProjTransform(new mapnik.Projection("epsg:4326"),
new mapnik.Projection("epsg:3857"));
const generateRandomKey = (length) => Math.floor(Math.random() * length);
const getRandomLocation= (requestParams, context, ee, next) => {
var key = generateRandomKey(num_features);
const lon = json.features[key].geometry.coordinates[0];
const lat = json.features[key].geometry.coordinates[1];
const coord = tr.forward([lon, lat]);
context.vars.easting = Math.floor(coord[0]);
context.vars.northing = Math.floor(coord[1]);
next();
};
module.exports = {
getRandomLocation,
};
module.exports = {
apps: [
{
name: 'mapnik-test',
script: './src/app.js',
instances : "4",
exec_mode : "cluster"
},
],
};
const express = require("express");
const mapnik = require("@mapnik/mapnik");
const path = require("path");
const fs = require("fs");
const genericPool = require("generic-pool");
const { pid } = require('node:process');
mapnik.register_default_fonts();
mapnik.register_default_input_plugins();
const app = express();
const port = 3000;
const map_factory = {
create: function() {
const map = new mapnik.Map(4*256, 4*256);
const xmlPath = path.resolve("../openstreetmap-carto/mapnik.xml");
//const xmlPath = path.resolve("test/gdal_map.xml");
map.loadSync(xmlPath);
console.log(`--> Load Map pid:${pid} xml:${xmlPath}`);
return map;
},
destroy: function(map) {
delete map;
console.log(`<-- Destroy Map ${pid}`);
}
};
const opts = {
max: 6,
min: 2
};
const pool = genericPool.createPool(map_factory, opts);
app.get("/", (req, res) => {
res.send("Hello World, express & mapnik!");
});
app.get("/generate_map", (req, res) => {
const mapPromise = pool.acquire();
mapPromise
.then(function(map) {
var easting = +req.query.easting;
var northing = +req.query.northing;
var bbox = [easting - 1000, northing - 1000, easting + 1000, northing + 1000];
map.zoomToBox(bbox);
const im = new mapnik.Image(4*256,4*256);
map.render(im, function(err, im) {
if (err) throw err;
im.encode('png256', function (err, buffer) {
if (err) throw err;
res.type('png');
res.send(buffer);
console.log(`==> pid:${pid} req(x:${easting} y:${northing}) res(size:${buffer.length})`);
pool.release(map);
});
});
})
.catch(function(err) {
res.send(`FAIL:${err}`);
});
});
app.listen(port, () => {
console.log(`Example app listening on port ${port}`);
});
process.on('SIGINT', function() {
pool.drain().then(function() {
pool.clear();
});
}); |
Generally, the small amount of data (the amount of data covered by a single tile picture) will not continuously increase in memory; when the amount of data is relatively large (the threshold is not clear), it will not be released. I use the thumbnail of the entire map to reproduce the scene where a single tile picture contains a large amount of data. I downloaded the osm us-pacific data and imported it into the pg db using osm2pgsql. osm2pgsql -d osm-us-pacific -U postgres -P 5332 -H 192.168.1.121 -U postgres -W -C 25000 us-pacific-latest.osm.pbf I updated the test code adding logic to the pool of the map instance.(This is similar to your test code, and the same as real use). After the service starts, the memory takes up about 95mb. http://localhost:3000/genpg If there is no concurrency (instances remain minimum 2), the memory stabilizes but is not released. If concurrency occurs, such as 4 requests, two more instances are created and the memory is increased again. (1.7gb) An instance takes up so much memory, is there something wrong? |
@JaylanChen - I'm wondering if "forking" processes might be related to this issue. Could you try "cluster" mode to see if memory usage pattern changes? e.g module.exports = {
apps: [
{
name: 'mapnik-test',
script: './src/app.js',
instances : "4",
exec_mode : "cluster"
},
],
}; |
full table datathe first request, the node with id 1 is responsible for processing the request: the second request, the node with id 3 is responsible for processing the request: the third request, the node with id 1 is responsible for processing the request: concurrent with 20 requestsfive mapnik instances were created for each node (two were already created when the pool was initialized). the memory of each node is about the same. concurrent with 20 requests again. reuse the map instance, with less memory increase. table data with filter (way_area < 50)
3 separate requests.pm2 restart pm2.config.js concurrent with 20 requestsdue to the small amount of data, the single request time is short, with only 3 nodes, and an additional instance is created. HTP. |
@JaylanChen - thanks for trying ^ I'm going to investigate memory usage/leaks further. |
@JaylanChen - It looks like defining NAPI_EXPERIMENTAL improves memory management of a running node process. I'm going to do more testing and if everything is OK I'll release development package to try. I was using following script (OOM without 'use strict'
const mapnik = require("@mapnik/mapnik");
for (var i = 0; i < 10000000; ++i)
{
var im = new mapnik.Image(256, 256);
if (i % 10000 == 0)
{
const memoryUsage = process.memoryUsage();
console.log('Memory Usage:', memoryUsage);
// if (global.gc) {
// global.gc();
// } else {
// console.log('Garbage collection unavailable. Pass --expose-gc '
// + 'when launching node to enable forced garbage collection.');
// }
}
} ref -> nodejs/node-addon-api#1213 |
When the new development package is released, please let me know that I can help test the validation. |
@JaylanChen => Give a try and let me know ^ |
Same as in the previous version. I have also tested the following code
|
@JaylanChen - my mistake, I built using npm install @mapnik/[email protected] Only linux-x64 support for now ^^ Following script now completes for me (before was running out of memory, including const mapnik = require("@mapnik/mapnik");
for (var i = 0; i < 10000000; ++i)
{
const im = new mapnik.Image(256, 256);
if (i % 1000 == 0)
{
const memoryUsage = process.memoryUsage();
const json = JSON.stringify(process.memoryUsage())
console.log(`\nCount=${i} Memory Usage:${json}`);
}
} |
Adding missing context here -
Not accounting for external memory allocations in c++ land is a design choice. The whole issue is easily mitigated with caching/reusing of |
Sorry for not testing the feedback in time. Is there something wrong with my test code? I have a question, why universal code, but different data filter conditions, different memory usage?
According to the current phenomenon, when rendering the tile image, the corresponding data still occupies the memory. Should the data usage be released? |
@JaylanChen - thanks for helping testing memory usage. I had a chance to look into this again. I loaded The issue (as I understand) is that memory allocations in native c++ code are not accounted for by JS GC. This is by design and the only way to trigger more frequent GC sweeps is to use It seems consensus is that unless high RSS usage causes OOM it can be ignored and JS GC will do the right job managing process(s) memory e.g I also tested sync and async versions of loading Can you trigger OOM or you're just concerned about high RSS usage? |
I have read the above instructions and understood the difference in memory management between node and c + +; Yes, I am more concerned about high RSS usage; because in previous use, memory increases too fast due to individual tile requests (many features); causing insufficient memory errors. Next, I will try to set the maximum node memory and see if I can avoid this problem. Thank you for your help. I ll close this issue first and create a new issue if there are other issues. |
os: ubuntu:22.04
nodejs: v22
@mapnik/mapik: v4.6.5
pm2 v5.4.2
Based on pg database test, data count: 182104; geom type: multipolygon, 353MB
When generating a thumbnail with the following code:
Service memory will increase a lot, and not released, continue to increase.
map.xml
generate the thumbnail code
http://localhost:3000/
MEM: 84MB
http://localhost:3000/generate
MEM: 84MB → 391MB
wait 5 minutes later, MEM: 391MB
http://localhost:3000/generate
MEM: 391MB → 690MB
test code, (no pg table data).
The text was updated successfully, but these errors were encountered: