Memory leak during inference #11

Open
andreped opened this issue Jan 7, 2022 · 0 comments
Labels: bug (Something isn't working)

andreped commented Jan 7, 2022

Memory does not seem to be freed after inference, at least not properly. This was observed with an ONNX model using both the TensorRT and OpenVINO (CPU) inference engines. It also affects running a model in batch mode (across multiple WSIs): memory keeps increasing for every new WSI until an out-of-memory (OOM) error eventually occurs.

For TensorFlow, this has been a popular topic for quite some time. It happens because the session, where all inference and model graphs are created and live, is global to the entire process. A common workaround in Python is therefore to run all TensorFlow work in a child process (using multiprocessing) and kill that child process after inference, which keeps the main process clean (see the sketch below).
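
For reference, a minimal sketch of that multiprocessing workaround (this is not FAST code; `predict_in_child_process`, `_worker`, `model_path`, and `patches` are illustrative names):

```python
import multiprocessing as mp


def _worker(model_path, patches, queue):
    # Import TensorFlow inside the child so its global session/graph
    # only ever exists in this process.
    import tensorflow as tf

    model = tf.keras.models.load_model(model_path)
    queue.put(model.predict(patches))


def predict_in_child_process(model_path, patches):
    ctx = mp.get_context("spawn")  # "spawn" avoids inheriting any TF state from the parent
    queue = ctx.Queue()
    proc = ctx.Process(target=_worker, args=(model_path, patches, queue))
    proc.start()
    result = queue.get()  # fetch result before join() to avoid blocking on a full queue
    proc.join()           # when the child exits, the OS reclaims all memory TF allocated
    return result
```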

However, spawning processes in C++ for this purpose is not viable. It is also surprising that the same (or at least a similar) memory leak was observed with ONNX Runtime. Maybe there is something that is not freed in FAST? Not sure.
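
One way to narrow this down could be to check whether ONNX Runtime leaks on its own, outside FAST, by creating and destroying sessions in a loop while watching resident memory. A rough sketch (assumes `psutil` is installed, and `model.onnx` plus the input shape are placeholders for the actual model):

```python
import gc

import numpy as np
import onnxruntime as ort
import psutil


def rss_mb():
    # Resident set size of the current process, in MB.
    return psutil.Process().memory_info().rss / 1e6


dummy = np.random.rand(1, 3, 256, 256).astype(np.float32)  # adjust to the model's input shape
for i in range(20):
    sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
    input_name = sess.get_inputs()[0].name
    sess.run(None, {input_name: dummy})
    del sess      # drop the session explicitly ...
    gc.collect()  # ... and force collection before measuring
    print(f"iteration {i}: RSS = {rss_mb():.1f} MB")
```

If memory stays flat here but grows inside FAST, that would point at something not being released on the FAST side rather than in ONNX Runtime itself.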

andreped added the bug (Something isn't working) label on Jan 7, 2022