Parallelization of KDTree construction #92

dokempf · 2021-11-30T14:11:47Z

Performance benchmarking shows that, as it stands, the KDTree construction is the bottleneck of the entire algorithm. A naive approach to parallelization would be to distribute the construction of subtrees across threads. The coarsest levels of the tree do not parallelize well with this approach, because only a few subtrees exist there, but a sufficiently deep tree mitigates this disadvantage.

The implementation involves two non-trivial aspects:

Review nanoflann's internal data structures with respect to thread safety, as we would be operating outside of nanoflann's documented thread-safety guarantees. However, we only use the static version of nanoflann (the dynamic one would definitely not be thread-safe).
Add a task-based OpenMP parallelization to the recursive construction algorithm.
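
To make the second point concrete, here is a minimal sketch of what a task-based recursive construction could look like. It is not nanoflann's code and does not touch nanoflann's data structures: `Node`, `build`, `build_tree` and the `task_depth` cutoff are hypothetical names made up for illustration, the points are assumed to be 3D, and the split rule (median along a cycling dimension) is a simplification. Each recursion step spawns one OpenMP task per child. The two tasks work on disjoint ranges of the index array, and below `task_depth` the tasks are executed immediately by the encountering thread to limit scheduling overhead.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>
#include <memory>
#include <numeric>
#include <vector>

using Point = std::array<double, 3>;

// Hypothetical node type: a leaf has split_dim == -1 and owns idx[begin, end).
struct Node {
    std::size_t begin = 0, end = 0;
    int split_dim = -1;
    double split_val = 0.0;
    std::unique_ptr<Node> left, right;
};

std::unique_ptr<Node> build(const std::vector<Point>& pts,
                            std::vector<std::size_t>& idx,
                            std::size_t begin, std::size_t end,
                            int depth, std::size_t leaf_size, int task_depth)
{
    auto node = std::make_unique<Node>();
    node->begin = begin;
    node->end = end;
    if (end - begin <= leaf_size)
        return node;  // small enough: keep as leaf

    // Median split along a cycling dimension (simplified split rule).
    const std::size_t dim = static_cast<std::size_t>(depth) % 3;
    const std::size_t mid = begin + (end - begin) / 2;
    std::nth_element(idx.begin() + begin, idx.begin() + mid, idx.begin() + end,
                     [&](std::size_t a, std::size_t b) {
                         return pts[a][dim] < pts[b][dim];
                     });
    node->split_dim = static_cast<int>(dim);
    node->split_val = pts[idx[mid]][dim];

    // The children reorder disjoint ranges of idx and write to different
    // members of the parent, so they can run as independent tasks.
    Node* raw = node.get();
    #pragma omp task shared(pts, idx) firstprivate(raw) if(depth < task_depth)
    raw->left = build(pts, idx, begin, mid, depth + 1, leaf_size, task_depth);
    #pragma omp task shared(pts, idx) firstprivate(raw) if(depth < task_depth)
    raw->right = build(pts, idx, mid, end, depth + 1, leaf_size, task_depth);
    #pragma omp taskwait  // both children must be finished before returning
    return node;
}

std::unique_ptr<Node> build_tree(const std::vector<Point>& pts,
                                 std::vector<std::size_t>& idx,
                                 std::size_t leaf_size = 10, int task_depth = 4)
{
    idx.resize(pts.size());
    std::iota(idx.begin(), idx.end(), std::size_t{0});
    std::unique_ptr<Node> root;
    #pragma omp parallel
    {
        // One thread starts the recursion, the others pick up spawned tasks.
        #pragma omp single
        root = build(pts, idx, 0, idx.size(), 0, leaf_size, task_depth);
    }
    return root;
}
```

Compiled without OpenMP support, the pragmas are ignored and the same code runs serially, which makes it easy to compare against a serial baseline. The root split is still done by a single thread, which is the poor parallelization of the coarse levels mentioned above.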


dokempf · 2023-06-22T07:47:22Z

Nanoflann v1.5.0 added parallel construction support: https://github.com/jlblancoc/nanoflann/releases/tag/v1.5.0

However, my initial testing fell far short of the advertised speedup of 3. I got a speedup of around 20% for medium core counts, and the code got slower for large core counts. I am currently hesitant to include this in the code base, at least not without a user interface to control it.
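
A user-facing control could be as small as one setting that defaults to serial construction. The following is only a sketch: `BuildOptions` and `resolve_build_threads` are hypothetical names that do not exist in the code base, and the convention that 0 means "use all hardware threads" is an assumption. The resolved value would then be passed on to nanoflann's build thread count parameter (assumed here to be `n_thread_build` in `KDTreeSingleIndexAdaptorParams`; this should be checked against the v1.5.0 headers).

```cpp
#include <algorithm>
#include <thread>

// Hypothetical user-facing setting for the tree construction.
struct BuildOptions {
    // 1 = serial construction (default), 0 = use all hardware threads.
    unsigned requested_threads = 1;
};

// Turn the user request into a concrete thread count, capped at the number
// of hardware threads, since oversubscription only made things slower.
unsigned resolve_build_threads(
    const BuildOptions& opts,
    unsigned hardware_threads = std::thread::hardware_concurrency())
{
    if (hardware_threads == 0)
        hardware_threads = 1;  // hardware_concurrency() may return 0 if unknown
    if (opts.requested_threads == 0)
        return hardware_threads;
    return std::min(opts.requested_threads, hardware_threads);
}
```

Defaulting to a single thread keeps the current behavior unchanged, so users would only opt in where the parallel construction actually helps on their machine.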

dokempf added the enhancement (New feature or request) and performance (Performance Optimization related improvement or experiment) labels on Nov 30, 2021

dokempf mentioned this issue on Jun 22, 2023 in #256 "nanoflann v1.5.0 was released" (closed)
