perf: use fast `uproot.num_entries` method instead of `TTree.num_entries` #1197

pfackeldey · 2024-10-23T15:12:44Z

uproot.num_entries is a fast path to get the number of events. It should always be faster than TTree.num_entries. This may become noticeable for preprocessing very large filesets.

The nanoevents factory should use uproot.num_entries as well (at some point); in this PR I just removed the overhead of calculating it twice.
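For readers unfamiliar with the two access patterns being compared, here is a minimal sketch. The helper names `count_entries_fast` and `count_entries_via_tree` are hypothetical and not part of this PR or of coffea. It assumes that `uproot.num_entries` accepts a `"file:tree"` string and yields `(file_path, object_path, num_entries)` tuples, as described in the uproot documentation. uproot is imported lazily so the helpers can be defined without it being installed.

```python
def count_entries_fast(path, treename):
    """Count entries via the module-level fast path (assumed uproot API)."""
    import uproot  # lazy import: only needed when the helper is called

    # Assumption: uproot.num_entries yields (file_path, object_path, num_entries).
    return sum(n for _, _, n in uproot.num_entries(f"{path}:{treename}"))


def count_entries_via_tree(path, treename):
    """Count entries by opening the file and reading TTree.num_entries."""
    import uproot  # lazy import: only needed when the helper is called

    with uproot.open(path) as root_file:
        # Accessing the tree deserializes its metadata, including num_entries.
        return root_file[treename].num_entries
```

Both helpers should return the same integer for the same tree; only the amount of metadata that gets read differs.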


ikrommyd · 2024-10-23T15:19:10Z

@pfackeldey I was wondering, because I find preprocessing slow (many minutes for large datasets): did you try it and see a speed improvement?

pfackeldey · 2024-10-23T15:23:06Z

@ikrommyd I have not measured it, because I don't have an analysis setup to do so. If you can, I'd be happy to see the difference!
I'm not sure how much one can gain here (I didn't profile the preprocess method), but calculating the number of entries should always be faster with uproot.num_entries than with TTree.num_entries.
(I noticed slow preprocessing as well when running the AGC. Actually, I might be able to measure the performance gain with the 200GBps challenge?)

lgray · 2024-11-14T14:53:13Z

So the reason I used TTree rather than uproot.num_entries was to avoid multiple file opens, which can be the bottleneck in cases using xrootd or any network filesystem.

We're already opening the file and deserializing the TTree metadata to get the base form and greatest common basket offsets anyway on line 69 of preprocess. IIUC, this also deserializes the num_entries info for the tree. It makes very little sense to make a request to open the file again so that we can access num_entries faster without having to deserialize the full tree.

I'd be happy to see a benchmark that shows this change is faster though!
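A benchmark of the kind requested here could be structured roughly as follows. This is only a sketch: `time_call` and `compare` are hypothetical helper names, and the callables being timed (for example one wrapping uproot.num_entries and one wrapping TTree.num_entries) are supplied by the caller. For remote files, caching between repeats can skew the numbers, so results should be read with care.

```python
import time
from statistics import median


def time_call(func, *args, repeats=5):
    """Return the median wall-clock time in seconds of func(*args)."""
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        func(*args)
        timings.append(time.perf_counter() - start)
    return median(timings)


def compare(candidates, *args, repeats=5):
    """Time each named callable with the same arguments.

    candidates maps a label to a callable; returns {label: median seconds}.
    """
    return {
        name: time_call(func, *args, repeats=repeats)
        for name, func in candidates.items()
    }
```

Calling `compare({"fast": f, "tree": g}, path, treename)` with two entry-counting callables would give one median timing per strategy.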

lgray · 2024-11-14T15:34:10Z

That being said, if you want to make a version of this that's optimized for the case where all you're doing is getting the number of entries (i.e. save_form=False and align_clusters=False; if either is true then you need to deserialize a bunch more data anyway) and that uses uproot.num_entries, then that would make a fair amount of sense. You'd likely need to call a different function from preprocess, or you could alter get_steps to use the optimal file-opening pattern depending on the options passed.
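The option-dependent pathway described above could look roughly like this. It is a sketch under stated assumptions, not the actual get_steps code: `count_entries`, `open_tree`, and `fast_num_entries` are hypothetical names, and the two callables are injected so that the file is only opened once on either branch.

```python
def count_entries(path, treename, *, save_form, align_clusters,
                  open_tree, fast_num_entries):
    """Pick the cheaper way to get the entry count for the given options.

    open_tree(path, treename) is assumed to return an object exposing
    .num_entries (e.g. an uproot TTree); fast_num_entries(path, treename)
    is assumed to return an int (e.g. a wrapper around uproot.num_entries).
    Returns (num_entries, tree_or_None).
    """
    if save_form or align_clusters:
        # The tree metadata is needed anyway, so reuse the single file open.
        tree = open_tree(path, treename)
        return tree.num_entries, tree
    # Only the count is needed: take the fast path and skip the full tree.
    return fast_num_entries(path, treename), None
```

Returning the opened tree on the first branch lets the caller go on to extract the form or basket offsets without a second open.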

pfackeldey · 2024-11-14T15:37:28Z

Hi @lgray,
that makes sense to me. I didn't see a visible improvement in a test based on the AGC.
Feel free to close this PR :)

lgray · 2024-11-14T15:46:11Z

I'll leave it open if you want to try the one optimized pathway I mentioned. If you don't, then I'll close it.

pfackeldey and others added 3 commits October 23, 2024 11:03

use fast uproot.num_entries method

600629e

fallback to tree.num_entries for now

fa12703

[pre-commit.ci] auto fixes from pre-commit.com hooks

9ecc3ce

for more information, see https://pre-commit.ci

pfackeldey changed the title ~~use fast uproot.num_entries method instead of TTree.num_entries~~ perf: use fast uproot.num_entries method instead of TTree.num_entries Oct 23, 2024
