uproot.dask
is turning TBranches of fixed-size C arrays into Dask arrays with shape (num_entries,)
, rather than (num_entries, fixed_size)
#1173
Labels
bug
The problem described is something that must be fixed
The issue raised in #1116 is that @Jailbone's test case creates a TTree of
double[fixed_size]
(one fixed-size array per entry), and this should be read as a 2D NumPy array of shape(num_entries, fixed_size)
, butuproot.dask
is presenting it to Dask as having shape(num_entries,)
. Then, of course, Dask does wrong things with it.Reproducer:
(
fixed_size
is 10.)But
There's only one place where Uproot creates a
dask.array
; it's here:uproot5/src/uproot/_dask.py
Line 459 in 724e377
Should we set the Dask array
shape
inchunks
, or is that something else? If we know that the TBranch's Interpretation isAsDtype
(the only type that can have more than one dimension), we can get the part of the shape beyond the number of entries withinner_shape
:The text was updated successfully, but these errors were encountered: