-
Notifications
You must be signed in to change notification settings - Fork 63
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Occasional RuntimeError
in cudf.read_parquet
with kvikio backend with remote data
#599
Comments
Here's another one from a
(I realize now that this and #601 are closely related, since both will likely involve retries. #601 I think will rely on the HTTP status code when the HTTP request completes. This might be more involved). And another one in read_parquet on a dask worker:
Another one:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I'll occasionally see a
RuntimeError
when usingcudf.read_parquet
to read a parquet file from S3.I'll grab a full traceback next time I see one, but here's part of one:
I've also seen
At first glance, this looks a bit like some incomplete read from blob storage. Perhaps kvikio or cudf did a
.read(nbytes)
but less thannbytes
were returned?I'll try to get a more reproducible example and some more debug output.
The text was updated successfully, but these errors were encountered: