Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add bonus difficulty to split-descriptive-statistics for big datasets #1967

Merged
merged 1 commit into from
Oct 11, 2023

Conversation

polinaeterna
Copy link
Contributor

@polinaeterna polinaeterna commented Oct 11, 2023

hopefully will fix JobManagerExceededMaximumDurationError for split-descriptive-statistics for big datasets like https://huggingface.co/datasets/Open-Orca/OpenOrca. It appears that loading data to in-memory table takes too much time

We can try like this and meanwhile I'm exploring other options how to query faster on big data

@codecov-commenter
Copy link

codecov-commenter commented Oct 11, 2023

Codecov Report

All modified lines are covered by tests ✅

Comparison is base (d429191) 95.58% compared to head (d7388b3) 90.74%.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1967      +/-   ##
==========================================
- Coverage   95.58%   90.74%   -4.85%     
==========================================
  Files          14      237     +223     
  Lines         521    14810   +14289     
==========================================
+ Hits          498    13439   +12941     
- Misses         23     1371    +1348     
Flag Coverage Δ
jobs_cache_maintenance 95.58% <ø> (ø)
jobs_mongodb_migration 86.32% <ø> (?)
libs_libcommon 92.17% <ø> (?)
services_admin 85.44% <ø> (?)
services_api 86.79% <ø> (?)
services_rows 84.95% <ø> (?)
services_search 79.59% <ø> (?)
services_sse-api 94.16% <ø> (?)
services_worker 92.56% <ø> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Coverage Δ
libs/libcommon/src/libcommon/config.py 77.16% <ø> (ø)

... and 222 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@polinaeterna polinaeterna marked this pull request as ready for review October 11, 2023 14:50
@polinaeterna polinaeterna merged commit 68f8abf into main Oct 11, 2023
@polinaeterna polinaeterna deleted the bouns-difficulty-to-stats branch October 11, 2023 14:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants