Pull requests: mosaicml/streaming

Pull requests list

Add upper bound for prefix_int
#823 opened Nov 5, 2024 by XiaohanZhangCMU
8 tasks
add jpeg quality option
#818 opened Oct 28, 2024 by cabreraalex
8 tasks
Update pre-commit requirement from <4,>=2.18.1 to >=2.18.1,<5 (label: dependencies)
#797 opened Oct 7, 2024 by dependabot bot
Refactor spanner to avoid creating large array
#773 opened Sep 3, 2024 by XiaohanZhangCMU
8 tasks done
Bump databricks-sdk from 0.29.0 to 0.30.0 (label: dependencies)
#761 opened Aug 19, 2024 by dependabot bot
Check file size within LocalUploader
#751 opened Aug 13, 2024 by XiaohanZhangCMU
8 tasks
Heterogeneous
#684 opened May 24, 2024 by XiaohanZhangCMU Draft
8 tasks
Update google-cloud-storage requirement from <2.11.0,>=2.9.0 to >=2.9.0,<2.17.0 (label: dependencies)
#641 opened Mar 25, 2024 by dependabot bot
parallel merge index
#590 opened Feb 5, 2024 by XiaohanZhangCMU
8 tasks
Add varint to MDS
#574 opened Jan 23, 2024 by knighton
Add options to precompute the epoch
#569 opened Jan 20, 2024 by knighton
Nuke 1) torch dist, 2) shared memory, and 3) filelock
#556 opened Dec 30, 2023 by knighton
Add fine-grained timings to Writers
#555 opened Dec 30, 2023 by knighton
Let's blow away dist, and also shared memory
#552 opened Dec 26, 2023 by knighton Draft
2 of 3 tasks
Parquet streaming [WIP]
#538 opened Dec 15, 2023 by knighton
"Golden spike" PR
#488 opened Oct 28, 2023 by knighton Draft
Hf ingestion
#483 opened Oct 23, 2023 by XiaohanZhangCMU
8 tasks
Modify dataframe_to_mds to accept streaming DF
#478 opened Oct 20, 2023 by maddiedawson
8 tasks
Training on PQ shards
#443 opened Sep 22, 2023 by knighton
8 tasks
tag shared and temp files with username
#430 opened Sep 11, 2023 by acutkosky
3 of 8 tasks
Parallelize StreamingDataset index downloads.
#285 opened Jun 2, 2023 by knighton
8 tasks