Skip to content
This repository has been archived by the owner on Jul 18, 2024. It is now read-only.

Refactor doc_loader.py to load documents concurrently using Ray actors or Spark tasks, instead of loading them all at once and then putting them into a dataset #510

Open
chaojun-zhang opened this issue Dec 26, 2023 · 0 comments

Comments

@chaojun-zhang
Copy link
Contributor

Refactor doc_loader.py to load documents concurrently using Ray actors or Spark tasks, instead of loading them all at once and then putting them into a dataset

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant