[Common Issues] Releasing LLaVA-OneVision data yaml files in three stages (mid/single-image/onevision) #199

Open
Luodian opened this issue Sep 1, 2024 · 2 comments
Labels
documentation Improvements or additions to documentation

Comments


Luodian commented Sep 1, 2024

Check out the three YAML files here:

https://github.com/LLaVA-VL/LLaVA-NeXT/tree/main/scripts/train

Luodian added the documentation label on Sep 1, 2024.
Luodian pinned this issue on Sep 1, 2024.

Luodian commented Sep 1, 2024

Cross-pinning the explanation of the video data:

#130


Luodian commented Sep 1, 2024


Q: What about the video data?
A: It will be released with @ZhangYuanhan-AI's next version of a more powerful video model.
For now, we have released the data YAML used in the onevision stage as onevision.yaml.

You can check out the three video data subsets: (1) sharegpt4video_255000.json (see sharegpt4video), (2) 0718_0_30_s_academic_mc_v0_1_all.json (to be released), and (3) academic_source_30s_v1_all.json (to be released).
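
Since two of the three video subsets are not yet released, it may help to check which files you actually have before launching the onevision stage. The snippet below is only a minimal sketch: the file names and release notes come from the comment above, while the `find_missing_subsets` helper and the `./data` directory are hypothetical and not part of the LLaVA-NeXT codebase.

```python
from pathlib import Path

# File names and status notes are taken from the comment above.
VIDEO_SUBSETS = {
    "sharegpt4video_255000.json": "see sharegpt4video",
    "0718_0_30_s_academic_mc_v0_1_all.json": "to be released",
    "academic_source_30s_v1_all.json": "to be released",
}


def find_missing_subsets(data_dir):
    """Return the subset file names that are not present in data_dir."""
    root = Path(data_dir)
    return [name for name in VIDEO_SUBSETS if not (root / name).is_file()]


if __name__ == "__main__":
    # "./data" is a hypothetical location; point it at your own download directory.
    for name in find_missing_subsets("./data"):
        print(f"missing: {name} [{VIDEO_SUBSETS[name]}]")
```

Any file reported as missing would need to be dropped from your local copy of the YAML, or obtained once it is released, before training on that stage.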
