[Common Issues] Releasing LLaVA-OneVision data yaml files in three stages (mid/single-image/onevision) #199

Luodian · 2024-09-01T05:09:00Z

Check out the three YAML files here:

https://github.com/LLaVA-VL/LLaVA-NeXT/tree/main/scripts/train

Luodian · 2024-09-01T05:12:57Z

Cross-pinning an explanation of the video data.

Luodian · 2024-09-01T05:13:31Z

Q: What about the video data?
A: It will be released alongside @ZhangYuanhan-AI's next version of a more powerful video model.
For now, we have released the data YAML used in the onevision stage as onevision.yaml.

You can check out the three video data subsets: (1) sharegpt4video_255000.json (see sharegpt4video), (2) 0718_0_30_s_academic_mc_v0_1_all.json (to be released), and (3) academic_source_30s_v1_all.json (to be released).

Luodian added the documentation label Sep 1, 2024

Luodian pinned this issue Sep 1, 2024
