Could you provide the images.zip file of the gpt4v-dataset? I have collected all the other datasets, but I can't find this one. #6

Closed
cheng-haha opened this issue Dec 25, 2024 · 5 comments

Comments

@cheng-haha

No description provided.

@cheng-haha
Author

cheng-haha commented Dec 25, 2024

In fact, my machine cannot connect to the Internet, so I can't download the images during training.

@cheng-haha
Author

A similar problem is also still unresolved: issues35

@zs-zhong
Member

Apologies for the late response! Since this dataset is quite large, you can try downloading it from this link: https://github.com/InternLM/InternLM-XComposer/blob/main/projects/ShareGPT4V/docs/Data.md.
Alternatively, let me know which specific images are missing. If there aren't too many, I can package and upload them for you.
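
One way to produce that list of missing images is to compare the paths referenced in the annotation file against what is on disk. The following is only a minimal sketch: it assumes the annotation file is a JSON list of records that store a relative image path under an `image` key, which may not match the actual file layout. The function name `find_missing_images` and its parameters are hypothetical.

```python
import json
from pathlib import Path


def find_missing_images(annotation_file, image_root, key="image"):
    """Return the sorted relative paths that are referenced but not on disk."""
    # Assumption: the annotation file is a JSON list of dicts, and each dict
    # may store a relative image path under `key`. Adjust to the real format.
    with open(annotation_file, encoding="utf-8") as f:
        records = json.load(f)

    root = Path(image_root)
    missing = set()
    for record in records:
        rel_path = record.get(key)
        # Records without an image entry (text-only samples) are skipped.
        if rel_path and not (root / rel_path).is_file():
            missing.add(rel_path)
    return sorted(missing)
```

Calling `find_missing_images("annotations.json", "data/")` (both paths are placeholders) returns a de-duplicated list that can be pasted into a reply or saved to a text file.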

@cheng-haha
Author

Thank you very much. I have already downloaded gpt4v-dataset from Hugging Face. Although some images are still missing, it shouldn't be a significant issue. I have now finished training Stage 2, but I only have the LoRA weights. How can I run the script to obtain a full CKPT similar to Lyra-Mini-3B? I ask because I want to run lyra_chartvqa_speech.sh, but the script doesn't save the complete CKPT.
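
For context, turning LoRA weights into a full checkpoint generally means adding the scaled low-rank update to each adapted weight matrix, W' = W + (alpha / r) * B A. The sketch below shows only that arithmetic in plain Python. It is not this repository's merge script, and `merge_lora` and its arguments are hypothetical names chosen for illustration.

```python
def merge_lora(base, lora_a, lora_b, alpha, rank):
    """Return base + (alpha / rank) * (lora_b @ lora_a) as nested lists.

    Shapes: base is (out, in), lora_a is (rank, in), lora_b is (out, rank).
    """
    scale = alpha / rank
    return [
        [
            base[i][j]
            + scale * sum(lora_b[i][k] * lora_a[k][j] for k in range(rank))
            for j in range(len(base[0]))
        ]
        for i in range(len(base))
    ]


# Toy example: a 2x2 identity base weight with a rank-1 adapter.
base = [[1.0, 0.0], [0.0, 1.0]]
lora_a = [[1.0, 2.0]]        # (rank=1, in=2)
lora_b = [[0.5], [1.0]]      # (out=2, rank=1)
merged = merge_lora(base, lora_a, lora_b, alpha=2, rank=1)
print(merged)  # [[2.0, 2.0], [2.0, 5.0]]
```

After merging, the adapter matrices are no longer needed at inference time, which is why a merged checkpoint can be loaded like an ordinary full model.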

@zs-zhong
Member

zs-zhong commented Jan 9, 2025

Hi, I’m going to close this issue as it seems to be resolved. If you have any further questions or concerns, feel free to reopen the discussion or create a new issue. Thanks!

@zs-zhong zs-zhong closed this as completed Jan 9, 2025