Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The performance for video caption seems poor #50

Open
Hyu-Zhang opened this issue Apr 25, 2024 · 1 comment
Open

The performance for video caption seems poor #50

Hyu-Zhang opened this issue Apr 25, 2024 · 1 comment

Comments

@Hyu-Zhang
Copy link

Hello, I used the code and weights you provided to execute the inference.py file, but the results seem to be very different from what is shown. Do you know what is the reason for this please?

image
image

@tsaishien-chen
Copy link
Contributor

tsaishien-chen commented Apr 28, 2024

Hi @Hyu-Zhang,
Thanks for your interest in our captioning algorithm and sorry for your inconvenience.
This issue seems to be duplicate as #12. I hyposize this happens because you are using different tokenizer or LLM model.
Did you follow this guideline to prepare vicuna-7b-v0 weight?
Basically, you need to first download the original weight and apply delta weights. Could you please check for that?
You can also check some issues (like this one) in FastChat repo for reference!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants