What causes the model to only output "<|im_end|>};" on some pages? #210

Open
plamb-viso opened this issue Nov 11, 2024 · 2 comments

Comments

@plamb-viso

I'm loading the model as specified in the demo script and running inference in much the same way:

res = model.chat(tokenizer, str(image_path), ocr_type='format')

On some page images, the only output I get back from the model is:

<|im_end|>};

These images tend to be pages where the entire page is a table and each cell can contain a fair bit of text.
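
To find out which pages in a batch are affected, the outputs can be checked for this pattern. The following is a minimal sketch, not part of the demo script: the helper names `is_degenerate` and `find_failed_pages` are hypothetical, and the regular expression assumes special tokens always take the form `<|name|>`, which I have not verified against the tokenizer.

```python
import re

# Assumption: special tokens look like <|name|>. This pattern is illustrative
# and is not taken from the model's tokenizer.
SPECIAL_TOKEN = re.compile(r"<\|[^|>]*\|>")


def is_degenerate(output: str, min_alnum: int = 1) -> bool:
    """Return True when no real text remains after removing special tokens."""
    stripped = SPECIAL_TOKEN.sub("", output)
    # Count letters and digits only, so leftovers like "};" do not count as text.
    alnum = sum(ch.isalnum() for ch in stripped)
    return alnum < min_alnum


def find_failed_pages(results: dict[str, str]) -> list[str]:
    """Return the sorted keys (e.g. image paths) whose output is degenerate."""
    return sorted(path for path, text in results.items() if is_degenerate(text))
```

Here `results` would map each image path to the string returned by `model.chat(tokenizer, str(image_path), ocr_type='format')`. This only identifies the failing pages; it does not explain or fix the behaviour.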

What causes this?

@rahulvalyu

Got the same issue. Did you solve it?

@plamb-viso
Author

I have not, but the model is currently being added to transformers, and I'm going to try it again once it's available through those APIs.
