support for llama3.2 vision #30

vaiju1981 · 2024-12-17T20:07:24Z

First of all thanks for the amazing work. It helps us build a very simple yet efficient router within our Java applications.

I was wondering if there is any plan to support LLama3.2 vision models.

--Thanks and Regards
Vaijanath

mukel · 2024-12-17T20:53:48Z

I looked into implementing the vision encoder component, specially for QwenVL models, which were merged into llama.cpp just a few days ago. I work on this on my spare time, which is not much lately. To make it easier in the future, I'm working on a simple tensor library for inference in Java. Slowly but I'm on it, I really enjoy hacking on this.

vaiju1981 · 2024-12-18T00:02:27Z

If you can provide with tensor library, I can take a stab at it. Right now in order to make llama3.2 vision to work with current code i need to make weights to have List<FloatTensor[]> and have identity operations for missing layers.

for example attn_q.weight is not available for all the layers.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support for llama3.2 vision #30

support for llama3.2 vision #30

vaiju1981 commented Dec 17, 2024

mukel commented Dec 17, 2024 •

edited

Loading

vaiju1981 commented Dec 18, 2024

support for llama3.2 vision #30

support for llama3.2 vision #30

Comments

vaiju1981 commented Dec 17, 2024

mukel commented Dec 17, 2024 • edited Loading

vaiju1981 commented Dec 18, 2024

mukel commented Dec 17, 2024 •

edited

Loading