Enable LLaVa-1.5 in VLM Pipeline #917
Conversation
…for llava and minicpm (branch updated from 42d275f to 790981f)
Please resolve the conflict.
```cpp
/// @brief A model for image encoding.
ov::InferRequest m_encoder;
/// @brief A config to follow.
ProcessorConfig m_processor_config;

// LLaVa specific members
ov::InferRequest m_vision_embeddings;
```
Should we define a base class for VisionEncoder and create inherited classes for MiniCPM and LLaVa specifically? In that case we wouldn't need to put the fields for different models into one common class, as in the sketch below. Similarly, the processor config could be a model-specific class.
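A hedged sketch of what that split could look like. The class names, the `encode` signature, and the `ProcessorConfig` stub are illustrative assumptions, not the actual openvino.genai API; `ov::InferRequest`, `ov::Tensor`, and the member names are taken from the diff above.

```cpp
#include <openvino/openvino.hpp>

// Stand-in for the PR's real ProcessorConfig (defined in the PR sources).
struct ProcessorConfig {};

// Hypothetical base class: each backend owns only its own state.
class VisionEncoderBase {
public:
    virtual ~VisionEncoderBase() = default;
    // Turn a raw image tensor into vision embeddings for the LLM.
    virtual ov::Tensor encode(const ov::Tensor& image) = 0;
};

// MiniCPM-specific backend keeps its encoder and its own config.
class VisionEncoderMiniCPM : public VisionEncoderBase {
public:
    ov::Tensor encode(const ov::Tensor& image) override;
private:
    ov::InferRequest m_encoder;
    ProcessorConfig m_processor_config;
};

// LLaVa-specific backend only needs its vision embeddings model.
class VisionEncoderLLaVa : public VisionEncoderBase {
public:
    ov::Tensor encode(const ov::Tensor& image) override;
private:
    ov::InferRequest m_vision_embeddings;
};
```

With this split, VLMPipeline would hold a single `VisionEncoderBase` pointer and stay unaware of which model backend is behind it.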
```cpp
    }
}

ov::Tensor VLMPipeline::get_inputs_embeds_minicpm(const std::string& prompt, const std::vector<ov::Tensor>& images) {
```
Can we create a dedicated class responsible for inputs embedding (see the sketch after this comment)? Such a class would also be used in the continuous batching implementation. VLMPipeline would then compute input embeddings without knowing anything model- or pipeline-specific, and perform inference of only the LLM part in auto-regressive mode.
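One possible shape for such a component, named after the VisionTextInputsEmbedder mentioned below. The interface and method signature are assumptions for illustration, not a merged design; the signature mirrors `get_inputs_embeds_minicpm` from the diff above.

```cpp
#include <string>
#include <vector>
#include <openvino/openvino.hpp>

// Hypothetical interface: VLMPipeline (and later continuous batching)
// would ask this class for ready-to-use input embeddings and then run
// only the auto-regressive LLM part on the result.
class VisionTextInputsEmbedder {
public:
    virtual ~VisionTextInputsEmbedder() = default;
    // Merge the tokenized prompt with image embeddings into a single
    // embeddings tensor for the language model.
    virtual ov::Tensor get_inputs_embeds(
        const std::string& prompt,
        const std::vector<ov::Tensor>& images) = 0;
};
```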
Agreed to split the VisionEncoder implementation into model-specific backends and to introduce VisionTextInputsEmbedder as separate tasks after LLaVA-NeXT / InternVL enablement in GenAI.
There's a conflict again.
Please update the list of supported models: https://github.com/openvinotoolkit/openvino.genai/blob/master/src/docs/SUPPORTED_MODELS.md#visual-language-models
Ticket: CVS-153333