Add colpali #1

tonywu71 · 2024-09-19T15:58:59Z

What does this PR do?

Add ColPali support in 🤗 transformers.

Who can review?

yonigozlan

Thanks a lot for working on this!
I have only looked at the modeling file so far, but it seems to be on the right track :). The main comment for now is to run make fix-copies.
I also think this is a first in Transformers: a [Model]ForConditionalGeneration used inside another model for a different task. An alternative would be to put all the code added in ColPaliModel inside what is now PaliGemmaForConditionalGeneration, which would become ColPaliForRetrieval. I think that would make more sense since, unless I'm mistaken, ColPaliForConditionalGeneration isn't usable as is?
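
To make the two layouts discussed above concrete, here is a minimal sketch in plain Python. Every class and function in it is a hypothetical toy stand-in (none of them are the real transformers classes), and the "projection" is an arbitrary fixed formula chosen only so the example runs. It contrasts a retrieval model that wraps a generation model with a single retrieval class that owns the backbone directly.

```python
# Toy stand-ins only: none of these are the real transformers classes.

class ToyBackbone:
    """Stands in for the shared backbone; returns one hidden vector per token."""

    def hidden_states(self, tokens):
        return [[float(len(t)), 1.0] for t in tokens]


class ToyForConditionalGeneration:
    """Stands in for a [Model]ForConditionalGeneration: backbone plus generation."""

    def __init__(self):
        self.backbone = ToyBackbone()

    def hidden_states(self, tokens):
        return self.backbone.hidden_states(tokens)

    def generate(self, tokens):
        # The generation path is carried along even if retrieval never uses it.
        return tokens + ["<eos>"]


def project(vectors):
    """Hypothetical retrieval head: a fixed linear map to a 1-dim embedding."""
    return [[0.5 * v[0] + 0.5 * v[1]] for v in vectors]


class NestedRetrievalModel:
    """Layout as submitted: a retrieval model wrapping a generation model."""

    def __init__(self):
        self.model = ToyForConditionalGeneration()

    def embed(self, tokens):
        return project(self.model.hidden_states(tokens))


class FlatRetrievalModel:
    """Suggested alternative: one class owning the backbone and retrieval head."""

    def __init__(self):
        self.backbone = ToyBackbone()

    def embed(self, tokens):
        return project(self.backbone.hidden_states(tokens))


tokens = ["ab", "c"]
nested = NestedRetrievalModel().embed(tokens)
flat = FlatRetrievalModel().embed(tokens)
print(nested == flat)
```

Both layouts produce the same embeddings in this toy setting; the difference is only that the nested one still exposes an unused generation class underneath.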

yonigozlan · 2024-09-20T00:30:14Z

src/transformers/models/colpali/modeling_colpali.py

+
+
+@dataclass
+# Copied from transformers.models.paligemma.modeling_paligemma.PaliGemmaCausalLMOutputWithPast with PaliGemma->ColPali


Looks like the "Copied from" statements were not applied in any of the files. You can run make fix-copies in the terminal to apply them! There shouldn't be any "PaliGemma" left in the code itself.
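
For readers unfamiliar with the convention, the sketch below illustrates what a "Copied from ... with PaliGemma->ColPali" directive asks for: the referenced source is copied and the listed renames are applied to it. This is a simplified illustration, not the repository's actual tooling; the helper name `apply_copied_from` and the one-line `original` snippet are assumptions made up for the example.

```python
import re


def apply_copied_from(source: str, directive: str) -> str:
    """Apply the 'with A->B' renames of a '# Copied from' directive to source text.

    Hypothetical helper for illustration; the real check handles more cases.
    """
    match = re.search(r"\swith\s+(.+)$", directive.strip())
    if match is None:
        return source  # no rename requested, so the copy is verbatim
    for rule in match.group(1).split(","):
        old, new = (part.strip() for part in rule.split("->"))
        source = source.replace(old, new)
    return source


directive = (
    "# Copied from transformers.models.paligemma.modeling_paligemma."
    "PaliGemmaCausalLMOutputWithPast with PaliGemma->ColPali"
)
# Made-up stand-in for the referenced class definition.
original = "class PaliGemmaCausalLMOutputWithPast(ModelOutput):\n    ..."

renamed = apply_copied_from(original, directive)
print(renamed)
```

After the rename, no "PaliGemma" remains in the copied text, which is the state the comment above asks for.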


tonywu71 · 2024-09-26T18:07:39Z

Closed: follow-up PR can be found at huggingface#33736.


tonywu71 self-assigned this Sep 19, 2024

tonywu71 force-pushed the add-colpali branch from a852aee to 9a12eda Compare September 19, 2024 16:05

yonigozlan reviewed Sep 20, 2024


tonywu71 added 23 commits September 26, 2024 14:16

feat: run add-new-model-like

e377f97

feat: add paligemma code with "copied from"

e6b9d0c

feat: add ColPaliProcessor

75041b5

feat: add ColPaliModel

d942533

feat: add ColPaliConfig

289c808

feat: rename ColPaliForConditionalGeneration to ColPaliModel

07b5a98

fixup modeling colpali

9a06c08

fix: fix root import shortcuts

d4fd2d3

fix: fix modeling_auto dict

34e443b

feat: comment out ColPali test file

5e52bb3

fix: fix typos from add-new-model-like

8ba5649

feat: explicit the forward input args

3a16d6b

feat: move everything to modular_colpali.py

54d5310

fix: put back ColPaliProcesor

48f27f2

feat: add auto-generated files

710b4e2

fix: run fix-copies

dda3312

fix: remove DOCStRING constants to make modular converter work

3500c63

fix: fix typo + modular converter

fce02cf

fix: add missing imports

ad3ea52

feat: no more errors when loading ColPaliModel

dea3965

fix: remove unused args in forward + tweak doc

bb0818d

feat: rename ColPaliModel to ColPaliForRetrieval

fd5456b

fix: apply fix-copies

c36b20c

tonywu71 force-pushed the add-colpali branch from 9a12eda to c36b20c Compare September 26, 2024 17:35

tonywu71 added 2 commits September 26, 2024 20:00

temp fix for modular converter: drop commit when PR is merged!

1643705

feat: add ColPaliProcessor to modular_colpali

901cc8d

tonywu71 closed this Sep 26, 2024
