Support image input in the chat completion request #55

Youho99 · 2024-07-10T15:28:16Z

Tested with a single image

This pull request addresses issue #54.

It adds support for the OpenAI API request structure in which a message's content is a list of parts that can include an image.

Example from the OpenAI documentation:

curl https://api.openai.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "model": "gpt-4-turbo",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What'\''s in this image?"
          },
          {
            "type": "image_url",
            "image_url": {
              "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
            }
          }
        ]
      }
    ],
    "max_tokens": 300
  }'
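To illustrate the request shape above, here is a minimal Python sketch of how a server could separate the text and image parts of a message. `split_content` is a hypothetical helper written for this illustration, not the code in this PR, and the image URL is a placeholder. It assumes `content` is either a plain string (the text-only form) or a list of typed parts as in the example.

```python
from typing import Any, Dict, List, Tuple, Union

# Content is either a plain string or a list of typed parts.
Content = Union[str, List[Dict[str, Any]]]


def split_content(content: Content) -> Tuple[str, List[str]]:
    """Return (text, image_urls) for one message's content (hypothetical helper)."""
    # Text-only form: the content is just a string.
    if isinstance(content, str):
        return content, []

    texts: List[str] = []
    image_urls: List[str] = []
    for part in content:
        part_type = part.get("type")
        if part_type == "text":
            texts.append(part["text"])
        elif part_type == "image_url":
            image_urls.append(part["image_url"]["url"])
        else:
            # Fail loudly on part types this sketch does not handle.
            raise ValueError(f"Unsupported content part type: {part_type!r}")
    return "\n".join(texts), image_urls


message = {
    "role": "user",
    "content": [
        {"type": "text", "text": "What's in this image?"},
        {"type": "image_url", "image_url": {"url": "https://example.com/boardwalk.jpg"}},
    ],
}
text, image_urls = split_content(message["content"])
```

Accepting both forms keeps existing text-only clients working while allowing one or more images per message.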

The code has not been formatted yet, so that still needs review.

lhenault · 2024-07-12T09:48:15Z

Thanks for your work, will happily review this once you think it's ready (and passing the pre-commit check). If you have a working example for VLM / image processing to share, that would be a nice addition to the existing ones.

Youho99 · 2024-07-15T14:38:03Z

Don't use version 1.65.0 of grpcio and grpcio-tools (a withdrawn release).

I don't know how to exclude it in the Poetry requirements.

Youho99 · 2024-07-16T08:42:47Z

I just updated the version constraints for grpcio and grpcio-tools in pyproject.toml and regenerated poetry.lock.

Since this is my first time doing this, please review that part carefully.
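
For reference, a sketch of what a constraint of this kind can look like, assuming Poetry's `!=` exclusion syntax. The lower bound shown is a placeholder for illustration, not the value used in this PR:

```toml
[tool.poetry.dependencies]
# Placeholder lower bound; the point of the sketch is excluding 1.65.0.
grpcio = ">=1.60.0,!=1.65.0"
grpcio-tools = ">=1.60.0,!=1.65.0"
```

After editing the constraint, running `poetry lock` regenerates poetry.lock to match.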

Youho99 · 2024-07-16T08:43:24Z

I will provide an example of using this feature as a follow-up (probably in another PR).

Youho99 · 2024-07-16T08:44:24Z

@lhenault I think you can review this PR now (and update the version accordingly) :)

lhenault · 2024-08-28T10:09:28Z

Hey @Youho99!

I tried your changes the other day and ran into a few issues, though they were probably on my end. Thanks again for your PR and sorry for the delay, it's very much appreciated. 😌

Let me have another look soon (a working example for image inputs might speed things up).

Youho99 · 2024-08-28T12:14:46Z

@lhenault

I'll get back to it in the next few days and provide an example.

Let me know if you have any problems.

ggiret-thinkdeep added 2 commits July 10, 2024 15:18

Support image input in the chat completion request

4f312ef

> Tested with a single image

Code matching for the Stream part

3b2ae8c

Youho99 marked this pull request as ready for review July 15, 2024 14:35

ggiret-thinkdeep added 2 commits July 15, 2024 14:59

pre-commit formatting

1cd286d

Update grpc versions

2249c2a
