You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The Gemini API supports image, video and audio inputs(unlike the OpenAI API where you send the file contents base64-encoded, you need to upload the file separately)
Would you be open to a PR that adds support for uploading files, or would you say that is out of scope of this project?
If it's out of scope, I can create a smaller PR that allows media URLs(with the upload happening outside the library):
Instructor.chat_completion(mode: :json_schema,model: "gemini-1.5-flash",response_model: VideoDesc,messages: [%{role: "user",content: [%{type: "video_url",video_url: %{url: "https://generativelanguage.googleapis.com/v1beta/files/..."}},%{type: "text",text: " what's going on in this video?"}]}])
The text was updated successfully, but these errors were encountered:
Currently, it's only possible to send text messages using the Gemini adapter:
instructor_ex/lib/instructor/adapters/gemini.ex
Line 61 in 1abd847
The Gemini API supports image, video and audio inputs(unlike the OpenAI API where you send the file contents base64-encoded, you need to upload the file separately)
Would you be open to a PR that adds support for uploading files, or would you say that is out of scope of this project?
If it's out of scope, I can create a smaller PR that allows media URLs(with the upload happening outside the library):
The text was updated successfully, but these errors were encountered: