v1.5.0 not working with ollama #354

Open
SixK opened this issue Jan 25, 2025 · 3 comments

SixK commented Jan 25, 2025

v1.4.1 works with the following code,
but v1.5.0 no longer works (Ollama version is 0.5.7):

from smolagents.agents import ToolCallingAgent
from smolagents import CodeAgent
from smolagents import tool, LiteLLMModel, DuckDuckGoSearchTool, PythonInterpreterTool, VisitWebpageTool
from typing import Optional

model = LiteLLMModel(
    model_id="ollama_chat/qwen2.5",
    api_base="http://localhost:11434", # replace with remote open-ai compatible server if necessary
    # api_key="your-api-key", # replace with API key if necessary
    num_ctx=102400
)

agent = CodeAgent(tools=[], model=model, add_base_tools=True, additional_authorized_imports=['requests', 'bs4', 'os'])

print(agent.run("download https://raw.githubusercontent.com/huggingface/smolagents/refs/heads/main/src/smolagents/tool_validation.py and list existing functions contained in file"))

Ollama returns HTTP 400 Bad Request:

╭──────────────────────────────────────────────────────── New run ────────────────────────────────────────────────────────╮
│                                                                                                                         │
│ download https://raw.githubusercontent.com/huggingface/smolagents/refs/heads/main/src/smolagents/tool_validation.py and │
│ list existing functions contained in file                                                                               │
│                                                                                                                         │
╰─ LiteLLMModel - ollama_chat/qwen2.5 ────────────────────────────────────────────────────────────────────────────────────╯
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ Step 0 ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Give Feedback / Get Help: https://github.com/BerriAI/litellm/issues/new
LiteLLM.Info: If you need to debug this error, use `litellm.set_verbose=True'.

Error in generating model output:
litellm.APIConnectionError: Ollama_chatException - Client error '400 Bad Request' for url 'http://localhost:11434/api/chat'
For more information check: https://developer.mozilla.org/en-US/docs/Web/HTTP/Status/400
[Step 0: Duration 0.02 seconds]
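
The LiteLLM output above hints at one way to see what is actually being sent. A minimal sketch (my own debug plumbing, not part of the original repro) that enables LiteLLM's verbose logging around the same setup:

import litellm
from smolagents import CodeAgent, LiteLLMModel

# Assumption: litellm.set_verbose behaves as the error hint suggests and prints the
# full request/response, including the JSON body posted to /api/chat.
litellm.set_verbose = True

model = LiteLLMModel(
    model_id="ollama_chat/qwen2.5",
    api_base="http://localhost:11434",
    num_ctx=102400,
)
agent = CodeAgent(tools=[], model=model, add_base_tools=True)
agent.run("say hello")  # the 400 should now be printed together with the offending payload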

Using tcpdump, I see no particular change except the payload size: 9370 with v1.5.0 vs 9293 with v1.4.1.

With v1.5.0:

sudo tcpdump -vvv -i any -s 0 -A 'tcp[((tcp[12:1] & 0xf0) >> 2):4] = 0x504F5354'
tcpdump: data link type LINUX_SLL2
tcpdump: listening on any, link-type LINUX_SLL2 (Linux cooked v2), snapshot length 262144 bytes
12:05:25.611240 lo    In  IP (tos 0x0, ttl 64, id 35460, offset 0, flags [DF], proto TCP (6), length 253)
    localhost.47512 > localhost.11434: Flags [P.], cksum 0xfef1 (incorrect -> 0x5f72), seq 3424847324:3424847525, ack 1962811321, win 512, options [nop,nop,TS val 2477752340 ecr 2477752340], length 201
E.....@[email protected]..........,..#..t..............
........POST /api/chat HTTP/1.1
Host: localhost:11434
Accept: */*
Accept-Encoding: gzip, deflate
Connection: keep-alive
User-Agent: litellm/1.59.7
Content-Length: 9370
Content-Type: application/json

With v1.4.1:

12:02:20.875195 lo    In  IP (tos 0x0, ttl 64, id 43810, offset 0, flags [DF], proto TCP (6), length 253)
    localhost.41126 > localhost.11434: Flags [P.], cksum 0xfef1 (incorrect -> 0x215f), seq 3544001019:3544001220, ack 1390350312, win 512, options [nop,nop,TS val 2477567604 ecr 2477567604], length 201
E...."@.@.............,..=).R..............
...t...tPOST /api/chat HTTP/1.1
Host: localhost:11434
Accept: */*
Accept-Encoding: gzip, deflate
Connection: keep-alive
User-Agent: litellm/1.59.6
Content-Length: 9293
Content-Type: application/json

SixK commented Jan 25, 2025

I found a way to get the POST payload; the changes are shown in the screenshot:

[screenshot of the code changes]

To capture the POST message using tcpdump:
sudo tcpdump -i any -A -s 0 'tcp port 11434 and (((ip[2:2] - ((ip[0]&0xf)<<2)) - ((tcp[12]&0xf0)>>2)) != 0)'
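
For reference, a hypothetical alternative to tcpdump (not the change from the screenshot): run a tiny HTTP listener that dumps any POST body it receives and temporarily point api_base at it. The port 9999 is an arbitrary choice for illustration; the client request will fail after the dump since nothing meaningful is returned.

from http.server import BaseHTTPRequestHandler, HTTPServer

class DumpHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read and print the raw JSON payload that LiteLLM sends to /api/chat
        length = int(self.headers.get("Content-Length", 0))
        print(self.path)
        print(self.rfile.read(length).decode("utf-8", errors="replace"))
        self.send_response(200)
        self.end_headers()

HTTPServer(("localhost", 9999), DumpHandler).serve_forever()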

keetrap commented Jan 25, 2025

@SixK
Try using ollama/qwen2.5 instead of ollama_chat/qwen2.5.
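
Applied to the repro above, only the model_id prefix changes (everything else stays the same):

model = LiteLLMModel(
    model_id="ollama/qwen2.5",  # was "ollama_chat/qwen2.5"
    api_base="http://localhost:11434",
    num_ctx=102400,
)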

SixK commented Jan 25, 2025

Using ollama/qwen2.5 instead of ollama_chat/qwen2.5 works,
but ollama_chat/qwen2.5 is the recommended way for smolagents (if I remember correctly).
And the results don't seem as good as with ollama_chat (I have to experiment more to be sure).
