feat: add Max Tokens and Context Window Setting Options for Ollama Channel #1694
Merged
Conversation
MotorBottle changed the title from “Add max_tokens compability for Ollama Channel” to “Add Max Tokens and Context Window Setting Options for Ollama Channel” on Jul 26, 2024
Tested this locally and it works well; I recommend merging.
c121914yu approved these changes on Aug 6, 2024
Codecov Report (Attention: patch coverage)

Coverage Diff (main vs. #1694):
- Coverage: 1.30% → 1.30% (−0.01%)
- Files: 144 → 144
- Lines: 10140 → 10143 (+3)
- Hits: 132 → 132
- Misses: 9994 → 9997 (+3)
- Partials: 14 → 14

View full report in Codecov by Sentry.
songquanpeng changed the title from “Add Max Tokens and Context Window Setting Options for Ollama Channel” to “feat: add Max Tokens and Context Window Setting Options for Ollama Channel” on Aug 6, 2024
Thx~
mxdlzg pushed a commit to mxdlzg/one-api that referenced this pull request on Oct 15, 2024:

…annel (songquanpeng#1694)
* Update main.go with max_tokens param
* Update model.go with max_tokens param
* Update model.go
* Update main.go
* Update main.go
* Adds num_ctx param for Ollama Channel
* Added num_ctx param for ollama adapter
* Added num_ctx param for ollama adapter
* Improved data process logic
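The commits above wire max_tokens and num_ctx into the Ollama adapter. A minimal sketch of that mapping, using simplified illustrative structs rather than one-api's actual types (the JSON field names num_predict and num_ctx do match Ollama's REST API options):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// GeneralOpenAIRequest is a hypothetical, cut-down request shape carrying
// only the two fields this PR is about; the real one-api type has many more.
type GeneralOpenAIRequest struct {
	MaxTokens int `json:"max_tokens,omitempty"`
	NumCtx    int `json:"num_ctx,omitempty"`
}

// OllamaOptions mirrors the relevant keys of Ollama's "options" object.
type OllamaOptions struct {
	NumPredict int `json:"num_predict,omitempty"`
	NumCtx     int `json:"num_ctx,omitempty"`
}

// convertOptions maps the OpenAI-style request fields onto Ollama options.
func convertOptions(req GeneralOpenAIRequest) OllamaOptions {
	return OllamaOptions{
		NumPredict: req.MaxTokens, // max_tokens → Ollama's num_predict
		NumCtx:     req.NumCtx,    // pass the context-window override through
	}
}

func main() {
	opts := convertOptions(GeneralOpenAIRequest{MaxTokens: 256, NumCtx: 8192})
	b, _ := json.Marshal(opts)
	fmt.Println(string(b)) // {"num_predict":256,"num_ctx":8192}
}
```

Both fields use omitempty, so a request that sets neither parameter produces an empty options object and leaves Ollama's defaults untouched.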
Implemented:
- max_tokens (mapped to Ollama's num_predict): limits the number of output tokens.
- num_ctx (a native Ollama parameter): overrides Ollama's default context window size (by default only a 1k/2k context).
One issue: the changed lines in the screenshot below exist because the original code raised errors; after printing to the console I found that the original data was already valid JSON, so no braces needed to be added or removed. Strangely, the release build I currently run via docker compose works fine even without my change to these two lines (or perhaps the deployed version never contained them? I did not check). In any case, the fix adds a conditional check so that both situations are handled correctly.

This is confirmed not to be an isolated case: #1702 reports the same problem. Still, I think a conditional check is more robust than simply rewriting those lines.
close #1691
I have confirmed that this PR passed my own testing; relevant screenshots follow: