-
Notifications
You must be signed in to change notification settings - Fork 533
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ai-proxy 插件后续开发任务 #1530
Comments
@cr7258 补充两个点:
|
我反而觉得在所有 apikey 不可用时,让 higress 直接返回错误信息是不是会更好? 让用户知道 api key 有问题,应该进行人工干预。 将所有 api key 都从黑名单中放出既不能改变请求失败的结果(因为一旦有 api key 健康检测通过就会被加回来,全放出来还是全部都失败),另外这么做可能也会让逻辑更复杂些。 |
@cr7258 主要我担心所有 apikey 都被拉黑的情况,可能是我们自身逻辑或者突发网络问题等。所以如果从健壮性上考虑,都被拉黑时一次性都放出来会更合理一些 |
@johnlanni Ok |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
例如 qwen-max 限流比较严重,fallback 退化到 qwen-plus 仍然可以使用
避免 ai-proxy 插件自身逻辑有问题误将 apiToken 移除的情况
The text was updated successfully, but these errors were encountered: