add LLM in adapter and save query and answer #64

DreamCyc · 2024-12-18T23:25:25Z

No description provided.

hicofeng · 2024-12-19T16:26:42Z

Does the current LLM (Large Language Model) adapter for this project support streaming answers? For scenarios that require low latency, is there a plan to support this feature in the future if it's not available now? Thank you very much for your assistance.

DreamCyc · 2024-12-22T12:56:39Z

@hicofeng When the model is deployed to the server machine and provided with a URL, it can achieve streaming output to avoid user waiting. The functionality provided here is to invoke the deployed model when there are no matching results in the cached data, referring to the OpenAI specification, which may vary depending on the specific model and deployment method used.

add LLM in adapter and save query and answer

779e87b

DreamCyc mentioned this pull request Dec 18, 2024

[编程挑战季] 开发对接大模型 Adapter。 #51

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add LLM in adapter and save query and answer #64

add LLM in adapter and save query and answer #64

DreamCyc commented Dec 18, 2024

hicofeng commented Dec 19, 2024

DreamCyc commented Dec 22, 2024

add LLM in adapter and save query and answer #64

Are you sure you want to change the base?

add LLM in adapter and save query and answer #64

Conversation

DreamCyc commented Dec 18, 2024

hicofeng commented Dec 19, 2024

DreamCyc commented Dec 22, 2024