Added local llm support to cypher-core. #2

Open
skillsharer wants to merge 7 commits into main

Conversation

skillsharer

Summary of Changes:

Added comprehensive support for Huggingface models, including server setup and model implementations. Currently, Qwen/Qwen2.5-7B-Instruct is supported. Further information is in src/huggingface/README.md.

Introduced new TypeScript adapters and clients for the Qwen model.
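For context, the client/adapter split looks roughly like this (a simplified sketch, not the actual PR code: the endpoint path, payload fields, and class names below are placeholders):

```typescript
// Minimal sketch of a local Huggingface server client plus a thin Qwen
// adapter. The '/generate' route and { text } response shape are assumptions.

interface ChatMessage {
  role: 'system' | 'user' | 'assistant';
  content: string;
}

interface LocalLLMAdapter {
  chat(messages: ChatMessage[]): Promise<string>;
}

class QwenAdapter implements LocalLLMAdapter {
  constructor(
    private baseUrl = 'http://localhost:8000', // hypothetical local server address
    private model = 'Qwen/Qwen2.5-7B-Instruct',
  ) {}

  async chat(messages: ChatMessage[]): Promise<string> {
    // POST the chat messages to the local Huggingface server.
    const res = await fetch(`${this.baseUrl}/generate`, {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify({ model: this.model, messages }),
    });
    if (!res.ok) throw new Error(`Local LLM server error: ${res.status}`);
    const data = (await res.json()) as { text: string };
    return data.text;
  }
}
```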

@kingbootoshi
Owner

yo qwen is sick. does qwen need its own adapter? or can the adapter be hugging face in general?

just tryna figure out how the hugging face interface works

@skillsharer
Author

That was one of the questions on my mind as well. There are multiple factors we need to consider:

  • LLMs, especially local ones, respond in different ways because they have fewer parameters and are trained on less data, and sometimes on more specific data (e.g. see the difference between instruct-tuned and general LLMs).
  • Where do you want to handle the output? If you want to keep the output handling on the TypeScript side, it would be consistent with the API adapters, so multiple adapters are needed for the Huggingface LLMs as well (see the sketch after this list). If not, the output handling could go into the server side, where it would be consistent with those classes, but that can result in a big (probably hard-to-maintain) adapter later on if we use multiple LLMs.
  • We need to consider image and video input/output handling as well.
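To make the first option concrete, here is one way per-model output handling could stay on the TypeScript side without one huge adapter. This is only a sketch; the endpoint, payload shape, and the Qwen stop-token handling are illustrative assumptions, not existing cypher-core code:

```typescript
// A generic Huggingface adapter that takes a per-model output parser, so
// model-specific quirks stay in small functions instead of one big class.

type OutputParser = (raw: string) => string;

// Qwen's chat template ends turns with <|im_end|>; trim anything after it.
const qwenInstructParser: OutputParser = (raw) =>
  raw.split('<|im_end|>')[0].trim();

const defaultParser: OutputParser = (raw) => raw.trim();

class HuggingfaceAdapter {
  constructor(
    private model: string,
    private parse: OutputParser = defaultParser,
    private baseUrl = 'http://localhost:8000', // hypothetical local server
  ) {}

  async complete(prompt: string): Promise<string> {
    const res = await fetch(`${this.baseUrl}/generate`, {
      method: 'POST',
      headers: { 'Content-Type': 'application/json' },
      body: JSON.stringify({ model: this.model, prompt }),
    });
    const { text } = (await res.json()) as { text: string };
    return this.parse(text); // model-specific cleanup happens on the TS side
  }
}

// Usage (illustrative):
// const qwen = new HuggingfaceAdapter('Qwen/Qwen2.5-7B-Instruct', qwenInstructParser);
```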

@kingbootoshi
Owner

https://huggingface.co/Qwen/QVQ-72B-Preview

would this support this reasoning model from qwen? it has vision tech. looks insane

we should handle output on the baseAgent typescript side so it's consistent

for LLM models that don't have image support, i wanted to route them through a fireworks vision model and just get the returned text added to the base agent chat history so it knows the context of the image (explain this image in deep detail)
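roughly what i'm imagining (just a sketch, not cypher-core's actual API; describeImageWithFireworks and ChatHistory are made-up names):

```typescript
// If the active model can't take images, describe the image with a separate
// vision model and append the description to the agent's chat history.

interface ChatHistory {
  push(entry: { role: 'user' | 'assistant' | 'system'; content: string }): void;
}

// Hypothetical helper that calls a Fireworks-hosted vision model and returns
// a detailed text description of the image.
declare function describeImageWithFireworks(imageUrl: string): Promise<string>;

async function addImageContext(
  history: ChatHistory,
  imageUrl: string,
  modelSupportsImages: boolean,
): Promise<void> {
  if (modelSupportsImages) {
    // Vision-capable model: pass the image through directly (not shown here).
    return;
  }
  // Text-only model: get a description and put it in the history so the
  // agent knows the context of the image.
  const description = await describeImageWithFireworks(imageUrl);
  history.push({
    role: 'system',
    content: `Image context (described by a vision model): ${description}`,
  });
}
```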

@skillsharer
Author

Routing images is a great idea! I checked the 72B model. Naively, I hoped I could run it on my MacBook Pro, but despite the 64 GB of RAM and the M2 architecture, I was not able to run inference. I created a branch on my cypher-core fork where the server side is done for this model: https://github.com/skillsharer/cypher-core/tree/feature/integrate-qwen-vision-model
If you have enough resources, go and try it out! However, these models are quite large for everyday local usage. We could distribute the workload and decide what runs locally and when we should call APIs instead.
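A rough sketch of how that local-vs-API routing decision could look (the criteria, thresholds, and field names are purely illustrative assumptions):

```typescript
// Route each request either to a local model or a hosted API based on
// simple criteria: vision needs, whether a local model is loaded, and size.

type Route = 'local' | 'api';

interface RoutingInput {
  promptTokens: number;
  needsVision: boolean;
  localModelLoaded: boolean;
}

// Thresholds here are illustrative, not tuned values.
function chooseRoute(input: RoutingInput): Route {
  if (input.needsVision) return 'api';         // large vision models won't fit locally
  if (!input.localModelLoaded) return 'api';   // nothing running locally
  if (input.promptTokens > 4000) return 'api'; // long contexts go to the hosted model
  return 'local';
}
```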
