Change Model Via the API #1713

mckinlec · 2023-05-01T18:43:31Z

mckinlec
May 1, 2023

Is it possible to change the model via the API? Also, is there an API docs page? I can't seem to find it, I don't have an API link at the bottom of my UI

Dynathresh · 2023-08-10T07:02:05Z

Dynathresh
Aug 10, 2023

You can post a request to load a different model.

req = {
        'action': 'load',
        'model_name': model,
}  # You can tack on more params to this request - see the reference link at bottom of comment
response = requests.post(f'http://{HOST}/api/v1/**model**', json=request)  # This will swap out the model running on the server
return response.json()  # This just shows you the server response to you loading the new model, to help you verify it loaded

I haven't yet found a way to ask the server which models are available.

All learned from here:

text-generation-webui/api-examples/api-example-model.py

Line 42 in 4ba30f6

def complex_model_load(model):

1 reply

mckinlec Aug 10, 2023
Author

Thanks, that's been added since I last took a look, I started using KoboldAI instead but that has issues putting the model on the GPU when loading through the api. I'll take another look at this api

matatonic · 2023-08-10T18:43:34Z

matatonic
Aug 10, 2023

See here for how listing models is used:

text-generation-webui/api-examples/api-example-model.py

Line 132 in 4ba30f6

for model in model_api({'action': 'list'})['result']:

Ex:

def model_api(request):
    response = requests.post(f'http://{HOST}/api/v1/model', json=request)
    return response.json()

all_models = model_api({'action': 'list'})['result']

for m in all_models:
   ...

9 replies

bradylangdale Jan 8, 2024

@VladStepanov It's changed a little bit. Here's the implementation for reference.

You can call it like this.

def load_model(base_url, args_json={}):
    response = requests.post(base_url + 'internal/model/load/', json=args_json)
    return response.json()

url = 'http://localhost:5000/v1/'
args = { "model_name": "Llama2" }

load_model(url, args)

VladStepanov Jan 8, 2024

Thanks! Works perfect

SalomonKisters Mar 15, 2024

Hey guys, this doesnt work anymore, does anyone know how to do it now?

Grego3j Mar 24, 2024

@SalomonKisters Do you have the API command-line flag enabled? Checking these boxes helped me:

SalomonKisters Mar 25, 2024

Yes, I do. This is my command:
bash -c '/app/modules/text-generation-webui/start_linux.sh --listen --listen-host 0.0.0.0 --model $MODEL_FOLDER_NAME --api --api-key $DEFAULT_API_KEY --nowebui'

deevis · 2024-03-25T05:02:38Z

deevis
Mar 25, 2024

For those arriving here late to the game, and using API calls...

In your CMD_FLAGS.txt ( or checked in Session => Boolean Command Line Flags )

# make the api available on port 5000
--api 
# enable the openai extension
--extensions openai
# start the web interface
--listen

Once running, these API calls allow you to check/list/modify models...

To check the currently loaded model:
GET /v1/internal/model/info ( then response.json()['model_name'] )

To list available models:
GET /v1/internal/model/list ( then response.json()['model_names'] )

To load a different model:
POST /v1/internal/model/load ( payload: {'model_name': model_name})

0 replies

bradylangdale · 2024-04-16T01:33:54Z

bradylangdale
Apr 16, 2024

@SalomonKisters anyone who comes across this thread.

Here's an example from a small project you use for reference.

Be sure to have the flags enabled that are mentioned above.

import requests

class Oobabooga:

    def __init__(self, ip='localhost', port='5000') -> None:
        self.ip = ip
        self.port = port

        self.url = f'http://{self.ip}:{self.port}/v1/'

    def get_models(self):
        response = requests.get(self.url + 'internal/model/list')
        return response.json()['model_names']
    
    def load_model(self, args_json={}):
        response = requests.post(self.url + 'internal/model/load/', json=args_json)
        return response.json()
    
    def unload_model(self, args_json={}):
        response = requests.post(self.url + 'internal/model/unload/', json=args_json)
        return response.json()
    
    def get_model_info(self):
        response = requests.get(self.url + 'internal/model/info')
        return response.json()
    
    def stop_generation(self):
        response = requests.post(self.url + 'internal/stop-generation')
        return response.json()
    
    def chat_completion(self, args_json={}, stream=False):
        if not stream:
            response = requests.post(self.url + 'chat/completions', json=args_json)
            return response.json()
        else:
            return requests.post(self.url + 'chat/completions',
                                 headers={ "Content-Type": "application/json" },
                                 json=args_json,
                                 verify=False,
                                 stream=True)
    
    def completion(self, args_json={}, stream=False):
        if not stream:
            response = requests.post(self.url + 'completions', json=args_json)
            return response.json()
        else:
            return requests.post(self.url + 'completions',
                                 headers={ "Content-Type": "application/json" },
                                 json=args_json,
                                 verify=False,
                                 stream=True)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change Model Via the API #1713

{{title}}

Replies: 4 comments 10 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

Select a reply

Change Model Via the API #1713

Replies: 4 comments · 10 replies

mckinlec Aug 10, 2023 Author

Replies: 4 comments 10 replies

mckinlec Aug 10, 2023
Author