Feature request
It seems that if I want to load a base model with an adapter and query it, I have to use the `generate` route, which is the only one that allows specifying `adapter_id`:
```shell
curl 127.0.0.1:3000/generate \
    -X POST \
    -H 'Content-Type: application/json' \
    -d '{
        "inputs": "Was \"The office\" the funniest tv series ever?",
        "parameters": {
            "max_new_tokens": 200,
            "adapter_id": "tv_knowledge_id"
        }
    }'
```
but I can't use `v1/chat/completions`. Are you planning to support this?
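For illustration, here is a minimal sketch of what an adapter-aware chat request body could look like. Note the shape is an assumption: TGI does not currently accept an adapter on this route, and some servers instead reuse the OpenAI-style `model` field to name the adapter, which is the convention assumed here (`tv_knowledge_id` is the hypothetical adapter from the example above).

```python
import json

# Hypothetical OpenAI-style body for POST /v1/chat/completions.
# Assumption: the adapter is selected via the "model" field; this is
# NOT a documented TGI behavior, just one plausible API shape.
payload = {
    "model": "tv_knowledge_id",  # assumed: adapter name in place of base model
    "messages": [
        {"role": "user",
         "content": 'Was "The office" the funniest tv series ever?'}
    ],
    "max_tokens": 200,  # chat routes use max_tokens, not max_new_tokens
}

body = json.dumps(payload)
print(body)
```

The same body could then be POSTed to `127.0.0.1:3000/v1/chat/completions` with `Content-Type: application/json`, mirroring the `generate` call above.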
Motivation
Many people use `v1/chat/completions` and train LoRA adapters for it.
Your contribution
Maybe, if you're over capacity.