Better server params and fields #130
Merged: abetlen merged 11 commits into abetlen:main from Stonelinks:better-server-params-and-fields on May 7, 2023
Conversation
Force-pushed from 8b81fc8 to 114c9f3
Upvote. Thank you very much! 👍
Force-pushed from 5204a6a to 1c7cc8c
`model` is ignored, but currently marked "optional"... On the one hand we could mark it "required" to make it explicit in case the server ever supports multiple llamas at the same time, but we could also delete it since it's ignored. Decision: mark it required for the sake of OpenAI API compatibility. Out of all parameters, `model` is probably the most important one for people to keep sending even if it's ignored for now.
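For illustration only, a minimal sketch (not the PR's actual schema) of what a required but currently unused `model` field could look like, assuming a pydantic request model like the server's:

```python
# Minimal sketch, assuming a pydantic request model; the field description is an
# illustrative assumption, not the PR's exact code.
from pydantic import BaseModel, Field

class CreateCompletionRequest(BaseModel):
    # Required for OpenAI API compatibility, even though a single-model server
    # currently ignores the value.
    model: str = Field(description="The model to use for the completion (currently ignored).")
```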
`n`, `presence_penalty`, `frequency_penalty`, `best_of`, `logit_bias`, `user`: not supported and excluded from the calls into llama. Decision: delete them.
I think this is actually supported (it's in the arguments of `Llama.__call__`, which is how the completion is invoked). Decision: mark it as supported.
`llama.create_chat_completion` definitely has a `top_k` argument, but it's missing from `CreateChatCompletionRequest`. Decision: add it.
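A hedged sketch of what adding `top_k` to the chat request schema could look like; the default and bounds below are assumptions, not necessarily the PR's values:

```python
from pydantic import BaseModel, Field

class CreateChatCompletionRequest(BaseModel):
    # top_k already exists on Llama.create_chat_completion; exposing it here lets the
    # server forward it. The default and lower bound are illustrative assumptions.
    top_k: int = Field(default=40, ge=0, description="Limit sampling to the k most likely tokens.")
```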
Slight refactor for common fields shared between completion and chat completion
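One possible shape for that refactor, sketched under the assumption that shared pydantic `Field` constants are reused by both request models (the actual field names and defaults in the PR may differ):

```python
from pydantic import BaseModel, Field

# Shared Field definitions reused by both request models (values are assumptions).
temperature_field = Field(default=0.8, ge=0.0, le=2.0, description="Sampling temperature.")
top_p_field = Field(default=0.95, ge=0.0, le=1.0, description="Nucleus sampling probability mass.")

class CreateCompletionRequest(BaseModel):
    temperature: float = temperature_field
    top_p: float = top_p_field

class CreateChatCompletionRequest(BaseModel):
    temperature: float = temperature_field
    top_p: float = top_p_field
```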
When I generate a client, it breaks because it fails to process the schema of `ChatCompletionRequestMessage`. These changes fix that:
- I think `Union[Literal["user"], Literal["channel"], ...]` is the same as `Literal["user", "channel", ...]`
- It turns out the default value `Literal["user"]` isn't JSON serializable, so replace it with the plain string "user"
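A small sketch of both fixes together, using the role names from the example above as placeholders (the real schema's roles may differ):

```python
from typing import Literal
from pydantic import BaseModel, Field

class ChatCompletionRequestMessage(BaseModel):
    # Literal["user", "channel", ...] is equivalent to a Union of single-value Literals
    # and produces a cleaner enum in the generated OpenAPI schema.
    # The default is the plain string "user", not Literal["user"], so it serializes to JSON.
    role: Literal["user", "channel"] = Field(default="user")
    content: str = Field(default="")
```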
Force-pushed from 1c7cc8c to dbbfc4b
Not sure why this union type was here, but looking at llama.py, `prompt` is only ever processed as a string for completion. The union was breaking types when generating an OpenAPI client.
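Sketch of the resulting field, again assuming a pydantic request model with illustrative defaults:

```python
from pydantic import BaseModel, Field

class CreateCompletionRequest(BaseModel):
    # Previously a Union[str, List[str]]; llama.py only ever treats the completion prompt
    # as a single string, and the union broke OpenAPI client generation.
    prompt: str = Field(default="", description="The prompt to generate text from.")
```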
Force-pushed from 6396f7e to b9098b0
@Stonelinks great work! I want to merge this in but I'd like to keep the unsupported fields in the API as they were before. Otherwise I'm very happy with the addition of the
xaptronic pushed a commit to xaptronic/llama-cpp-python that referenced this pull request on Jun 13, 2023.
This is a lot, but the net is that the server's request parameters and fields are now described with `Field` annotations. Reasoning for individual decisions should be in the commit messages, with some more details here: Stonelinks#3

The result of all this should be better documentation from the swagger / openapi docs available from the server, as well as improved clients generated from `openapi.json` (up next).
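For example, once the server is running, the generated schema can be pulled down and fed to an OpenAPI client generator. The host and port below are assumptions (the FastAPI server's defaults may differ in your setup):

```python
# Assumes the server is running locally on port 8000 (e.g. via `python -m llama_cpp.server`);
# adjust the URL if your host/port differ.
import json
import urllib.request

with urllib.request.urlopen("http://localhost:8000/openapi.json") as resp:
    schema = json.load(resp)

# Inspect what generated clients will see: title and the documented endpoints.
print(schema["info"]["title"])
print(sorted(schema["paths"].keys()))
```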