Add json schema mode #1122

abetlen · 2024-01-23T23:38:39Z

Adds Anyscale-style JSON schema mode for all simple chat formats (e.g. chatml, llama2, alpaca, vicuna) as well as llava multimodal models. Source: https://docs.endpoints.anyscale.com/guides/json_mode/

Server Usage

This example uses the Python openai client, but any REST client will work.

import openai

client = openai.OpenAI(
    base_url="http://localhost:8000/v1", api_key="esecret_YOUR_API_KEY"
)

# Note: some arguments are not currently supported; the backend ignores them.
chat_completion = client.chat.completions.create(
    model="mistralai/Mixtral-8x7B-Instruct-v0.1",
    messages=[
        {
            "role": "system",
            "content": "You are a helpful assistant that outputs in JSON.",
        },
        {"role": "user", "content": "Who won the world series in 2020"},
    ],
    response_format={
        "type": "json_object",
        "schema": {
            "type": "object",
            "properties": {"team_name": {"type": "string"}},
            "required": ["team_name"],
        },
    },
    temperature=0.7,
)
print(chat_completion.model_dump())

{'id': 'chatcmpl-60994326-99f9-41f6-8fc0-ae9b08026521', 'choices': [{'finish_reason': 'stop', 'index': 0, 'logprobs': None, 'message': {'content': '{ "team_name": "Los Angeles Dodgers" } ', 'role': 'assistant', 'function_call': None, 'tool_calls': None}}], 'created': 1706391561, 'model': 'mistralai/Mixtral-8x7B-Instruct-v0.1', 'object': 'chat.completion', 'system_fingerprint': None, 'usage': {'completion_tokens': 16, 'prompt_tokens': 65, 'total_tokens': 81}}
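Since any REST client will work, the same request can be sketched with only the Python standard library. This is a minimal sketch, not part of the PR: it assumes the server exposes the OpenAI-compatible `/chat/completions` route under the `base_url` used above, and the helper names `build_chat_request` and `send` are hypothetical.

```python
import json
import urllib.request


def build_chat_request(base_url, messages, schema, model, temperature=0.7):
    """Build (but do not send) a POST request that asks for schema-constrained JSON.

    Hypothetical helper: the route below is assumed to be the
    OpenAI-compatible chat completions endpoint.
    """
    payload = {
        "model": model,
        "messages": messages,
        # Same response_format shape as in the openai client example.
        "response_format": {"type": "json_object", "schema": schema},
        "temperature": temperature,
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send the request and decode the JSON body (requires a running server)."""
    with urllib.request.urlopen(request) as resp:
        return json.loads(resp.read().decode("utf-8"))


schema = {
    "type": "object",
    "properties": {"team_name": {"type": "string"}},
    "required": ["team_name"],
}
request = build_chat_request(
    base_url="http://localhost:8000/v1",
    messages=[
        {"role": "system", "content": "You are a helpful assistant that outputs in JSON."},
        {"role": "user", "content": "Who won the world series in 2020"},
    ],
    schema=schema,
    model="mistralai/Mixtral-8x7B-Instruct-v0.1",
)
# With a server running locally: response = send(request)
```

Building the request separately from sending it keeps the payload easy to inspect before anything goes over the wire.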

Python Usage

>>> from llama_cpp import Llama
>>> llm = Llama(model_path="path/to/model.gguf", chat_format="chatml")
>>> llm.create_chat_completion(
    messages=[
        {
            "role": "system",
            "content": "You are a helpful assistant that outputs in JSON.",
        },
        {"role": "user", "content": "Who won the world series in 2020"},
    ],
    response_format={
        "type": "json_object",
        "schema": {
            "type": "object",
            "properties": {"team_name": {"type": "string"}},
            "required": ["team_name"],
        },
    },
    temperature=0.7,
)

{'id': 'chatcmpl-60994326-99f9-41f6-8fc0-ae9b08026521', 'choices': [{'finish_reason': 'stop', 'index': 0, 'logprobs': None, 'message': {'content': '{ "team_name": "Los Angeles Dodgers" } ', 'role': 'assistant', 'function_call': None, 'tool_calls': None}}], 'created': 1706391561, 'model': 'mistralai/Mixtral-8x7B-Instruct-v0.1', 'object': 'chat.completion', 'system_fingerprint': None, 'usage': {'completion_tokens': 16, 'prompt_tokens': 65, 'total_tokens': 81}}
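The message content comes back as a JSON string, so callers will usually want to decode it. The following is a minimal sketch, assuming the response has the dict shape shown in the output above; `parse_json_reply` is a hypothetical helper, and it only checks the schema's `required` keys rather than performing full JSON Schema validation.

```python
import json


def parse_json_reply(response, schema):
    """Decode the assistant message and check the schema's required keys.

    Hypothetical helper: `response` is assumed to be a chat completion dict
    like the one printed above.
    """
    content = response["choices"][0]["message"]["content"]
    data = json.loads(content)  # surrounding whitespace is tolerated
    missing = [key for key in schema.get("required", []) if key not in data]
    if missing:
        raise ValueError(f"reply is missing required keys: {missing}")
    return data


schema = {
    "type": "object",
    "properties": {"team_name": {"type": "string"}},
    "required": ["team_name"],
}
# Trimmed copy of the response shown above, used here in place of a live model.
response = {
    "choices": [
        {
            "finish_reason": "stop",
            "index": 0,
            "message": {
                "content": '{ "team_name": "Los Angeles Dodgers" } ',
                "role": "assistant",
            },
        }
    ]
}
result = parse_json_reply(response, schema)
print(result["team_name"])
```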

abetlen added 3 commits January 23, 2024 18:37

Add json schema mode

752287a

Merge branch 'main' into add-json-schema-mode

6ec15de

Add llava chat format support

25aa85b

abetlen merged commit d8f6914 into main Jan 27, 2024

abetlen deleted the add-json-schema-mode branch January 31, 2024 20:27
