limit_concurrency · RossAlRed/llama-cpp-python@73ccef9 · GitHub

Commit 73ccef9

limit_concurrency
1 parent 3af7b21 commit 73ccef9

1 file changed: +4 -0 lines changed


llama_cpp/server/app.py

Lines changed: 4 additions & 0 deletions
@@ -157,6 +157,10 @@ class Settings(BaseSettings):
     )
     # Server Params
     host: str = Field(default="localhost", description="Listen address")
+    limit_concurrency: int = Field(
+        default=2,
+        description="Number of concurrent processes + 1"
+    )
     port: int = Field(default=8000, description="Listen port")
     interrupt_requests: bool = Field(
         default=True,
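
This commit only adds the `limit_concurrency` field; it does not show where the value is consumed. The sketch below is one plausible wiring, not the author's: it assumes the setting is forwarded to `uvicorn.run()`, which accepts a `limit_concurrency` keyword (the maximum number of concurrent connections or tasks before uvicorn starts answering with HTTP 503). `ServerSettings` and `uvicorn_kwargs` are hypothetical names, and a plain dataclass stands in for the pydantic `Settings` class so the example runs with the standard library alone.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass
class ServerSettings:
    """Hypothetical stand-in for the three server fields shown in the diff."""

    host: str = "localhost"
    limit_concurrency: int = 2  # default taken from the commit
    port: int = 8000


def uvicorn_kwargs(settings: ServerSettings) -> Dict[str, Any]:
    """Collect the keyword arguments one could pass to uvicorn.run().

    Assumption: the server is started through uvicorn; the commit itself
    does not confirm how the new field is used.
    """
    return {
        "host": settings.host,
        "port": settings.port,
        "limit_concurrency": settings.limit_concurrency,
    }


kwargs = uvicorn_kwargs(ServerSettings())
print(kwargs)

# With uvicorn installed, the call might look like:
#   uvicorn.run(app, **kwargs)
# where `app` is the ASGI application object.
```

Building the keyword dictionary separately keeps the mapping from settings to server arguments easy to inspect without starting a server.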
