Implement GGUF metadata KV overrides #1011

phiharri · 2023-12-14T21:06:23Z

Quick implementation of KV overrides. It accepts strings in the llama.cpp --kv_overrides format, KEY=TYPE:VALUE, for example llama.expert_used_count=int:3. Multiple overrides may be space-separated.
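To illustrate the string format described above, here is a minimal sketch of a parser for space-separated KEY=TYPE:VALUE items. The function name parse_kv_overrides and the accepted type names (int, float, bool) are assumptions made for this example, not the code from this PR.

```python
from typing import Dict, Union

OverrideValue = Union[bool, int, float]


def parse_kv_overrides(text: str) -> Dict[str, OverrideValue]:
    """Parse space-separated KEY=TYPE:VALUE items into a dict (hypothetical helper)."""
    overrides: Dict[str, OverrideValue] = {}
    for item in text.split():
        # Split "key=type:value" at the first "=" and then at the first ":".
        key, eq, typed = item.partition("=")
        type_name, colon, raw = typed.partition(":")
        if not key or not eq or not colon:
            raise ValueError(f"expected KEY=TYPE:VALUE, got {item!r}")
        if type_name == "int":
            overrides[key] = int(raw)
        elif type_name == "float":
            overrides[key] = float(raw)
        elif type_name == "bool":
            # Assumption: booleans are spelled "true" or "false".
            if raw not in ("true", "false"):
                raise ValueError(f"invalid bool value in {item!r}")
            overrides[key] = raw == "true"
        else:
            raise ValueError(f"unknown override type {type_name!r} in {item!r}")
    return overrides


print(parse_kv_overrides("llama.expert_used_count=int:3 example.flag=bool:true"))
```

Malformed items raise ValueError rather than being skipped, so a mistyped override is noticed before the model is loaded.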

Not that familiar with ctypes or the preferred types to use here.

Closes #1084

abetlen · 2024-01-14T04:13:18Z

Hey @phiharri, thanks for the contribution. I have a few changes to make, but the overall API seems correct. I think the kv overrides argument to the Llama class should just be a dictionary, with the types based on the Python types, but the string-based approach for the CLI args looks good.
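One way to picture the suggestion above is a dictionary whose value types select the override type. The sketch below turns such a dictionary back into the KEY=TYPE:VALUE string form. The helper name format_kv_overrides is hypothetical and this is not the merged implementation.

```python
from typing import Dict, Union


def format_kv_overrides(overrides: Dict[str, Union[bool, int, float]]) -> str:
    """Render a dict of overrides as space-separated KEY=TYPE:VALUE items (hypothetical helper)."""
    items = []
    for key, value in overrides.items():
        # bool must be tested before int because bool is a subclass of int in Python.
        if isinstance(value, bool):
            items.append(f"{key}=bool:{'true' if value else 'false'}")
        elif isinstance(value, int):
            items.append(f"{key}=int:{value}")
        elif isinstance(value, float):
            items.append(f"{key}=float:{value!r}")
        else:
            raise TypeError(f"unsupported override type for {key!r}: {type(value).__name__}")
    return " ".join(items)


print(format_kv_overrides({"llama.expert_used_count": 3, "example.flag": True}))
```

The order of the isinstance checks matters: testing int first would silently record True as the integer 1.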

abetlen · 2024-01-15T17:29:23Z

Looks good now, @phiharri thank you for the contribution!


Implement GGUF metadata overrides

3e1649b

phiharri mentioned this pull request Dec 14, 2023

add KV override field for llama.cpp loaders oobabooga/text-generation-webui#4925

Closed

Merge branch 'main' into kv_overrides

c1722c2

abetlen added a commit that referenced this pull request Dec 22, 2023

fix: incorrect bindings for kv override. Based on #1011

6d8bc09

abetlen added 3 commits December 22, 2023 17:03

Merge branch 'main' into kv_overrides

125f3a4

whitespace fix

0705ce5

Merge branch 'main' into kv_overrides

a1fa139

abetlen added 4 commits January 15, 2024 10:56

Merge branch 'main' into kv_overrides

92d6cf6

Fix kv overrides.

87af451

Fix pointer and pickle

9dd3a2b

Match llama.cpp kv_overrides cli argument

51dea62

abetlen merged commit 76aafa6 into abetlen:main Jan 15, 2024

abetlen pushed a commit that referenced this pull request Jan 24, 2024

fix: GGUF metadata KV overrides, re #1011 (#1116)

fe5d6ea

* kv overrides another attempt
* add sentinel element, simplify array population
* ensure sentinel element is zeroed
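The sentinel idea named in these commit notes can be sketched with ctypes: allocate one element more than the number of overrides and leave the last element zeroed so the consumer can detect the end of the array. The KVOverride structure, its field layout, and the TAG_* constants below are stand-ins invented for this example. They are not the actual llama.cpp bindings.

```python
import ctypes
from typing import Dict, Union

# Hypothetical tag values, chosen for this sketch only.
TAG_INT, TAG_FLOAT, TAG_BOOL = 0, 1, 2
KEY_SIZE = 128  # assumed fixed key buffer size


class OverrideValue(ctypes.Union):
    _fields_ = [
        ("int_value", ctypes.c_int64),
        ("float_value", ctypes.c_double),
        ("bool_value", ctypes.c_bool),
    ]


class KVOverride(ctypes.Structure):
    _fields_ = [
        ("key", ctypes.c_char * KEY_SIZE),
        ("tag", ctypes.c_int),
        ("value", OverrideValue),
    ]


def build_override_array(overrides: Dict[str, Union[bool, int, float]]):
    """Build a ctypes array of overrides terminated by a zeroed sentinel element."""
    count = len(overrides)
    # ctypes zero-initialises new arrays, so element `count` starts out zeroed.
    array = (KVOverride * (count + 1))()
    for i, (key, value) in enumerate(overrides.items()):
        encoded = key.encode("utf-8")
        if len(encoded) >= KEY_SIZE:
            raise ValueError(f"key too long: {key!r}")  # keep room for the NUL byte
        array[i].key = encoded
        if isinstance(value, bool):  # check bool before int
            array[i].tag = TAG_BOOL
            array[i].value.bool_value = value
        elif isinstance(value, int):
            array[i].tag = TAG_INT
            array[i].value.int_value = value
        elif isinstance(value, float):
            array[i].tag = TAG_FLOAT
            array[i].value.float_value = value
        else:
            raise TypeError(f"unsupported override type for {key!r}")
    # Make the sentinel explicit: an empty key marks the end of the array.
    array[count].key = b"\0"
    return array


arr = build_override_array({"llama.expert_used_count": 3})
print(len(arr), arr[0].key, arr[1].key)
```

The caller has to keep the returned array alive for as long as native code may read it, since ctypes frees the buffer once the Python object is garbage collected.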
