Support SPM infill #1492
Conversation
I also see that
Actually, since I ended up not using them there, let's make it simple: remove them from that PR and add them here.
This is identical behaviour to llama.cpp. I guess any model that doesn't use BOS is recent enough to have the add_bos_token metadata.
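For illustration only (not the PR's actual code), here is a minimal sketch of how the BOS decision could follow the model's metadata, assuming a plain dict of GGUF key/value pairs and the standard `tokenizer.ggml.add_bos_token` key:

```python
# Hypothetical sketch: decide whether to prepend BOS before infill tokens,
# mirroring llama.cpp's behaviour of honouring the model's GGUF metadata.
def should_add_bos(metadata: dict) -> bool:
    # Models that ship the add_bos_token key state explicitly whether BOS is wanted;
    # models without the key fall back to the default of adding BOS.
    return bool(metadata.get("tokenizer.ggml.add_bos_token", True))

# Example: a model whose metadata disables BOS, and one without the key.
print(should_add_bos({"tokenizer.ggml.add_bos_token": False}))  # False
print(should_add_bos({}))                                       # True (default)
```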
My Codestral GGUFs are up for those who wish to test with this option. NOTE: It requires ggml-org/llama.cpp#7644 to not insert garbage.
@CISC looks good overall, do you mind adding something minimal to
Sure, no problem.
@abetlen Example added.
Add spm_infill option to perform infill in the Suffix/Prefix/Middle pattern instead of Prefix/Suffix/Middle, as some models (like the new Codestral) prefer this. Also added a tokenizer hack to remove the leading space in the suffix to improve inference; tested on several different tokenizers/vocabs with success.
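For readers who want to try it, a rough usage sketch (not taken from the PR itself): it assumes spm_infill is accepted as a Llama constructor flag as described above, and uses the existing prompt/suffix arguments of create_completion; the model path is a placeholder.

```python
from llama_cpp import Llama

# Hypothetical usage sketch: enable SPM (Suffix/Prefix/Middle) infill ordering.
# "codestral.gguf" is a placeholder path; spm_infill is the option this PR adds.
llm = Llama(
    model_path="codestral.gguf",
    spm_infill=True,  # order infill tokens Suffix/Prefix/Middle instead of Prefix/Suffix/Middle
)

# Fill in the middle of a function: `prompt` is the code before the hole,
# `suffix` is the code after it.
out = llm.create_completion(
    prompt="def add(a, b):\n    ",
    suffix="\n    return result\n",
    max_tokens=32,
)
print(out["choices"][0]["text"])
```

Under SPM ordering the suffix tokens are placed before the prefix tokens in the assembled infill prompt, which is the layout Codestral-style FIM-trained models expect.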