Support SPM infill by CISC · Pull Request #1492 · abetlen/llama-cpp-python

CISC · 2024-05-29T17:26:13Z

Add spm_infill option to perform infill in the Suffix/Prefix/Middle pattern instead of Prefix/Suffix/Middle as some models (like the new Codestral) prefer this.
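As a rough illustration of the two orderings, here is a minimal sketch. The function name `build_infill_tokens` and the token IDs are hypothetical placeholders, not the library's internals; the only thing taken from the description above is that SPM places the suffix block before the prefix block.

```python
from typing import List


def build_infill_tokens(
    prefix_tokens: List[int],
    suffix_tokens: List[int],
    prefix_id: int,
    suffix_id: int,
    middle_id: int,
    spm_infill: bool = False,
) -> List[int]:
    """Assemble a fill-in-the-middle prompt in PSM or SPM order."""
    prefix_part = [prefix_id] + prefix_tokens
    suffix_part = [suffix_id] + suffix_tokens
    # SPM puts the suffix block first; PSM puts the prefix block first.
    if spm_infill:
        ordered = suffix_part + prefix_part
    else:
        ordered = prefix_part + suffix_part
    # Assumption: a negative middle_id means the vocab has no middle token.
    return ordered + ([middle_id] if middle_id >= 0 else [])


# Placeholder IDs (hypothetical): 1 = prefix marker, 2 = suffix marker, 3 = middle marker
psm = build_infill_tokens([10, 11], [20, 21], 1, 2, 3)
spm = build_infill_tokens([10, 11], [20, 21], 1, 2, 3, spm_infill=True)
print(psm)  # [1, 10, 11, 2, 20, 21, 3]
print(spm)  # [2, 20, 21, 1, 10, 11, 3]
```

Real marker IDs come from the model's vocabulary, so the numbers above only show the ordering.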

Also added a tokenizer hack to remove leading space in suffix to improve inference, tested on several different tokenizers/vocabs with success.
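The comment above does not spell out the mechanism, so the following is only one way such a hack could work, assuming a SentencePiece-style tokenizer that injects a leading space. `toy_tokenize`, `tokenize_suffix` and the sentinel character are hypothetical, and the fixed slice of 2 holds only for this character-level toy tokenizer.

```python
from typing import List


def toy_tokenize(text: str, add_space_prefix: bool = True) -> List[str]:
    """Hypothetical character-level tokenizer that mimics an injected space prefix."""
    if add_space_prefix:
        text = " " + text
    # Show spaces as the SentencePiece-style "▁" marker.
    return [ch.replace(" ", "▁") for ch in text]


def tokenize_suffix(suffix: str, sentinel: str = "☺") -> List[str]:
    """Tokenize a suffix without the tokenizer's injected leading space."""
    # Prepend a sentinel so the injected space attaches to it, not to the suffix.
    tokens = toy_tokenize(sentinel + suffix)
    # Drop the injected space piece and the sentinel piece (2 pieces in this toy).
    return tokens[2:]


naive = toy_tokenize("return x")
fixed = tokenize_suffix("return x")
print(naive[:3])  # ['▁', 'r', 'e']
print(fixed[:3])  # ['r', 'e', 't']
```

With a real vocab the sentinel may map to a different number of pieces, so the slice length would need to be checked per tokenizer.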

CISC · 2024-05-29T17:46:55Z

I also see that mistral-inference adds BOS at the beginning of infill, so I guess we should too, but this requires the add_bos_token function to check whether BOS should be added or not. I added this in #1439, so if that gets merged first we can use that in this PR.

CISC · 2024-05-29T18:22:57Z

Actually, since I ended up not using them there let's make it simple and remove them from that PR and add them here.

This is identical behaviour to llama.cpp. I guess any model that doesn't use BOS is recent enough to have the add_bos_token metadata.
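A minimal sketch of that decision, assuming the flag lives under the GGUF key `tokenizer.ggml.add_bos_token` and that metadata values arrive as strings. The helper names `should_add_bos` and `with_bos` are hypothetical and not taken from this PR.

```python
from typing import Dict, List

# Assumption: the GGUF metadata key that carries the add_bos_token flag.
ADD_BOS_KEY = "tokenizer.ggml.add_bos_token"


def should_add_bos(metadata: Dict[str, str], bos_token_id: int) -> bool:
    """Decide whether to prepend BOS, defaulting to True when the flag is unknown."""
    if bos_token_id < 0:
        # The vocab has no BOS token, so there is nothing to add.
        return False
    value = metadata.get(ADD_BOS_KEY)
    if value is None:
        # Flag missing: add BOS anyway.
        return True
    # Metadata values are strings, so compare text; bool("false") would be True.
    return value.lower() == "true"


def with_bos(tokens: List[int], metadata: Dict[str, str], bos_token_id: int) -> List[int]:
    """Prepend BOS to the tokens when the metadata (or its absence) calls for it."""
    return ([bos_token_id] if should_add_bos(metadata, bos_token_id) else []) + tokens


print(with_bos([5, 6], {}, 1))                      # [1, 5, 6]
print(with_bos([5, 6], {ADD_BOS_KEY: "false"}, 1))  # [5, 6]
```

The string comparison matters because any non-empty string, including "false", is truthy in Python.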

CISC · 2024-05-30T21:33:13Z

My Codestral GGUFs are up for those who wish to test with this option.

NOTE: It requires ggml-org/llama.cpp#7644 to not insert garbage middle token!

abetlen · 2024-06-04T16:18:15Z

@CISC looks good overall, do you mind adding something minimal to /examples that would help me review this and demonstrate the FIM API?

CISC · 2024-06-04T17:02:32Z

Sure, no problem.

CISC · 2024-06-04T18:56:16Z

@abetlen Example added.

CISC added 2 commits May 29, 2024 19:22

Support SPM infill

7bb0b56

typo--

c5c056e

one less layer of parenthesis necessary

73a1e72

CISC added 7 commits May 29, 2024 20:26

new required internals

1f9abf8

manually add bos/eos if model requires it

c70483d

add bos even when unknown

e54e47e

This is identical behaviour to llama.cpp. I guess any model that doesn't use BOS is recent enough to have the add_bos_token metadata.

don't add bos/eos on non-infill pre-tokenized prompt

fce73d0

add tokenizer hack to remove leading space in suffix

b062762

I keep forgetting metadata are strings

5118dfa

check if bos exists

fa97f86

add example

7b80669

CISC and others added 5 commits June 5, 2024 10:21

Merge branch 'abetlen:main' into spm_infill

e034e9f

add cls/sep instead of bos/eos for WPM vocab

aab7d32

simplify

2d7cb7e

color-code filtered suffix

5a262c6

Merge branch 'main' into spm_infill

79803b5

abetlen closed this Jun 13, 2024

abetlen deleted the spm_infill branch June 13, 2024 07:38

abetlen restored the spm_infill branch June 13, 2024 07:38

abetlen reopened this Jun 13, 2024

abetlen merged commit dbcf64c into abetlen:main Jun 13, 2024

CISC deleted the spm_infill branch June 13, 2024 11:07

abetlen merged 16 commits into abetlen:main from CISC:spm_infill
