Add chat format to support baichuan2 by caiyesd · Pull Request #936 · abetlen/llama-cpp-python

Conversation

@caiyesd
Contributor
@caiyesd caiyesd commented Nov 22, 2023

I found that this cool project doesn't support baichuan2, so I made this PR to add support for it.

What is added

I added a new chat format named "baichuan-2" to support the baichuan2 chat models.
I have only tested Baichuan2-7B-Chat; I am not sure whether Baichuan2-13B-Chat works.
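For reference, the prompt layout a Baichuan2-style chat format produces can be sketched as below. This is a minimal illustration, not the code merged in this PR: the `<reserved_106>`/`<reserved_107>` role tokens follow the Baichuan2 model card, and the helper name is hypothetical.

```python
# Hypothetical sketch of a Baichuan2-style chat prompt builder.
# Per the Baichuan2 model card, <reserved_106> marks a user turn and
# <reserved_107> an assistant turn; this is NOT the exact PR code.
def format_baichuan2_prompt(messages):
    prompt = ""
    for message in messages:
        if message["role"] == "system":
            # System text is prepended untagged.
            prompt += message["content"]
        elif message["role"] == "user":
            prompt += "<reserved_106>" + message["content"]
        elif message["role"] == "assistant":
            prompt += "<reserved_107>" + message["content"]
    # End with the assistant token so the model generates its reply next.
    return prompt + "<reserved_107>"
```

A chat format handler registered with the server would build such a prompt from the OpenAI-style `messages` list before handing it to the model.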

How to use it

  1. Convert the Baichuan2 model to Baichuan1 format.

see: https://github.com/baichuan-inc/Baichuan2

Replace lm_head.weight with the new one, as described in that repository.

  2. Use llama.cpp's convert.py to convert Baichuan2-7B-Chat to GGUF format.
python3 convert.py --outfile models/baichuan-2-7b-chat-baichuan-f16.gguf ../baichuan/Baichuan2-7B-Chat-Baichuan/
  3. Quantize the model, taking Q3_K_M as an example.
./quantize models/baichuan-2-7b-chat-baichuan-f16.gguf models/baichuan-2-7b-chat-baichuan-Q3_K_M.gguf Q3_K_M
  4. Launch the llama-cpp-python server and test it. Remember to pass --chat_format baichuan-2.
python3 -m llama_cpp.server --model models/baichuan-2-7b-chat-baichuan-Q3_K_M.gguf --host 0.0.0.0 --port 9000 --n_gpu_layers -1 --chat_format baichuan-2
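Once the server is up, it exposes an OpenAI-compatible endpoint. A minimal sketch of a test request follows; the host and port match the launch command above, everything else is Python stdlib, and the prompt text is just an example.

```python
import json
import urllib.request

def build_chat_request(prompt):
    # Build the JSON body for the OpenAI-compatible chat completions endpoint.
    return json.dumps({"messages": [{"role": "user", "content": prompt}]})

if __name__ == "__main__":
    body = build_chat_request("Who are you?").encode("utf-8")
    req = urllib.request.Request(
        "http://localhost:9000/v1/chat/completions",  # matches --host/--port above
        data=body,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        reply = json.load(resp)
        print(reply["choices"][0]["message"]["content"])
```

The server applies the selected chat format (here baichuan-2) to the `messages` list before generation, so the client never has to build the raw prompt itself.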


Signed-off-by: caiyesd <caiyesd@gmail.com>
@abetlen
Owner
abetlen commented Nov 22, 2023

@caiyesd thank you for the contribution!

@abetlen abetlen merged commit b8f29f4 into abetlen:main Nov 22, 2023