feat: upgrade llama.cpp by wsxiaoys · Pull Request #645 · TabbyML/tabby · GitHub

Conversation

@wsxiaoys (Member) commented Oct 26, 2023:

This change will be merged once we've updated the GGUF files for all models listed at https://tabby.tabbyml.com/docs/models/.

Fixes TAB-281.

TextInferenceEngineImpl(owned<llama_model> model, owned<llama_context> ctx) :
    model_(std::move(model)),
    ctx_(std::move(ctx)) {
  batch_ = llama_batch_init(N_BATCH, 0);
@wsxiaoys (Member Author) commented:

The previous usage of the batch API in Tabby triggers a segmentation fault with the updated llama.cpp version. Roll back to llama_batch_get_one as a workaround; we'll revisit this when integrating continuous batching support.
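
For context, a minimal sketch of the workaround described above, assuming the llama.cpp C API of that era (the helper name and its locals are hypothetical, not the PR's actual code): instead of allocating a batch with llama_batch_init, the existing token buffer is viewed as a single-sequence batch via llama_batch_get_one and decoded directly.

#include "llama.h"
#include <vector>

// Hypothetical helper: evaluate prompt tokens with llama_batch_get_one,
// which views an existing buffer as a batch without allocating one, so
// there is no llama_batch_init/llama_batch_free pair to mismanage.
// n_past is the number of tokens already evaluated in the KV cache.
bool eval_prompt(llama_context* ctx, std::vector<llama_token>& tokens, int n_past) {
  llama_batch batch = llama_batch_get_one(
      tokens.data(),            // token buffer to decode
      (int32_t) tokens.size(),  // number of tokens in the batch
      n_past,                   // position of the first token
      0);                       // sequence id (single sequence)
  return llama_decode(ctx, batch) == 0;  // 0 indicates success
}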

    self.path_string("ggml/q8_0.gguf")
}

pub fn ggml_q8_0_v2_file(&self) -> String {
@wsxiaoys (Member Author) commented:

The updated llama.cpp requires re-converting all StarCoder models, so the file path gets a v2 suffix to keep forward compatibility.
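
As an illustration of this versioning scheme, a minimal Rust sketch (hypothetical, not Tabby's actual code; the helper and the v2 file name are assumptions, the real name being whatever ggml_q8_0_v2_file returns): a loader can prefer the re-converted v2 file and fall back to the legacy one, so model directories converted before the upgrade keep working.

use std::path::{Path, PathBuf};

// Hypothetical loader helper: prefer the v2 GGUF re-converted with the
// updated llama.cpp, falling back to the legacy file name for model
// directories converted before the upgrade.
fn pick_ggml_file(model_dir: &Path) -> PathBuf {
    let v2 = model_dir.join("ggml").join("q8_0_v2.gguf"); // assumed v2 name
    if v2.exists() {
        v2
    } else {
        model_dir.join("ggml").join("q8_0.gguf") // legacy pre-upgrade file
    }
}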

@wsxiaoys marked this pull request as ready for review on October 27, 2023 at 19:17.
@wsxiaoys merged commit f378405 into main on October 27, 2023.
@wsxiaoys deleted the upgrade-llama-cpp branch on October 27, 2023 at 19:18.