chore: bump llama_cpp_python to 0.3.6 #2368
Conversation
Latest llama-cpp-python has different CMAKE_ARGS for CUDA, ROCm, and MPS support. You have to update the README, other docs, and tests, too. Look for …
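For illustration, a hedged sketch of how the renamed flags might look under the GGML_* naming that the 0.3.x releases pick up from llama.cpp's build rework; the exact spellings (especially for ROCm) should be verified against the llama-cpp-python README for the pinned version:

```shell
# Sketch only: the old LLAMA_*-prefixed CMake options were renamed to GGML_*
# in the llama.cpp build rework vendored by llama-cpp-python 0.3.x.
# Flag spellings below are assumptions to check against the upstream README.

# CUDA (previously -DLLAMA_CUBLAS=on / -DLLAMA_CUDA=on)
CMAKE_ARGS="-DGGML_CUDA=on" pip install --force-reinstall --no-cache-dir llama_cpp_python

# ROCm (previously -DLLAMA_HIPBLAS=on; newer llama.cpp trees may call this -DGGML_HIP=on)
CMAKE_ARGS="-DGGML_HIPBLAS=on" pip install --force-reinstall --no-cache-dir llama_cpp_python

# Metal / MPS on Apple silicon (previously -DLLAMA_METAL=on)
CMAKE_ARGS="-DGGML_METAL=on" pip install --force-reinstall --no-cache-dir llama_cpp_python
```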
The tests with reduced context size …
@Ian321 could you make changes to the tests that fail due to the reduced context size?
@alimaredia what kind of changes do you have in mind? This worked as expected with previous versions of llama-cpp-python, and now there seems to be a regression. I tested llama.cpp directly with a reduced ctx and it did not crash, so I'm planning on fixing it in llama-cpp-python and then bumping this PR (to hopefully 0.3.2).
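A rough sketch of that kind of direct check against llama.cpp (the model path, context size, and prompt are placeholders, not the values from the failing test):

```shell
# Hypothetical reproduction: run llama.cpp's CLI directly with a small context
# window to see whether a crash comes from llama.cpp itself or from the
# llama-cpp-python bindings. Paths and prompt are placeholders.
./llama-cli \
    -m ./models/merlinite-7b-lab-Q4_K_M.gguf \
    -c 512 \
    -n 64 \
    -p "Say hello."
```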
We just need CI to be passing here to ensure there are no regressions, since this is a non-trivial bump.
If you look at the failure here: https://github.com/instructlab/instructlab/actions/runs/11153487993/job/31003244761?pr=2368#step:15:338, some of the tests in https://github.com/instructlab/instructlab/blob/main/scripts/functional-tests.sh are failing. Those tests have to pass for every PR to merge, so we'd expect this PR to include changes to the tests in order to do the version bump. Removal of certain functional tests is on the table if properly justified.
Our downstream build pipeline is now configured to handle llama_cpp_python 0.2.75 and 0.3.1.
@alimaredia the tests fail because there is a bug, abetlen/llama-cpp-python#1759, for which I have provided a fix, abetlen/llama-cpp-python#1796. We just have to wait for the next release (where it's hopefully merged or fixed some other way). @tiran if you mean the changes to the pipeline, I only see some general cleanup and nothing that would affect this PR directly. The test that caught this should not be modified or removed, as it's what caught the above-mentioned bug and helped me submit a PR for it. The only thing I could recommend is to add a timeout to …. I will still rebase it, just in case I missed something.
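A minimal sketch of what such a timeout could look like in the functional tests (the ilab invocation and the 300-second limit are illustrative guesses, not the actual script contents):

```shell
# Illustrative only: wrap the step that can hang in coreutils `timeout` so CI
# fails fast instead of stalling. The command and the limit are assumptions.
if ! timeout 300 ilab model chat -qq "Hello"; then
    echo "chat did not complete within 5 minutes" >&2
    exit 1
fi
```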
My update regarding the downstream pipeline was for @nathan-weinberg and @alimaredia. We have an internal build pipeline that rebuilds all Python wheels from source. Some packages like llama-cpp-python need extra configuration to build correctly. llama-cpp-python 0.3 has deprecated some options and introduced new build flags. Our internal builds are now able to handle >=0.2.75 and 0.3.x.
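As a rough sketch, such a pipeline might branch on the requested version to pick the matching flag names (the version variable and the pre-0.3 flag spelling are assumptions, not the real pipeline configuration):

```shell
# Sketch of a version-aware rebuild step; LLAMA_CPP_PYTHON_VERSION and the
# pre-0.3 flag spelling are assumptions to adjust to the actual pipeline.
version="${LLAMA_CPP_PYTHON_VERSION:-0.3.6}"
case "${version}" in
    0.2.*) export CMAKE_ARGS="-DLLAMA_CUDA=on" ;;  # pre-rename option
    *)     export CMAKE_ARGS="-DGGML_CUDA=on"  ;;  # renamed GGML_* option in 0.3.x
esac
pip wheel --no-deps --no-binary :all: "llama_cpp_python==${version}"
```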
This pull request has merge conflicts that must be resolved before it can be merged.
@Ian321 I've set this for our 0.24.0 milestone. Since we were able to bump to 0.3.2 in ilab 0.23.0, we're hoping to follow up with this shortly after the release! cc @fabiendupont
This pull request has merge conflicts that must be resolved before it can be merged.
Signed-off-by: Ignaz Kraft <ignaz.k@live.de>
Nice solution, @Ian321. Much shorter than I thought.
InstructLab had been using an outdated version of llama_cpp_python that did not support models such as Mistral NeMo. This PR simply bumps that dependency to the latest version and updates the pipelines and documentation to use the new build flags.

Checklist:
- Commit messages follow conventional commits.
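After the bump lands, a quick sanity check that the installed binding matches the new pin might look like this (a generic sketch, not part of this PR):

```shell
# Sanity-check sketch: confirm which llama-cpp-python binding is installed.
pip show llama_cpp_python | grep '^Version'
python -c "import llama_cpp; print(llama_cpp.__version__)"
```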