llama : bump max layers from 256 to 512 by ggerganov · Pull Request #8530 · ggml-org/llama.cpp
llama : bump max layers from 256 to 512 #8530


Merged: 2 commits into master on Jul 19, 2024

Conversation

ggerganov (Member)

fix #8528
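
The headline change is a one-line bump of the compile-time layer cap that sizes the fixed per-layer hyperparameter arrays (a sketch; the surrounding file is not shown in this thread):

```cpp
// The cap that sizes the fixed-size per-layer hparams arrays.
#define LLAMA_MAX_LAYERS  512 // previously 256
```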

@ggerganov force-pushed the gg/bump-max-layers branch from fac0355 to b268edf on July 17, 2024 07:04
slaren (Member) commented Jul 17, 2024

I would suggest moving the assert so that it applies only when the key is actually an array. That way it wouldn't affect other models. The assert should also be replaced with an exception to avoid crashing on invalid models. Ideally it would be a dynamic container instead of a fixed array, but that requires more changes.
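
For illustration, a minimal sketch of the suggestion, assuming a hypothetical loader helper (`set_per_layer_u32` and its signature are invented here; only `LLAMA_MAX_LAYERS` comes from the PR): the bounds check runs only for array-valued keys, and a `std::runtime_error` replaces the assert so an invalid model reports an error instead of crashing.

```cpp
#include <array>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

constexpr uint32_t LLAMA_MAX_LAYERS = 512; // bumped from 256 by this PR

// Hypothetical helper, not the actual llama.cpp loader code.
void set_per_layer_u32(
        std::array<uint32_t, LLAMA_MAX_LAYERS> & dst, // fixed-size hparams array
        const std::vector<uint32_t> & values,         // parsed from the GGUF key
        uint32_t n_layer,
        const std::string & key) {
    if (values.size() == 1) {
        // scalar key: broadcast to all layers, no bounds check in this branch
        dst.fill(values[0]);
        return;
    }
    // array-valued key: the only case that can overflow the fixed array,
    // so validate here and throw instead of asserting
    if (n_layer > LLAMA_MAX_LAYERS || values.size() < n_layer) {
        throw std::runtime_error(
            "invalid model: key '" + key + "' has " +
            std::to_string(values.size()) + " values for " +
            std::to_string(n_layer) + " layers (max " +
            std::to_string(LLAMA_MAX_LAYERS) + ")");
    }
    for (uint32_t i = 0; i < n_layer; ++i) {
        dst[i] = values[i];
    }
}
```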

ggerganov (Member, Author)

> I would suggest moving the assert so that it applies only when the key is actually an array. That way it wouldn't affect other models.

Wouldn't this loop potentially overflow the fixed array if we check only the array-valued keys?

https://github.com/ggerganov/llama.cpp/blob/b268edf87c3bb996feb88e2d8b2c85b38bff6f36/src/llama.cpp#L4061-L4063
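
The linked lines are not reproduced in the thread, but the concern can be sketched with approximate names (`n_head_arr` as the fixed per-layer array, `broadcast_scalar` invented for illustration): if the layer count is validated only for array-valued keys, a scalar key broadcast across `n_layer` entries can still write past the end of the array.

```cpp
#include <cstdint>

constexpr uint32_t LLAMA_MAX_LAYERS = 512;

// Sketch of the overflow concern: broadcasting a scalar value across all
// layers of a fixed-size array. Nothing clamps n_layer here, so a model
// with more than LLAMA_MAX_LAYERS layers makes the loop write out of
// bounds even though no array-valued key was involved.
void broadcast_scalar(uint32_t (&n_head_arr)[LLAMA_MAX_LAYERS],
                      uint32_t n_layer, uint32_t scalar_value) {
    for (uint32_t i = 0; i < n_layer; ++i) {
        n_head_arr[i] = scalar_value; // OOB once i >= LLAMA_MAX_LAYERS
    }
}
```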

slaren (Member) commented Jul 17, 2024

Yes, I assumed that it would only set the first element in that case. From what I can tell, only OpenELM ever uses more than the first element.
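
Under that assumption the scalar path cannot overflow, so only the array path needs validation. A hypothetical variant along those lines (names reused from the sketches above, not the merged code):

```cpp
#include <algorithm>
#include <cstdint>
#include <stdexcept>
#include <vector>

constexpr uint32_t LLAMA_MAX_LAYERS = 512;

// Scalar keys write only the first element, which is always in bounds;
// only the array-valued form (used by e.g. OpenELM for per-layer head
// counts) has to fit within LLAMA_MAX_LAYERS.
void set_per_layer_first_only(uint32_t (&n_head_arr)[LLAMA_MAX_LAYERS],
                              const std::vector<uint32_t> & values) {
    if (values.size() == 1) {
        n_head_arr[0] = values[0]; // scalar: first element only
        return;
    }
    if (values.size() > LLAMA_MAX_LAYERS) {
        throw std::runtime_error("per-layer array exceeds LLAMA_MAX_LAYERS");
    }
    std::copy(values.begin(), values.end(), n_head_arr);
}
```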

@ggerganov requested a review from slaren on July 17, 2024 07:50
@ggerganov merged commit d197545 into master on Jul 19, 2024 (54 checks passed)
@ggerganov deleted the gg/bump-max-layers branch on July 19, 2024 13:50
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 27, 2024
* llama : bump max layers from 256 to 512

* llama : replace asserts with exceptions
Successfully merging this pull request may close these issues:

* Bug: Can't quantize 405B Mega merge (#8528)