docs : add Moondream2 pre-quantized link by ddpasa · Pull Request #13745 · ggml-org/llama.cpp · GitHub

docs : add Moondream2 pre-quantized link #13745


Merged 2 commits on May 25, 2025
Changes from 1 commit
Apply suggestions from code review
ngxson authored May 25, 2025
commit 40fb6540b6883d2a312db14a65b3d4772f71dfcf
4 changes: 2 additions & 2 deletions docs/multimodal.md
@@ -33,7 +33,7 @@ llama-server -hf ggml-org/gemma-3-4b-it-GGUF --no-mmproj-offload

## Pre-quantized models

- These are ready-to-use models, most of them come with `Q4_K_M` quantization by default. They can be found at the Hugging Face page of the ggml-org: https://huggingface.co/collections/ggml-org/gguf-vision-models-68244e01ff1f39e5bebeeedc
+ These are ready-to-use models, most of them come with `Q4_K_M` quantization by default. They can be found at the Hugging Face page of the ggml-org: https://huggingface.co/collections/ggml-org/multimodal-ggufs-68244e01ff1f39e5bebeeedc

Replaces the `(tool_name)` with the name of binary you want to use. For example, `llama-mtmd-cli` or `llama-server`

@@ -83,7 +83,7 @@ NOTE: some models may require large context window, for example: `-c 8192`
(tool_name) -hf ggml-org/Llama-4-Scout-17B-16E-Instruct-GGUF

# Moondream2 20250414 version
- (tool_name) -hf Hahasb/moondream2-20250414-GGUF
+ (tool_name) -hf ggml-org/moondream2-20250414-GGUF

```
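
For readers of this diff, the `(tool_name)` placeholder substitution described in the changed doc can be sketched as below. This is only an illustration: the variable names `tool_name`, `model_repo`, and `cmd` are hypothetical and not part of the documentation; the binary names and the model repository are the ones quoted in the diff. The snippet only prints the resulting command rather than running it, since running it would download the model.

```shell
# Hypothetical variables for illustration; not part of docs/multimodal.md.
tool_name="llama-mtmd-cli"   # the doc also names llama-server as an option
model_repo="ggml-org/moondream2-20250414-GGUF"

# Substitute the placeholder and print the command for inspection.
cmd="$tool_name -hf $model_repo"
echo "$cmd"
```

Copying the printed line into a terminal (with the llama.cpp binaries on `PATH`) is then a deliberate, separate step.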
