Compile bug: NVIDIA A800-SXM4-40GB ggml_cuda_init failed · Issue #13059 · ggml-org/llama.cpp · GitHub

Open
lld1995 opened this issue Apr 22, 2025 · 1 comment
Comments

@lld1995
lld1995 commented Apr 22, 2025

Git commit

current master

Operating systems

Linux

GGML backends

CUDA

Problem description & steps to reproduce

cmake .. -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="80;86;87;89;90"

Just build it and run.
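For reference, a typical out-of-source CUDA build of llama.cpp (per docs/build.md) looks roughly like this; the architecture list mirrors the one in the report above, and the build directory name is just a convention:

```shell
# Configure with the CUDA backend enabled
cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="80;86;87;89;90"

# Compile in Release mode
cmake --build build --config Release
```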

First Bad Commit

No response

Compile command

./llama-server --model /data/llama-model/DeepSeek-R1-GGUF-q4/DeepSeek-R1-UD-IQ1_M/DeepSeek-R1-UD-IQ1_M-00001-of-00004.gguf --ctx-size 26000 --n-gpu-layers 63 --host 0.0.0.0 --parallel 2 --cache-reuse 4096 --port 9990 --metrics --jinja -fa

Relevant log output

run error:
ggml_cuda_init: failed to initialize CUDA: system not yet initialized
warning: no usable GPU found, --gpu-layers option will be ignored
warning: one possible reason is that llama.cpp was compiled without GPU support
warning: consult docs/build.md for compilation instructions
@lld1995
Author
lld1995 commented Apr 22, 2025

I guess it may be a driver error...
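For what it's worth, "system not yet initialized" (cudaErrorSystemNotReady) on NVSwitch-based SXM parts such as the A800 usually points at the host environment rather than the compile: CUDA will not initialize until the NVIDIA Fabric Manager service is running. A quick checklist, assuming a systemd-based distro (service name may differ per driver package):

```shell
# First confirm the driver itself is healthy and sees the GPUs
nvidia-smi

# On A100/A800 SXM (NVSwitch) systems, CUDA init fails with
# "system not yet initialized" until nvidia-fabricmanager is up
systemctl status nvidia-fabricmanager

# Start it now and enable it across reboots if it was stopped
sudo systemctl enable --now nvidia-fabricmanager
```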

@github-actions github-actions bot added the stale label May 23, 2025