Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 … · robbiemu/llama.cpp@8960efd · GitHub

Commit 8960efd

Vulkan: Add f32 accumulator support to quantized mul mat to fix GLM4 32B incoherence (ggml-org#13607)

1 parent 725f23f commit 8960efd

1 file changed, +118 -108 lines changed
