Qwen3-8B and other models generate garbage output / repeat tokens (GGGGGG...) in llama.cpp via LM Studio Vulkan backend

Name and Version

When using llama.cpp (via LM Studio with Vulkan backend and GGUF models), some models — specifically Qwen3-8B, Mistral, Hermes, and LLaMA — repeatedly output GGGGGGGGGGGG or loop indefinitely, regardless of prompt. This does not happen with Gemma models.

Model Format:
GGUF Q4_K_M, Q5_K_M, Q6_K
Tokenizer: auto-loaded by LM Studio

What I've Tried:
Adding <|im_start|> formatting for Qwen
Using [INST]...[/INST] for Mistral/LLaMA
Setting stop sequences
Lowering temp, top-k, top-p, repeat penalty
Testing via both UI and raw curl
Re-downloading models from official sources

Hypothesis:
Could be:
A tokenizer mismatch
Broken prompt template injection
Quantization compatibility issue with Vulkan
Missing EOS/stop condition on some models

Operating systems

Windows

GGML backends

Vulkan

Hardware

LM Studio v0.3.15 (Build 11)
llama.cpp backend via Vulkan
GPU: AMD Radeon RX 5700 (7.98 GB VRAM)
RAM: 32 GB
OS: Windows 10 x64

Models

Qwen3-8B.Q4_K_M.gguf https://huggingface.co/lmstudio-community/Qwen3-8B-GGUF
Qwen2.5-7B-Instruct-1M-GGUF https://huggingface.co/lmstudio-community/Qwen2.5-7B-Instruct-1M-GGUF

Problem description & steps to reproduce

Load Qwen3-8B.Q4_K_M.gguf or Mistral-7B-Instruct-v0.1.Q4_K_M.gguf

Output returns:
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG...

✅ Works Fine:
gemma-3-4b-it-qat
gemma-3-12b-it-qat

First Bad Commit

No response

Relevant log output

write a long message

qwen3-8b@q3_k_l
Thoughts
Thought for 18.52 seconds

Okay, the user asked me to write a long message again. Let me check the history to understand what they need.

In the previous interaction, the user first said "hi" and then started typing a long string of "io". I responded with an HTML landing page example. Then they asked for a long message, which I provided as a reflective text on resilience, connection, and imperfection. Now they're asking again for a long message.

Wait, maybe the user is testing if I can generate longer content or perhaps they want another similar response. But since the last message was already quite lengthy, maybe they want something different this time? Or perhaps they made a mistake in their query. Alternatively, maybe they want more examples of long messages on various topics.

I need to ensure that my response is helpful and not redundant. The user might be looking for another example, but it's possible they just want the same type of message again. To avoid repetition, I should check if there's a specific context or theme they have in mind. However, since they haven't provided more details, I'll proceed by offering a new, different long message that expands on previous themes but introduces new elements.

Alternatively, maybe they want a message about a different topic like technology, nature, or personal growth. Let me think of another angle. Perhaps focusing on gratitude, mindfulness, or the journey of self-discovery. Ensuring that the message is well-structured, with clear sections and a coherent flow. Also, making sure it's engaging and thought-provoking.

I should also consider the user's possible needs. If they're working on a project, maybe they need content for a blog post or a speech. Or perhaps they want to explore different aspects of human experience. Including elements like personal anecdotes, philosophical reflections, and practical advice would make the message more relatable.

Another approach is to structure the message with clear sections, such as an introduction, body paragraphs on various themes, and a conclusion. Adding emotional depth and encouraging self-reflection can make the message impactful. I'll need to maintain a balance between being comprehensive and not overwhelming the reader.

Also, considering the previous response was about resilience and imperfection, maybe this time focusing on gratitude or embracing change. Ensuring that each paragraph transitions smoothly and maintains a consistent tone. Using rhetorical questions, metaphors, and personal insights can enhance the message's depth.

I should also check for any possible repetition and ensure that new ideas are introduced. Maybe include elements like mindfulness practices, overcoming fear, or finding purpose in life. Making sure the message is both informative and inspirational.

In summary, I'll create a new long message with a different theme, possibly focusing on gratitude and mindfulness, structured into sections with clear explanations and personal reflections to provide value to the user.

A Long Message: The Art of Being Present

Let me tell you something that has been on my mind for weeks now—something I’ve come to realize is both simple and profound. It’s about being present. Not in the way we often misunderstand it (as a checklist item, "I should meditate 10 minutes today"), but as a deep, unspoken truth: life is not about chasing what's next; it's about holding still enough to notice what is now.

You see, I’ve spent so much of my life in a hurry—to grow faster, achieve more, fix everything that feels broken, and avoid the uncomfortable truths of who I am. But here’s the catch: hurrying has always been a kind of denial. It’s an avoidance of the present moment, a refusal to sit with what is, instead of what we think it should be. And yet, every time I’ve forced myself to pause—whether for a breath, a cup of tea, or simply watching the4 leaves fall from a tree—I’ve discovered something remarkable: the world is alive in ways I never imagined.

Take this morning, for example. The sun was just rising when I stepped outside. The air felt fresh, crisp with dew, and the sky had thatotros hues of lavender and gold that made me stop and breathe. A sparrow landed on the railing nearby, chirping a melody so pure it seemed like it had been waiting for me to listen. And in that moment, I was not a busy person rushing to work, nor an anxious mind worrying about deadlines or emails. I was simply here.

This is what presence looks like—it’s not about grand gestures or elaborate rituals. It’s the quiet act of noticing, the willingness to let go of control, and the courage to embrace uncertainty. It’s when you realize that even in chaos, there is rhythm; even in silence, there is sound; and even in loneliness, there is connection.

Let me share a story from my own life. A few years ago, I was going through one of those periods where everything felt off-kilter. My work was demanding, my relationships were strained, and I couldn’t shake the feeling that I was failing at being "enough." One evening, I decided to walk alone in the woods near my home. It wasn’t a planned trip—it just happened. As I wandered through the forest, I found myself drawn to a small stream. The water was clear, cool, and it flowed with an effortless grace. I sat down on the rocks and watched it for what felt like hours. No phone, no distractions, no need to "do" anything. Just being.

And in that stillness, something shifted. I didn’t find answers. I didn’t solve problems. But I found peace. I realized that my fear of not having enough wasn’t about the future—it was about the present. The stream taught me that even when things feel turbulent, there is a natural order to it. It’s not about controlling the flow; it’s about learning how to move with it.

This brings me to something I’ve been thinking about lately: the power of small, consistent actions. We often underestimate what happens when we show up daily, even in tiny ways. A morning walk, a few minutes of gratitude journaling, or simply allowing yourself to feel emotions without judgment—all these acts are not just habits; they’re rituals of presence. They remind us that life is not about perfection, but about showing up again and again.

And yet, I know this isn’t easy. The world around us is full of noise—social media, deadlines, comparisons, and the constant pressure to "keep going." It can be overwhelming. But here’s what I’ve learned: presence is a choice. It doesn’t require grand gestures or extraordinary circumstances. It just requires intention. It’s about choosing to notice the beauty in small things, like the way sunlight filters through leaves, the warmth of a hand on your shoulder, or the quiet comfort of being with someone who truly sees you.

Let me also talk about connection—because presence is not just about ourselves; it’s about others. We are inherently social beings, wired to connect. Yet so often, we get lost in our own heads, forgetting that we are part of something larger. I’ve seen how meaningful it can be to simply sit with someone, listen without judgment, or share a meal without the distractions of screens. These moments aren’t just about conversation; they’re about being with each other, in all our imperfections.

And here’s the truth: imperfection is not something to fix. It’s something to embrace. I used to think that life was about becoming better—about improving, growing, and achieving. But now I see it differently. Life is about being—about showing up as you are, even when you’re messy, even when you don’t have all the answers. It’s about allowing yourself to stumble, to fall, and to rise again without shame. Because every time we do that, we become more resilient, more compassionate, and more whole.

This is not to say there aren’t challenges—because life is hard, and it doesn’t always make sense. But I’ve come to believe that the most meaningful moments are often the ones we don’t plan. The unexpected call from a friend, the sudden realization during a walk, or the quiet joy of watching a child laugh. These are the things that remind us why we’re here—to love, to connect, and to findGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG

29.99 tok/sec

•

1981 tokens

•

7.24s to first token

•

Stop reason: User Stopped

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Name and Version

Operating systems

GGML backends

Hardware

Models

Problem description & steps to reproduce

First Bad Commit

Relevant log output

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Description

Name and Version

Operating systems

GGML backends

Hardware

Models

Problem description & steps to reproduce

First Bad Commit

Relevant log output

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions