How to check the number of tokens processed or the load of each expert in the Qwen3 MoE model during inference? · Issue #38147 · huggingface/transformers · GitHub

How to check the number of tokens processed or the load of each expert in the Qwen3 MoE model during inference? #38147

Closed

Opened on May 15, 2025

No description provided.
