Remove Qwen Image Redundant RoPE Cache #12452

dg845 · 2025-10-09T01:12:12Z

What does this PR do?

This PR removes self.rope_cache in QwenEmbedRope, so that RoPE frequency caching is only done once through the functools.lru_cache decorator on _compute_video_freqs. It also changes the maxsize argument for lru_cache from None to 128 to prevent the cache from causing OOM errors.

Fixes #12401

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sayakpaul
@naykun
@chenxiao111222

HuggingFaceDocBuilderDev · 2025-10-09T01:20:11Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

@naykun could you also give it a look?

dg845 · 2025-10-09T04:29:58Z

One thing to note is that the current PR implementation (in 52cf252) isn't quite equivalent to the implementation in main when not compiling since frame is part of the cache key for the lru_cache, but not part of the cache key for self.rope_cache.

@naykun do you think changing the cache key to include frame makes sense? If frame is the same for most calls to _compute_video_freqs, then I think the two implementations are approximately equivalent.

sayakpaul · 2025-10-09T05:05:53Z

I think the use of a frame based notation is present to allow extension to videos easily. For images, this shouldn't matter at all because the idx is always a constant and since img_shapes is supplemented just once from the pipeline:

diffusers/src/diffusers/models/transformers/transformer_qwenimage.py

Line 569 in a519272

img_shapes: Optional[List[Tuple[int, int, int]]] = None,

diffusers/src/diffusers/pipelines/qwenimage/pipeline_qwenimage_edit.py

Line 752 in a519272

img_shapes = [

(example)

do you think changing the cache key to include frame makes sense? If frame is the same for most calls to _compute_video_freqs, then I think the two implementations are approximately equivalent.

For the rope_cache, I am not sure how much of a readability compromise that would be, though.

Refactor QwenEmbedRope to only use the LRU cache for RoPE caching

52cf252

sayakpaul approved these changes Oct 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove Qwen Image Redundant RoPE Cache #12452

Remove Qwen Image Redundant RoPE Cache #12452

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Remove Qwen Image Redundant RoPE Cache #12452

Are you sure you want to change the base?

Remove Qwen Image Redundant RoPE Cache #12452

Conversation

What does this PR do?

Who can review?

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!