8000
We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fixes #38002. Sliding mask should not be applied when config.use_sliding_window = False
config.use_sliding_window = False
Added a small test as well
Sorry, something went wrong.
fix sliding attn
10df803
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.
Ready for review
Merge branch 'main' into fix-sliding-attn
3afee99
make style
df740a6
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
There was a problem hiding this comment.
The reason will be displayed to describe this comment to others. Learn more.
LGTM, and thank you for adding a test 🙏
Update tests/test_modeling_common.py
e6ff276
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
no a second throught, should default to True fo BC
True
1c35131
Hahah okay I think I reviewed a similar PR recently, saying we should isolate to models that can have sliding_window not defined in the config, but this is better I think. Mit supprised that moshi or mimi are affected as I did not know they had sliding but they prob do!
sliding_window
cb21bae
0a52bd2
[fix] sliding window attention mask (huggingface#38045)
c8cabee
* fix sliding attn * make style * Update tests/test_modeling_common.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * no a second throught, should default to `True` fo BC --------- Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
cd9c5a9
gante gante approved these changes
ArthurZucker ArthurZucker approved these changes
Successfully merging this pull request may close these issues.
use_sliding_window