multiple values for argument softmax_scale
#101603
Labels
module: fsdp
oncall: distributed
Add this issue/PR to distributed oncall triage queue
oncall: pt2
pt2d-triage-nov2024
triaged
This issue has been looked at a team member, and triaged and prioritized into an appropriate module
🐛 Describe the bug
Posting this on behalf of Mosaic
python attention.py
https://raw.githubusercontent.com/sashaDoubov/llm-foundry/sasha/repro-issue/llmfoundry/models/layers/attention.py
Error logs
Not an error but instead seeing this scary warning
How to remove the error
As a workaround you can comment out softmax_scale and the code then works https://gist.github.com/msaroufim/5fe1a5cf745e31baabeb62b8dce10c82 but that's not a real solution
Versions
n
cc @H-Huang @awgu @kwen2501 @wanchaol @fegin @fduwjj @wz337 @wconstab @d4l3k @c-p-i-o @zhaojuanmao @mrshenli @rohan-varma @chauhang @penguinwu @pritamdamania87 @satgera @gqchen @aazzolini @osalpekar @jiayisuse @XilunWu @tianyu-l @yf225 @ezyang @bdhirsh @anijain2305 @zou3519 @kiukchung @LucasLLC @ngimel
The text was updated successfully, but these errors were encountered: