AMD/ROCm OCP Micro-scaling Format (mx-fp8/mx-fp4) Support #151360
Closed
18 commits
baf2e30
Enhance support for Float8 and Float4 data types in scaled_gemm and r…
3b0b22e
Update support for Float8 and Float4 data types in CUDA operations. A…
ade8224
Add support for MX GEMM in CUDA operations
85f45d9
Add HIP data type mapping for CUDA_R_4F_E2M1
be0378e
Enhance data scaling functions for Float4 and Float8 types in CUDA tests
43c0f41
Enhance bias checks in scaled matrix multiplication for CUDA
c54eb0f
Update ROCm version checks for Float4 and Float8 types in CUDA
5f032e7
Refactor data type handling in _scaled_mm_out_cuda for improved clarity
c83fc39
Enhance CUDABlas scaling support for ROCm and CUDA versions
348d2a2
Update scaling choice in CUDA Blas implementation to BlockWise1x32
33aabd3
lint
74d8c5c
Refactor row-wise scaling condition in CUDABlas implementation
b87ecc8
Refactor scaling attribute handling in scaled_gemm function
05cafc5
Merge branch 'main' into rocm_mx_fp4
petrex 84c267a
Fix scaling choice condition in _scaled_mm_out_cuda for BlockWise1x32…
84f218b
Add skip condition for Float8_e4m3fn in FP8Matmul tests on ROCm CI setup
39c34c5
Add skip condition for tests on ROCm platform in ReproTestsDevice class
73022c3
Fix formatting of PLATFORM_SUPPORTS_FP8 initialization in common_cuda.py
This check does not need to be inside the `#if defined(USE_ROCM) && !defined(HIPBLASLT_OUTER_VEC) && defined(HIPBLASLT_VEC_EXT)` guard.