8000 Add Vectorized FP8 E4M3 by leslie-fang-intel · Pull Request #152417 · pytorch/pytorch · GitHub
[go: up one dir, main page]

Skip to content

Add Vectorized FP8 E4M3 #152417

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 10 commits into
base: gh/leslie-fang-intel/195/base
Choose a base branch
from

Conversation

leslie-fang-intel
Copy link
Collaborator
@leslie-fang-intel leslie-fang-intel commented Apr 29, 2025

Stack from ghstack (oldest at bottom):

Summary
This PR mainly adding the Vectorized<Float8_e4m3fn> class to support the vectorization of FP8 E4M3 with methods:

  • Convert to/from Vectorized<float>
  • Common vectorized methods like: mul, abs, eq and etc.

Test Plan

./build/bin/vec_test_all_types_AVX512 --gtest_filter=FP8E4M3Test.*

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @jerryzh168

[ghstack-poisoned]
Copy link
pytorch-bot bot commented Apr 29, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/152417

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Unrelated Failure

As of commit 84888d1 with merge base 9608e7f (image):

NEW FAILURE - The following job has failed:

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Apr 29, 2025
@leslie-fang-intel leslie-fang-intel marked this pull request as draft April 29, 2025 09:57
@leslie-fang-intel leslie-fang-intel added ciflow/trunk Trigger trunk jobs on your pull request topic: not user facing topic category labels Apr 29, 2025
[ghstack-poisoned]
[ghstack-poisoned]
[ghstack-poisoned]
@leslie-fang-intel leslie-fang-intel marked this pull request as ready for review April 30, 2025 09:45
[ghstack-poisoned]
Copy link
Collaborator
8000
@jgong5 jgong5 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Add UT?

@leslie-fang-intel
Copy link
Collaborator Author

Add UT?

Thanks for the comment. The test case has been added.

[ghstack-poisoned]
Copy link
Collaborator
@mingfeima mingfeima left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Generally LGTM, just the test case needs a little refinement.

One more thing, we don't do E5M2?

[ghstack-poisoned]
@leslie-fang-intel
Copy link
Collaborator Author

Generally LGTM, just the test case needs a little refinement.

One more thing, we don't do E5M2?

Thanks for comment. I think we will add E5M2 in next step.

@leslie-fang-intel leslie-fang-intel requested a review from jgong5 May 9, 2025 06:57
[ghstack-poisoned]
[ghstack-poisoned]
@leslie-fang-intel
Copy link
Collaborator Author

@pytorchbot merge

@pytorchmergebot
Copy link
Collaborator

Merge failed

Reason: Approvers from one of the following sets are needed:

  • superuser (pytorch/metamates)
  • Core Reviewers (mruberry, lezcano, Skylion007, ngimel, peterbell10, ...)
  • Core Maintainers (soumith, gchanan, ezyang, dzhulgakov, malfet, ...)
Details for Dev Infra team Raised by workflow job

Failing merge rule: Core Maintainers

[ghstack-poisoned]
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/trunk Trigger trunk jobs on your pull request module: cpu CPU specific problem (e.g., perf, algorithm) open source topic: not user facing topic category
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants
0