CPU Optimizations for FP8 #2559
Open
vthumbe1503 wants to merge 46 commits into NVIDIA:main from vthumbe1503:cpu_fp8_optimizations
46 commits
93ee022 add all the optimizations (vthumbe1503)
06338bc [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
50de9cd requires_grad optimization (vthumbe1503)
5fee841 Merge branch 'cpu_fp8_optimizations' of github.com:vthumbe1503/Transf… (vthumbe1503)
4c79ac7 Merge branch 'main' into cpu_fp8_optimizations (vthumbe1503)
62b88e1 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
99494d7 test if commenting out requires_grad works (vthumbe1503)
b157f85 Merge branch 'cpu_fp8_optimizations' of github.com:vthumbe1503/Transf… (vthumbe1503)
2a7b627 Merge branch 'main' into cpu_fp8_optimizations (vthumbe1503)
b61a6a8 fix minor bug (vthumbe1503)
938651e Merge branch 'cpu_fp8_optimizations' of github.com:vthumbe1503/Transf… (vthumbe1503)
88dfdbd fix ci (vthumbe1503)
1526eea Merge branch 'main' into cpu_fp8_optimizations (vthumbe1503)
5809dcc missed a bug (vthumbe1503)
b3bd748 Merge branch 'cpu_fp8_optimizations' of github.com:vthumbe1503/Transf… (vthumbe1503)
30fecf2 Update transformer_engine/pytorch/csrc/quantizer.cpp (vthumbe1503)
1b0d497 fix some bugs pointed to by copilot (vthumbe1503)
138b7bf [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
eec1e86 linting error (vthumbe1503)
8169d9c fix the error (vthumbe1503)
6fefaf2 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
a5feaf9 fix the bug (vthumbe1503)
285dbff Merge branch 'cpu_fp8_optimizations' of github.com:vthumbe1503/Transf… (vthumbe1503)
afb2f23 get rid of the change (vthumbe1503)
3919cb8 fix the transpose shape bug (vthumbe1503)
fd36424 Merge branch 'main' into cpu_fp8_optimizations (vthumbe1503)
4668133 minor linter fix (vthumbe1503)
5a00652 fix lint (vthumbe1503)
739bbad fix linting error (vthumbe1503)
e8042c1 address copilot review comment regarding error check when both data a… (vthumbe1503)
1d323d7 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
da7fbf5 Merge branch 'main' into cpu_fp8_optimizations (vthumbe1503)
e2c7435 fix linting errors (vthumbe1503)
f4e2492 fix merge conflict (vthumbe1503)
beada36 missed a merge conflict (vthumbe1503)
06a72a2 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
5d21db2 final optimizations (vthumbe1503)
1dfd6fe Merge branch 'cpu_fp8_optimizations' of github.com:vthumbe1503/Transf… (vthumbe1503)
8c8dd20 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
c1acd62 fix ci error (vthumbe1503)
7f35b0b fix merge conflixt (vthumbe1503)
ca177ae Merge branch 'main' into cpu_fp8_optimizations (vthumbe1503)
1538fd9 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
710b581 address review comment from greptile (vthumbe1503)
8a57a75 fix merge conflixt (vthumbe1503)
7e4f093 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])