-
Notifications
You must be signed in to change notification settings - Fork 353
GPTQ test #1179
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
sugunav14
wants to merge
48
commits into
main
Choose a base branch
from
svelury/gptq-vq-f
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
GPTQ test #1179
Changes from all commits
Commits
Show all changes
48 commits
Select commit
Hold shift + click to select a range
1fee97c
add rabbit feedback
Fridah-nv 3f717dd
minor
Fridah-nv 971b168
tested perplexity
sugunav14 10c16ca
tested, revert later
sugunav14 364fd78
tested
sugunav14 5aee517
refactor
sugunav14 4b1e42f
Track global_amax for weight FP4 MSE sweep; Refactor to NVFP4StaticQa…
realAsma 6a15d0d
address reviewers feedback, delegate scaling factor calculation to NV…
Fridah-nv 7b7146b
tested perplexity
sugunav14 40c14ef
tested exported checkpoints on 0211
sugunav14 7a1e006
tested nano v3
sugunav14 e6df379
added activation MSE logging
sugunav14 b81fed8
super v3 run
sugunav14 f3a9524
added activationmse logging helper
sugunav14 22e2b95
input amax sync added + tested gptq super sft checkpoint
sugunav14 10d21ba
checkpoints generated on 0223
sugunav14 188fa1d
tested perplexity
sugunav14 599227e
tested, revert later
sugunav14 60df0d8
tested
sugunav14 f88ba6e
initial cleanup
sugunav14 7b24cd3
cleanup
sugunav14 b17b917
removed stray prints
sugunav14 8ff8976
fix rebase issues
sugunav14 5815ce8
minor
sugunav14 b1f1434
tested e2e on qwen
sugunav14 df6b182
removed perplexity eval
sugunav14 75a08fe
update
sugunav14 9e58a6f
revert later
sugunav14 16086c7
minor update
sugunav14 9b47e77
update
sugunav14 4ec2433
gptq faster
sugunav14 2b0af3d
added metrics files, remove later
sugunav14 ee40b48
claude review
sugunav14 a175178
remove stray files
sugunav14 a948497
refactor
sugunav14 7e235b4
claude review + coderabbit review
sugunav14 d1498be
refactor
sugunav14 d8b1d93
stray changes removed
sugunav14 19fc0c2
Address PR comments
sugunav14 068e8a9
fixed circular import issue
sugunav14 2930b55
tested e2e on qwen3-8b
sugunav14 b35bc85
tested e2e on qwen3-8b
sugunav14 0f621cd
latest run with export
sugunav14 af59a55
clean up
sugunav14 d52d614
update
sugunav14 c705c24
updated e2e test
sugunav14 6d8f9b3
new perplexity eval
sugunav14 224f77d
update
sugunav14 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This plugin-config export bypass skips the normal artifact cleanup.
By bypassing
export_quantized(), this branch never restores the tokenizer's original padding settings and never copies custom model files/configs. That makes the saved artifact materially different from the standard export path, especially fortrust_remote_codemodels.🤖 Prompt for AI Agents