FP8 ONNX导出问题 by tjthereal · Pull Request #266 · ModelTC/MQBench

tjthereal · 2023-09-14T08:33:27Z

具体在fpemu.py文件中（MQBench/FP8_Emulator/ptyquant/cuda(cpp))有算子注册的问题，导致无法导出onnx模型

…_deploy_model.onnx

Update GPTQFakeQuantize

2.Modify the method of exporting Onnx and add checks for the exported Onnx files. 3.Fix the PTQ bug under sophgo_tpu backend.

github-actions · 2024-01-13T02:08:36Z

This PR has not received any updates in 120 days. Please reply to this issue if this still unresolved!

wangxc2006 and others added 30 commits October 9, 2022 15:14

test

c9e8504

support sophgo_tpu backend,initial ver

0491be5

update, fix some bug

ef3f30e

add deconv and refine SophgoTpuQuantizer

96683b6

add some verfiy code

e612cfd

for qat int8 release

99775c4

for qat int8 release

252ede6

fix linear+bn bug

cfddb1d

add int4&int8 mix prec func and infer net output shape in xxx_mqmoble…

21f39c0

…_deploy_model.onnx

fix int8 bug in int4 version

232ca1f

fix sub/abs op no fake quant node

1d09bc4

add some class and func to adapt to torch1.10_cpu and torch2.0.1_cpu

1f43b9d

commit message here

47c3cde

commit message here

2612562

qat gpt2

4642917

添加了FP8 fakequant以及修改了config以及prepare_by_platform中的一些问题”

99f2a16

QAT example

e57a418

[Feature] NLP trace support and example

459ed0b

hide STOCHASTIC

a6b2c12

Add GPTQ

505b832

Update gptq.py

96caec1

Update GPTQFakeQuantize

Add bert-base-uncased-mrpc GPTQ version

de5a1c1

Fixed two devices problem

892331d

Update gptq.py

73fc14e

gptq use academic backend

02c248f

fix model insert & delete info | fix deploy

dc22db9

nlp deploy

1530d26

1.Correcting the method for registering operators in MQBench.

5c6e623

2.Modify the method of exporting Onnx and add checks for the exported Onnx files. 3.Fix the PTQ bug under sophgo_tpu backend.

Complete merge

38a0c54

conflict fixed

195b522

zhengjin-xu11 and others added 2 commits September 14, 2023 11:33

add fast test before push

d489ab7

FP8 ONNX problem

4290bd7

github-actions Bot added the Stale label Jan 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FP8 ONNX导出问题#266

FP8 ONNX导出问题#266
tjthereal wants to merge 32 commits into
ModelTC:mainfrom
sophgo:fp8-modified

tjthereal commented Sep 14, 2023

Uh oh!

github-actions Bot commented Jan 13, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

tjthereal commented Sep 14, 2023

Uh oh!

github-actions Bot commented Jan 13, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants