Add flag to apply DeduplicateHashedInitializersPass post graph surgery #2295

qti-ashimaj · 2025-12-24T05:59:34Z

Describe your changes

Add flag to apply DeduplicateHashedInitializersPass post graph surgery.
With the DeduplicateHashedInitializersPass, the VRAM usage for onnx static quantization increased multifold, hence adding an option to keep this pass.
For Qwen2.5-1.5B-Instruct model, using DeduplicateHashedInitializersPass needs ~58GB VRAM while without DeduplicateHashedInitializersPass needs only ~14GB VRAM for static quantization

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

(Optional) Issue link

add flag to apply DeduplicateHashedInitializersPass

6851d82

qti-ashimaj force-pushed the dev/qti-ashimaj/dedupinit branch from d8dc7cc to 6851d82 Compare December 24, 2025 05:59

qti-ashimaj marked this pull request as ready for review December 24, 2025 06:07

jambayk requested a review from xiaoyu-work January 1, 2026 12:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add flag to apply DeduplicateHashedInitializersPass post graph surgery #2295

Add flag to apply DeduplicateHashedInitializersPass post graph surgery #2295

Uh oh!

qti-ashimaj commented Dec 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Add flag to apply DeduplicateHashedInitializersPass post graph surgery #2295

Are you sure you want to change the base?

Add flag to apply DeduplicateHashedInitializersPass post graph surgery #2295

Uh oh!

Conversation

qti-ashimaj commented Dec 24, 2025

Describe your changes

Checklist before requesting a review

(Optional) Issue link

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant