feat: add expert_wise_scale support for per-expert FP8 quantization in MoE models #35

Open
lifelongeeek wants to merge 2 commits into aws-neuron:main from lifelongeeek:feat/expert-wise-scale

Commits

Commits on Feb 13, 2026