feat: add expert_wise_scale support for per-expert FP8 quantization in MoE models#35

Open

lifelongeeek wants to merge 2 commits intoaws-neuron:mainfrom

lifelongeeek:feat/expert-wise-scale

Commits on Feb 13, 2026

feat: add expert_wise_scale support for per-expert FP8 quantization in MoE models
lifelongeeek
committed
fix: read expert_wise_scale per-model instead of from global wrapper config
lifelongeeek
committed