Conversation

@kaysonyu (Contributor) commented on Jan 4, 2026

Summary

This PR removes (comments out) the --apply-chat-template flag in scripts/run-qwen3-235B-A22B-sft.sh, aligning it with the fix made in PR #1307.

Root Cause

The --apply-chat-template flag should not be used in SFT training because:

  1. MultiTurnLossMaskGenerator.get_loss_mask() expects messages in list format (a list of dicts with role and content keys)
  2. When --apply-chat-template is enabled, the dataset-loading phase converts the messages into a single string via tokenizer.apply_chat_template()
  3. This causes get_loss_mask() to fail, since it needs to iterate over the message list to generate proper loss masks for the assistant responses
  4. tokenizer.apply_chat_template() should instead be called internally by MultiTurnLossMaskGenerator, not during data loading (see the sketch after this list)
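
To make the mismatch concrete, here is a minimal sketch. get_loss_mask below is a toy stand-in for MultiTurnLossMaskGenerator.get_loss_mask(), not the actual slime implementation, and the rendered string is only an approximation of what tokenizer.apply_chat_template() produces for a Qwen chat template:

```python
# Toy reproduction of the failure mode (names and template output are
# illustrative, not copied from the slime codebase).

messages = [
    {"role": "user", "content": "What is 2 + 2?"},
    {"role": "assistant", "content": "4"},
]

# With --apply-chat-template enabled, the dataset loader stores the result of
# tokenizer.apply_chat_template(messages, tokenize=False) -- a single string
# roughly like this, instead of the list above:
rendered = (
    "<|im_start|>user\nWhat is 2 + 2?<|im_end|>\n"
    "<|im_start|>assistant\n4<|im_end|>\n"
)

def get_loss_mask(sample):
    """Toy stand-in for MultiTurnLossMaskGenerator.get_loss_mask():
    keep loss only on assistant turns."""
    return [1 if turn["role"] == "assistant" else 0 for turn in sample]

print(get_loss_mask(messages))  # [0, 1]: per-turn masking works on the list
print(get_loss_mask(rendered))  # TypeError: string indices must be integers
```

Iterating over the pre-templated string yields single characters rather than message dicts, so any role-based masking logic breaks immediately.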

Changes

  • Comment out --apply-chat-template in scripts/run-qwen3-235B-A22B-sft.sh (line 52)
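
A rough sketch of the edit (the script's exact surrounding formatting may differ):

```diff
-    --apply-chat-template \
+    # --apply-chat-template \
```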

@kaysonyu force-pushed the fix-apply-chat-template branch from d5a1057 to 9b2ac54 on January 4, 2026 at 05:33
@zhuzilin merged commit 6883db8 into THUDM:main on Jan 5, 2026
@kaysonyu deleted the fix-apply-chat-template branch on January 5, 2026 at 10:48