update default paths and disable offloading for AMD qwen3-4B training #1225

Vivicai1005 · 2025-12-26T06:40:33Z

This PR updates the training script scripts/run-qwen3-4B-amd.sh with two key changes:

Update Default Paths: Changed the default SLIME_DIR, MODEL_DIR, and DATA_DIR to /root to align with the actual model download location.
Disable Memory Offloading: Explicitly set --no-offload-train and --no-offload-rollout. Since the AMD MI300X GPU provides 192GB of VRAM, the model fits entirely in memory without offloading. Disabling this avoids the memory leak issue caused by frequent process group destruction/reloading on the AMD backend, while also accelerating training by eliminating overhead from host-device memory transfers.

update default paths and disable offloading for AMD qwen3-4B training

ba44850

Provide feedback