Development

Successfully merging this pull request may close these issues.

AssertionError in _log_rollout_data when training qwen3-vl-8B with true_on_policy_mode
