
(feat): publish roll v0.2.0. #338

Merged
PanAndy merged 1 commit into main from sync/publish_v0.2.0
Feb 4, 2026


Collaborator

@PanAndy PanAndy commented Feb 4, 2026

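🚀 Highlights:

  • New model support: Qwen3-VL, Qwen3-MoE-VL, Qwen3-Omni, GLM-4.7
  • Partial GPU overlap between agentic training and rollout: GPUs left idle by training are switched over to rollout
  • DynamicSamplingScheduler refactored to use coroutines
  • New: FSDP2 Strategy
  • Training supports sequence packing and dynamic batching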
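
For readers unfamiliar with sequence packing, the idea can be illustrated with a minimal sketch. This is not ROLL's implementation; the function name `pack_sequences`, the `max_tokens` budget, and the first-fit-decreasing heuristic are all assumptions chosen for illustration. It groups variable-length sequences into bins so that each bin stays within a token budget, which reduces padding waste.

```python
from typing import List


def pack_sequences(lengths: List[int], max_tokens: int) -> List[List[int]]:
    """Hypothetical helper: pack sequence indices into bins of at most max_tokens.

    Uses the first-fit-decreasing heuristic: visit sequences from longest to
    shortest and place each into the first bin that still has room.
    """
    order = sorted(range(len(lengths)), key=lambda i: lengths[i], reverse=True)
    bins: List[List[int]] = []   # sequence indices per bin
    free: List[int] = []         # remaining token budget per bin

    for idx in order:
        n = lengths[idx]
        if n > max_tokens:
            raise ValueError(f"sequence {idx} ({n} tokens) exceeds max_tokens={max_tokens}")
        for b, room in enumerate(free):
            if n <= room:
                bins[b].append(idx)
                free[b] -= n
                break
        else:
            # No existing bin had room, so open a new one.
            bins.append([idx])
            free.append(max_tokens - n)
    return bins
```

Each returned bin would correspond to one packed row. A real trainer also has to keep attention from crossing sequence boundaries inside a row, which this sketch does not cover.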

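🚀 Main new features:

  • Rollout
    • DynamicSamplingScheduler refactored to use coroutines
    • Custom rollout pre/post processing, supporting dynamic sampling params, multi-stage generation, and ThinkingBudget control
    • SGLang: Strategy refactored; supports server mode, native onload/offload, in-flight FP8 quantized rollout, and multi-node deployment across machines
    • vLLM: DP/EP support; supports vllm==0.12.0
    • Provides the AgentNative Rollout paradigm: AgentNativeStepEnvManager + SokobanNativeEnv, with context management handled entirely by the env
    • Async Rollout Hang Detect: adds hang detection for asynchronous rollout to quickly locate the problematic env
    • Supports rollout dump & mock, making precision alignment of the forward/train stages more efficient
    • agentic pipeline supports train-val/rollout overlap
  • Training
  • Model Update optimizations: eliminate inter-machine redundancy, overlap weight conversion with NCCL broadcast, optimize H2D, and change the serial synchronization between PP stages to lock-based parallel synchronization
  • Asynchronous Feature
  • Pipeline recipe
    • VLM image tool use: DeepEyes, with tool calls overlapped with reward computation
  • Models: new model support for Qwen3-VL, Qwen3-MoE-VL, Qwen3-Omni-Thinker, GLM-4.7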
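
As a rough illustration of what hang detection for asynchronous rollout can look like, the sketch below wraps each env step in a timeout and reports which envs did not finish. It is an assumption-laden example rather than ROLL's actual mechanism: `find_hung_envs`, the `env_steps` mapping, and `timeout_s` are hypothetical names, and only the standard-library `asyncio` API is used.

```python
import asyncio
from typing import Awaitable, Callable, Dict, List


async def find_hung_envs(
    env_steps: Dict[str, Callable[[], Awaitable[object]]],
    timeout_s: float,
) -> List[str]:
    """Hypothetical helper: run every env step concurrently and return the
    ids of envs whose step did not complete within timeout_s seconds."""

    async def guarded(env_id: str, step: Callable[[], Awaitable[object]]):
        try:
            # wait_for cancels the step and raises TimeoutError on expiry.
            await asyncio.wait_for(step(), timeout=timeout_s)
            return None
        except asyncio.TimeoutError:
            return env_id

    results = await asyncio.gather(
        *(guarded(env_id, step) for env_id, step in env_steps.items())
    )
    # gather preserves input order, so hung ids come back in dict order.
    return [env_id for env_id in results if env_id is not None]
```

Reporting the env id rather than only raising an error is what lets a stuck environment be pinpointed quickly, which is the stated goal of the feature.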

Co-Authored-By: chengengru.cgr <chengengru.cgr@taobao.com>
Co-Authored-By: fengjingxuan.fjx <fengjingxuan.fjx@alibaba-inc.com>
Co-Authored-By: ft498870 <ft498870@taobao.com>
Co-Authored-By: heyancheng.hyc <heyancheng.hyc@taobao.com>
Co-Authored-By: hongzhen.yj <hongzhen.yj@alibaba-inc.com>
Co-Authored-By: huangju.hj <huangju.hj@alibaba-inc.com>
Co-Authored-By: jiamang.wang <jiamang.wang@alibaba-inc.com>
Co-Authored-By: scott.lxy <scott.lxy@taobao.com>
Co-Authored-By: shenjingyu.sjy <shenjingyu.sjy@alibaba-inc.com>
Co-Authored-By: shenliao.sla <shenliao.sla@taobao.com>
Co-Authored-By: tianhe.lzd <tianhe.lzd@alibaba-inc.com>
Co-Authored-By: weixun.wwx <weixun.wwx@alibaba-inc.com>
Co-Authored-By: wzy496492 <wzy496492@alibaba-inc.com>
Co-Authored-By: xiongshaopan.xsp <xiongshaopan.xsp@alibaba-inc.com>
Co-Authored-By: xuehuanran.xhr <xuehuanran.xhr@alibaba-inc.com>
Co-Authored-By: zhaohaizhou.zhz <zhaohaizhou.zhz@alibaba-inc.com>
Co-Authored-By: bzd02333762 <bzd02333762@alibaba-inc.com>
Co-authored-by: beiyue.lj <beiyue.lj@alibaba-inc.com>
@PanAndy PanAndy merged commit 3077bef into main Feb 4, 2026
4 of 5 checks passed
@PanAndy PanAndy deleted the sync/publish_v0.2.0 branch February 4, 2026 08:52