🍭[Roadmap] ms-swift3.6-3.8 · Issue #4561 · modelscope/ms-swift · GitHub

8000 🍭[Roadmap] ms-swift3.6-3.8 · Issue #4561 · modelscope/ms-swift · GitHub

🍭[Roadmap] ms-swift3.6-3.8 #4561

Open

Labels

discussiongood first issue

opened

on Jun 11, 2025

模型

✅最新模型接入 P0
Omni pending
a. 部署支持输出音频
b. 支持talker的训练
All-to-All优化 pending
✅embedding: 支持推理、部署
✅reranker训练支持
a. 推理部署支持
序列分类: 多标签/回归支持量化

训练

RAY支持 P0
✅长文本ring attention
✅AutoTP P0
✅channel loss支持packing/padding_free
✅多模态packing优化
✅new_special_tokens支持
✅多模态packing/padding_free支持更多模型
✅混合模态训练支持更多模态
✅flash-attention-3
✅DFT

Megatron-SWIFT

新模型支持
a. ✅多模态: qwen2.5-VL/qwen2.5-Omni P0
b. ✅DeepSeekV3
c. Llama4
✅fp8
a. blockwise fp8 P0
✅LoRA支持
a. ✅MoE LoRA支持router训练
✅支持提前预处理数据集
RLHF支持 P0
a. GRPO
b. KTO
c. ✅DPO
✅bshd格式支持
swanlab支持 P0
✅loss_scale支持
分类/Embedding模型支持
Deepspeed集成
✅channel loss

RL

GRPO
a. ✅多轮AsyncEngine
b. Agent MCP
c. sglang
d. 效率对比benchmark
e. ✅多机rollout
f. ✅GSPO
g. ✅DeepEyes
✅MPO
✅GKD
DPO
a. ✅packing支持
b. ✅LD-DPO
RLOO P1
Reinforce++ P06.多模态PPO
KTO padding_free支持
RM 支持 pointwise 训练 P0

推理与部署

✅sglang接入推理部署
a. 多模态模型
vLLM支持分类模型和RM

量化导出

✅fp8/bnb支持多模态模型
✅qlora支持merge-lora P1
✅fp8量化

ms-swift3.9-3.11 roadmap: #5721

569C

Metadata

Assignees

No one assigned

Labels

discussiongood first issue

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

0