中文版本
新功能
- 支持了Qwen/Qwen3.6-27B的魔搭官方服务,详情查看:https://www.modelscope.cn/organization/twinkle-kit
Bug修复
- 修复了expert权重同步错误的问题
- 修复了多lora场景下GRPO MoE训练崩塌的问题
- 修复了对多模态输入的序列切分问题
- 修复了pp > 1 和tp>1时服务器不正常的问题
- 修复了多处remote_function不正确的问题
- 修复了服务器训练模型上传和模型训练共用pipeline导致阻塞的问题
- 修复了采样器模块的一些bug
English Version
New Features
- Added support for Qwen/Qwen3.6-27B in the official ModelScope service. For details, see: https://www.modelscope.cn/organization/twinkle-kit
Bug Fixes
- Fixed an issue with incorrect expert weight synchronization.
- Fixed a training collapse issue with GRPO MoE in multi-LoRA scenarios.
- Fixed a sequence splitting issue for multimodal inputs.
- Fixed abnormal server behavior when pp > 1 and tp > 1.
- Fixed multiple incorrect remote_function implementations.
- Fixed a blocking issue caused by model upload and model training sharing the same pipeline on the server side.
- Fixed several bugs in the Sampler module.
What's Changed
- add base_layer suffix for expert weights by @hjh0119 in #159
- update cookbook and doc 0415 by @Yunnglin in #157
- Docs support Q3.6 by @tastelikefeet in #158
- Fix multi lora device by @tastelikefeet in #160
- Fix MoE multi-lora training by @tastelikefeet in #161
- Fix model id and upload to hub by @Yunnglin in #162
- Add notebooks by @tastelikefeet in #164
- Npu adapt megatron by @addsubmuldiv in #153
- Fix save by @tastelikefeet in #165
- A small refactor by @tastelikefeet in #166
- A small refactor, move 4d mask to processor by @tastelikefeet in #167
- Fix some potential bugs by @tastelikefeet in #168
- Fix some bugs by @tastelikefeet in #169
- fix mm tokentypeids splitting by @tastelikefeet in #170
- Fix model pp > 1 and tp > 1 errors by @Yunnglin in #171
- Fix moe weight sync by @tastelikefeet in #172
- update notebooks by @Yunnglin in #174
- Modify remote_function decorators in multi_lora_transformers by @xichengpro in #173
- support cp ,fix qwen3.5 gdn sp by @meichangsu1 in #138
- support qwen3.6 grpo & in-place add lora by @hjh0119 in #163
- Fix multi lora by @tastelikefeet in #177
- support q3.6-27b by @tastelikefeet in #178
- Fix sampler and grpo by @Yunnglin in #179
Full Changelog: v0.2.0...v0.2.1