v0.2.1

Latest

Latest

tastelikefeet released this 22 Apr 13:45

· 4 commits to main since this release

593e567

中文版本

新功能

支持了Qwen/Qwen3.6-27B的魔搭官方服务，详情查看：https://www.modelscope.cn/organization/twinkle-kit

Bug修复

修复了expert权重同步错误的问题
修复了多lora场景下GRPO MoE训练崩塌的问题
修复了对多模态输入的序列切分问题
修复了pp > 1 和tp>1时服务器不正常的问题
修复了多处remote_function不正确的问题
修复了服务器训练模型上传和模型训练共用pipeline导致阻塞的问题
修复了采样器模块的一些bug

English Version

New Features

Added support for the official ModelScope service on Qwen/Qwen3.6-27B. For details, see: https://www.modelscope.cn/organization/twinkle-kit

Bug Fixes

Fixed an issue with incorrect expert weight synchronization.
Fixed a training collapse issue with GRPO MoE in multi-LoRA scenarios.
Fixed a sequence splitting issue for multimodal inputs.
Fixed abnormal server behavior when pp > 1 and tp > 1.
Fixed multiple incorrect remote_function implementations.
Fixed a blocking issue caused by the model upload and model training pipelines sharing the same pipeline on the server side.
Fixed several bugs in modules such as the Sampler.

What's Changed

add base_layer suffix for expert weights by @hjh0119 in #159
update cookbook and doc 0415 by @Yunnglin in #157
Docs support Q3.6 by @tastelikefeet in #158
Fix multi lora device by @tastelikefeet in #160
Fix MoE multi-lora training by @tastelikefeet in #161
Fix model id and upload to hub by @Yunnglin in #162
Add notebooks by @tastelikefeet in #164
Npu adapt megatron by @addsubmuldiv in #153
Fix save by @tastelikefeet in #165
A small refactor by @tastelikefeet in #166
A small refactor, move 4d mask to processor by @tastelikefeet in #167
Fix some potential bugs by @tastelikefeet in #168
Fix some bugs by @tastelikefeet in #169
fix mm tokentypeids splitting by @tastelikefeet in #170
Fix model pp > 1 and tp > 1 errors by @Yunnglin in #171
Fix moe weight sync by @tastelikefeet in #172
update notebooks by @Yunnglin in #174
Modify remote_function decorators in multi_lora_transformers by @xichengpro in #173
support cp ,fix qwen3.5 gdn sp by @meichangsu1 in #138
support qwen3.6 grpo & in-place add lora by @hjh0119 in #163
Fix multi lora by @tastelikefeet in #177
support q3.6-27b by @tastelikefeet in #178
Fix sampler and grpo by @Yunnglin in #179

Full Changelog: v0.2.0...v0.2.1

Contributors

meichangsu1, addsubmuldiv, and 4 other contributors

Assets 2