Skip to content

v0.2.1

Latest

Choose a tag to compare

@tastelikefeet tastelikefeet released this 22 Apr 13:45
· 4 commits to main since this release

中文版本

新功能

  1. 支持了Qwen/Qwen3.6-27B的魔搭官方服务,详情查看:https://www.modelscope.cn/organization/twinkle-kit

Bug修复

  1. 修复了expert权重同步错误的问题
  2. 修复了多lora场景下GRPO MoE训练崩塌的问题
  3. 修复了对多模态输入的序列切分问题
  4. 修复了pp > 1 和tp>1时服务器不正常的问题
  5. 修复了多处remote_function不正确的问题
  6. 修复了服务器训练模型上传和模型训练共用pipeline导致阻塞的问题
  7. 修复了采样器模块的一些bug

English Version

New Features

  1. Added support for the official ModelScope service on Qwen/Qwen3.6-27B. For details, see: https://www.modelscope.cn/organization/twinkle-kit

Bug Fixes

  1. Fixed an issue with incorrect expert weight synchronization.
  2. Fixed a training collapse issue with GRPO MoE in multi-LoRA scenarios.
  3. Fixed a sequence splitting issue for multimodal inputs.
  4. Fixed abnormal server behavior when pp > 1 and tp > 1.
  5. Fixed multiple incorrect remote_function implementations.
  6. Fixed a blocking issue caused by the model upload and model training pipelines sharing the same pipeline on the server side.
  7. Fixed several bugs in modules such as the Sampler.

What's Changed

Full Changelog: v0.2.0...v0.2.1