[Feature] Support MiniMax-M2.5 FP8 MoE inference on SM80 (A100/A800) #7723
Open
ZhijunLStudio wants to merge 4 commits into