Skip to content

Add Kimi-K2.5 INT4 vLLM v0.16.0 benchmark for MI300X#860

Open
functionstackx wants to merge 1 commit intomainfrom
claude/issue-859-20260303-0604
Open

Add Kimi-K2.5 INT4 vLLM v0.16.0 benchmark for MI300X#860
functionstackx wants to merge 1 commit intomainfrom
claude/issue-859-20260303-0604

Conversation

@functionstackx
Copy link
Contributor

@functionstackx functionstackx commented Mar 3, 2026

following AMD andy's recipe https://x.com/linluo77/status/2017024513595301985

Add single-node benchmark configuration for Kimi-K2.5 INT4 on MI300X using vLLM v0.16.0, following AMD Andy Luo's recipe. Based on the existing MI355X INT4 Kimi recipe with TP=8, concurrency 4-64.

Closes #859

Generated with Claude Code

Add single-node benchmark configuration for Kimi-K2.5 INT4 on MI300X
using vLLM v0.16.0, following AMD Andy Luo's recipe. Based on the
existing MI355X INT4 Kimi recipe with TP=8, concurrency 4-64.

Closes #859

Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
@functionstackx
Copy link
Contributor Author

#861

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

Status: No status

Development

Successfully merging this pull request may close these issues.

vllm 0.16 single node mi300 kimi k2.5 vllm tp8

1 participant