Add Kimi K2.5 INT4 single-node MI325X vLLM benchmark (TP8)#857

Open
functionstackx wants to merge 3 commits into main from claude/issue-856-20260303-0249

functionstackx (Contributor) commented Mar 3, 2026

Following Andy Luo's (AMD) recipe: https://x.com/linluo77/status/2017024513595301985

Add a Kimi K2.5 INT4 single-node MI325X vLLM benchmark (TP8) using vLLM ROCm v0.16.0. It is based on the MI355X INT4 recipe and includes Andy Luo's (AMD) recipe comment.
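
For reviewers who want to picture the launch this benchmark wraps, here is a rough dry-run sketch, not the contents of the script in this PR. It only builds and prints a `vllm serve` command with tensor parallelism 8 (from the title). The model ID `moonshotai/Kimi-K2.5`, the port, and the helper name `build_cmd` are assumptions for illustration; the real script may use different values and extra flags.

```bash
#!/usr/bin/env bash
# Dry-run sketch: build the vLLM launch command and print it instead of running it.
set -euo pipefail

# Assumptions (not taken from this PR): model ID and port.
MODEL="${MODEL:-moonshotai/Kimi-K2.5}"
PORT="${PORT:-8000}"
TP=8  # TP8, as stated in the PR title

# Hypothetical helper: prints the command line as a single string.
build_cmd() {
  local cmd=(vllm serve "$MODEL" --tensor-parallel-size "$TP" --port "$PORT")
  echo "${cmd[*]}"
}

build_cmd
```

To actually launch, the printed command would presumably be run inside the `vllm/vllm-openai-rocm:v0.16.0` image on the 8-GPU MI325X node.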

Closes #856

Generated with Claude Code

- Add benchmark script benchmarks/single_node/kimik2.5_int4_mi325x.sh
  based on MI355X INT4 recipe with AMD Andy Luo's recipe comment
- Add kimik2.5-int4-mi325x-vllm config to amd-master.yaml using
  vllm/vllm-openai-rocm:v0.16.0 image
- Update perf-changelog.yaml

Closes #856

Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
chunfangamd (Collaborator) left a comment

lgtm

Development

Successfully merging this pull request may close these issues.

vllm 0.16 single node mi325 kimi k2.5 vllm tp8