Skip to content

[ci] Add MTP cases and run the case according to P0/P1/P2#803

Merged
zejunchen-zejun merged 7 commits into
ROCm:mainfrom
junyyang-amd:junyyang/vllm-benchmark-p0-p1-p2
May 15, 2026
Merged

[ci] Add MTP cases and run the case according to P0/P1/P2#803
zejunchen-zejun merged 7 commits into
ROCm:mainfrom
junyyang-amd:junyyang/vllm-benchmark-p0-p1-p2

Conversation

@junyyang-amd
Copy link
Copy Markdown
Collaborator

Motivation

Because the geomean of some cases is lower, we run these cases on P0/P1/P2 to increase the frequency of P0.

@zejunchen-zejun
Copy link
Copy Markdown
Collaborator

zejunchen-zejun commented May 15, 2026

Hi @junyyang-amd 几个问题可以fix下:

  1. MTP case 跑不了,AWS-P1 里加了 4 个 MTP model,但 oot_benchmark_models.json 里没有对应的 variant,select_by_display() 匹配不到,会被静默跳过。需要在 oot_benchmark_models.json 里补上。
  2. 周日两次调度可能撞周一,周日 10:00 跑 P2+OOB(208 jobs,预计 ~22h),21:00 的 META-P1(160 jobs)会排队等前一个跑完,串联总计 ~38h,可能延伸到周一晚上和 AWS-P0 冲突。
  3. else 分支是decode code,weekday 0~6 已被前面的分支全部覆盖,else: selected_group = "SKIP" 不会被执行到

Copy link
Copy Markdown
Collaborator Author

@junyyang-amd junyyang-amd left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

更改了调度时间,将周六白天充分利用起来,预计周一凌晨能跑完。
oot_benchmark_models.json已覆盖4个MTP模型。

@zejunchen-zejun zejunchen-zejun merged commit 044d089 into ROCm:main May 15, 2026
24 of 30 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants