Support new arguments for exllamav3 Qwen3.5/3.6 MTP by alexhunt7 · Pull Request #421 · theroyallab/tabbyAPI

alexhunt7 · 2026-05-08T14:40:42Z

Is your pull request related to a problem? Please describe.
Support new arguments for exllamav3 Qwen3.5/3.6 MTP support.

Also adds draft acceptance metrics to the logs (in a separate commit).

Why should this feature be added?
MTP speeds things up tremendously. This supports adds support for configuring MTP in conjunction with turboderp-org/exllamav3#206

Examples
~2x token generation speed in Qwen3.6 27B on my 5070ti + 3060ti.

Additional context
Keeping as a draft until turboderp-org/exllamav3#206 is merged, but this is otherwise ready to go.

Adds draft_arch_override and num_draft_tokens to DraftModelConfig so Qwen3.5/3.6 BF16 directories can be loaded as MTP-only draft models (arch_override="Qwen3_5MTPDraftModel"). Threads both options through to Config.from_directory and AsyncGenerator. If draft_arch_override is set but draft_model_name is omitted, treat the main model_directory as the source for the draft model. This covers the case where the same checkpoint contains both the trunk and the mtp.* tensors — no need to point at a separate directory or extract the MTP head into its own dir.

alex-hunt-materialize and others added 2 commits May 8, 2026 15:33

log draft acceptance rate metrics

4744fdc

alexhunt7 mentioned this pull request May 8, 2026

MTP speculative decoding for Qwen3.5 / Qwen3.6 turboderp-org/exllamav3#206

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Support new arguments for exllamav3 Qwen3.5/3.6 MTP#421

Support new arguments for exllamav3 Qwen3.5/3.6 MTP#421
alexhunt7 wants to merge 2 commits into
theroyallab:mainfrom
alexhunt7:mtp

alexhunt7 commented May 8, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

alexhunt7 commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

alexhunt7 commented May 8, 2026 •

edited

Loading