🚀 We just released gpt-oss-20b-DFlash! #28
Pinned
jianc99
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Feel free to try it on SGLang. In our test, it consistently delivers 2x speedup across concurrency 1-32 on math, code and chat tasks. More details see https://huggingface.co/z-lab/gpt-oss-20b-DFlash
The DFlash draft models for Qwen3-Coder-Next and gpt-oss-120b is on the way.
Beta Was this translation helpful? Give feedback.
All reactions