[Bugfix] Allow fp8_e5m2 KV cache on W4A16 compressed-tensors models (V100/SM70) #49

Open
rivetphilbot wants to merge 44 commits into 1CatAI:main from rivetphilbot:p7-fp8-kv-ct-classifier-fix

Commits

Commits on Mar 21, 2026

Commits on Mar 30, 2026

Commits on Apr 2, 2026

Commits on Apr 9, 2026

Commits on Apr 10, 2026

Commits on Apr 18, 2026

Commits on Apr 26, 2026

Commits on May 1, 2026

Commits on May 8, 2026

Commits on May 9, 2026

Commits on May 13, 2026

Commits on May 14, 2026

Commits on May 17, 2026

Commits on May 18, 2026

Commits on May 20, 2026

Commits on May 25, 2026

Commits on May 27, 2026

Commits on May 28, 2026

Commits on Jun 1, 2026