[Bugfix] Allow fp8_e5m2 KV cache on W4A16 compressed-tensors models (V100/SM70)#49
Open
rivetphilbot wants to merge 44 commits into
Open
[Bugfix] Allow fp8_e5m2 KV cache on W4A16 compressed-tensors models (V100/SM70)#49rivetphilbot wants to merge 44 commits into
rivetphilbot wants to merge 44 commits into