[Bugfix] Allow fp8_e5m2 KV cache on W4A16 compressed-tensors models (V100/SM70) by rivetphilbot · Pull Request #49 · 1CatAI/1Cat-vLLM