sycl : port multi-column MMVQ from CUDA backend (~45% speculative decoding speedup on Intel Arc) #21845

Merged
ggerganov merged 1 commit into ggml-org:master from masonmilby:sycl-mmvq-multicol
Jun 5, 2026