Gemma-4 MTP: skip pooling/sampling epilogue for MTP graphs (fixes decode crash) + widen DKQ=512 TILE routing to all gqa_ratio#25
Open
PhilEgly wants to merge 2 commits into
background
wait
wait-all
cancel
parallel
Loading