[V100/SM70] gemma-4 (31B + MTP) support: fully-FA hybrid attention (sliding-window + head_dim-512)#59
Closed
rivetphilbot wants to merge 25 commits into
Closed
[V100/SM70] gemma-4 (31B + MTP) support: fully-FA hybrid attention (sliding-window + head_dim-512)#59rivetphilbot wants to merge 25 commits into
rivetphilbot wants to merge 25 commits into