Warn when speculative decoding may hurt throughput for MoE models #1313
Open
Shylin26 wants to merge 2 commits into