perf(inference): chunked batched prefill for long prompts on Metal #188
Open
ohdearquant wants to merge 1 commit into