Skip to content

Optimize matmuls#2

Merged
efoley merged 2 commits intomainfrom
optimize_matmul
Mar 20, 2025
Merged

Optimize matmuls#2
efoley merged 2 commits intomainfrom
optimize_matmul

Conversation

@efoley
Copy link
Copy Markdown
Owner

@efoley efoley commented Mar 20, 2025

No description provided.

efoley added 2 commits March 20, 2025 08:03
This gives about 30-40x speedup on Qwen2-0.5B-Instruct.
That is probably due to both threading and vectorization.
@efoley efoley merged commit 07093a2 into main Mar 20, 2025
4 checks passed
@efoley efoley deleted the optimize_matmul branch March 20, 2025 15:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant