add support for half precision gemm by bjarthur · Pull Request #32 · FluxML/NNlibCUDA.jl

bjarthur · 2021-11-16T21:48:42Z

in conjunction with FluxML/NNlib.jl#363, add support for half-precision gemm, for which a special kernel is provided by Nvidia. see JuliaGPU/CUDA.jl#1080

mcabbott · 2021-11-19T04:26:49Z

Why do you say this is needed in addition? It looks like an alternative path. But the existing method NNlib._batched_gemm!(::Type{<:CuArray}, ought to match Float16 (if NNlib.jl would let it be called).

What would be good to add here is tests using this precision. Which I think should test the user-facing batched_mul not the internal functions.

DhairyaLGandhi · 2021-11-19T06:03:28Z

Why would nnlib prevent it from getting called?

bjarthur · 2021-11-19T12:36:07Z

the current code actually works with Float16, but falls back to batched_mul_generic! where a loop is performed over the last dimension. so painfully slow. i thought about tests, but couldn't come up with a way to test that the batched nvidia kernel is called instead.

ToucheSir · 2021-12-12T06:29:38Z

Yup, the overriden method in NNlib uses BlasFloat, which does not include Float16. Now, one hang-up I see with this PR is that _batched_try_gemm! also only accepts BlasFloat. @bjarthur can you confirm this works locally without any errors?

bjarthur · 2022-02-22T23:07:38Z

indeed, it does work locally without any errors, otherwise i would not have submitted it! ;)

ToucheSir · 2022-02-22T23:36:25Z

Great, I think per @mcabbott's comment a test for this would be good :)

add support for half precision gemm

a7e6f2b

bjarthur mentioned this pull request Nov 16, 2021

permit NNlibCUDA to use Float16 FluxML/NNlib.jl#363

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

add support for half precision gemm#32

add support for half precision gemm#32
bjarthur wants to merge 1 commit into
FluxML:masterfrom
bjarthur:bja/float16

bjarthur commented Nov 16, 2021 •

edited

Loading

Uh oh!

mcabbott commented Nov 19, 2021

Uh oh!

DhairyaLGandhi commented Nov 19, 2021

Uh oh!

bjarthur commented Nov 19, 2021

Uh oh!

ToucheSir commented Dec 12, 2021

Uh oh!

bjarthur commented Feb 22, 2022

Uh oh!

ToucheSir commented Feb 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

bjarthur commented Nov 16, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mcabbott commented Nov 19, 2021

Uh oh!

DhairyaLGandhi commented Nov 19, 2021

Uh oh!

bjarthur commented Nov 19, 2021

Uh oh!

ToucheSir commented Dec 12, 2021

Uh oh!

bjarthur commented Feb 22, 2022

Uh oh!

ToucheSir commented Feb 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bjarthur commented Nov 16, 2021 •

edited

Loading