add support for half precision gemm #32
|
Why do you say this is needed in addition? It looks like an alternative path. But the existing method What would be good to add here is tests using this precision. Which I think should test the user-facing |
|
Why would nnlib prevent it from getting called? |
|
the current code actually works with Float16, but falls back to |
|
Yup, the overridden method in NNlib uses |
|
indeed, it does work locally without any errors, otherwise I would not have submitted it! ;) |
|
Great, I think per @mcabbott's comment a test for this would be good :) |
In conjunction with FluxML/NNlib.jl#363, add support for half-precision
gemm, for which NVIDIA provides a special kernel. See JuliaGPU/CUDA.jl#1080.