Skip to content

solaslin/Tensile

 
 

Repository files navigation

A tool for creating a benchmark-driven backend library for GEMMs, GEMM-like problems (such as batched GEMM), N-dimensional tensor contractions, and anything else that multiplies two multi-dimensional objects together on a GPU.

See Tensile Wiki for documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Contributing

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages

  • Python 56.3%
  • C++ 37.4%
  • TeX 2.7%
  • CMake 1.7%
  • Shell 1.4%
  • Groovy 0.4%
  • Other 0.1%