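After cloning the repo, I compiled with nvcc gemm.cu -o gemm -arch sm_80 -I/cutlassIncludePath. gemm_v1 produces a maximum error on the order of 10^-1. gemm_v2 through gemm_v4 compile without problems but all fail at runtime, reporting a maximum error of -nan. Running ncu.sh also fails with an error.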