Skip to content

Latest commit

 

History

History
23 lines (13 loc) · 705 Bytes

README.md

File metadata and controls

23 lines (13 loc) · 705 Bytes

Hands-on-GEMM

A GEMM tutorial by Yi Zhang/张译 at NLP Lab., Northeastern University.

Performance

SGEMM 性能对比

Usage

src/cuda 文件夹下面,找到你想看性能的 gemm,记住那个名字,然后回到主项目文件夹下,首先mkdir build,然后输入 make benchmark_xxx

如你想看 double_buffer_yhs_refine_gemm.cu 这个矩阵乘的性能,就输入:

make benchmark_double_buffer_yhs_refine

然后二进制会出现在 bin 文件夹下面。

Tutorial

公众号链接:这里