cuda学习资料汇总
一 gemm
1.1 ampere
1.2 hopper
1.2.1 github中的使用cutlass和wgmma
https://github.com/NVIDIA/cutlass/blob/main/examples/cute/tutorial/wgmma_sm90.cu
1.2.2 TMA
(1)TMA cutlass
https://github.com/NVIDIA/cutlass/blob/main/examples/cute/tutorial/wgmma_sm90.cu
(1)TMA cutlass