
Kernel Generation

Compile CUDA code at runtime with NVRTC to generate PTX or a cubin. The CUDA source can come from Kernel Fusion.

Examples

Fusion examples:

  • Point-wise
  • Point-wise + reduction
  • Conv/gemm + point-wise
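To make the point-wise case concrete, here is a minimal sketch of how a chain of point-wise operations might be fused into a single kernel's source text before it is handed to NVRTC. The helper `make_fused_pointwise_kernel` and its expression-string convention are assumptions for illustration only, not the fusion mechanism this project uses.

```cpp
#include <string>
#include <vector>

// Hypothetical helper: emit CUDA source for one kernel that applies a chain
// of point-wise ops to each element. Each op is a C expression written in
// terms of the running value `v`, e.g. "v * 2.0f".
std::string make_fused_pointwise_kernel(const std::string& name,
                                        const std::vector<std::string>& ops) {
    std::string src;
    src += "extern \"C\" __global__ void " + name +
           "(const float* in, float* out, int n) {\n";
    src += "    int i = blockIdx.x * blockDim.x + threadIdx.x;\n";
    src += "    if (i < n) {\n";
    src += "        float v = in[i];\n";
    // Every op reads and rewrites `v` in a register, so the intermediate
    // results never go back to global memory.
    for (const std::string& op : ops) {
        src += "        v = " + op + ";\n";
    }
    src += "        out[i] = v;\n";
    src += "    }\n";
    src += "}\n";
    return src;
}
```

For example, `make_fused_pointwise_kernel("scale_relu", {"v * 2.0f", "fmaxf(v, 0.0f)"})` yields one kernel that scales and then clamps each element, rather than two kernels with an intermediate buffer between them. The `extern "C"` linkage keeps the kernel name unmangled, so it can be looked up by name after compilation.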

Related content

References