Kernel Generation
Compiling CUDA code with NVRTC, and generate PTX or cubin. The CUDA code can come from Kernel Fusion.
实例
Fusion examples:
- Point-wise
- Point-wise + reduction
- Conv/gemm + point-wise
Compiling CUDA code with NVRTC, and generate PTX or cubin. The CUDA code can come from Kernel Fusion.
Fusion examples: