
Mojo package

conv_sm100

SM100 Structured Convolution Kernels.

High-performance Conv2D kernels for NVIDIA Blackwell (SM100) GPUs. The convolution is computed as an implicit GEMM, with the hardware TMA im2col mode gathering the input patches. Reuses infrastructure from the sm100_structured matmul.

Supported: Conv2D forward propagation (fprop) with stride=1 and dilation=1, in BF16 or FP16.
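To make the implicit GEMM formulation concrete, here is a minimal pure-Python sketch of the index math for the stride=1, dilation=1 case. It is not the package's kernel and does not use TMA. It only shows how a Conv2D problem can be viewed as a GEMM of shape (M, N, K) whose left operand is an im2col matrix that is never materialized. The NHWC/KRSC layouts, the `pad` parameter, and the names `gemm_shape` and `conv2d_implicit_gemm` are assumptions made for illustration.

```python
def gemm_shape(n, h, w, c, k, r, s, pad=0):
    """Map a stride-1, dilation-1 Conv2D problem to GEMM dimensions (M, N, K)."""
    p = h + 2 * pad - r + 1  # output height
    q = w + 2 * pad - s + 1  # output width
    return n * p * q, k, r * s * c


def conv2d_implicit_gemm(x, wt, pad=0):
    """x is NHWC, wt is KRSC (assumed layouts, nested lists).

    Returns the output as an M x N matrix (rows: output pixels, cols: filters).
    """
    n, h, w, c = len(x), len(x[0]), len(x[0][0]), len(x[0][0][0])
    k, r, s = len(wt), len(wt[0]), len(wt[0][0])
    gemm_m, gemm_n, gemm_k = gemm_shape(n, h, w, c, k, r, s, pad)
    q = w + 2 * pad - s + 1   # output width
    pq = gemm_m // n          # output pixels per image

    def a(row, col):
        # Element of the virtual im2col matrix, computed on the fly.
        img, rem = divmod(row, pq)
        oh, ow = divmod(rem, q)
        fr, rem = divmod(col, s * c)
        fs, ch = divmod(rem, c)
        ih, iw = oh + fr - pad, ow + fs - pad
        if 0 <= ih < h and 0 <= iw < w:
            return x[img][ih][iw][ch]
        return 0  # zero padding outside the image

    def b(col, f):
        # Element of the filter matrix, flattened over (r, s, c).
        fr, rem = divmod(col, s * c)
        fs, ch = divmod(rem, c)
        return wt[f][fr][fs][ch]

    return [
        [sum(a(i, j) * b(j, f) for j in range(gemm_k)) for f in range(gemm_n)]
        for i in range(gemm_m)
    ]
```

In a real kernel the gather performed by `a(row, col)` is the part that a hardware im2col load would take over, so the full im2col matrix is never written to memory.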

Modules
