IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo function

dispatch_sm100_conv2d

def dispatch_sm100_conv2d[input_type: DType, filter_type: DType, output_type: DType, //, filter_is_fcrs: Bool = False, elementwise_lambda_fn: Optional[def[dtype: DType, width: Int, *, alignment: Int = 1](IndexList[2], SIMD[dtype, width]) capturing -> None] = None, has_residual: Bool = False](input: TileTensor[input_type, address_space=input.address_space, linear_idx_type=input.linear_idx_type, element_size=input.element_size], filter: TileTensor[filter_type, address_space=filter.address_space, linear_idx_type=filter.linear_idx_type, element_size=filter.element_size], output: TileTensor[output_type, address_space=output.address_space, linear_idx_type=output.linear_idx_type, element_size=output.element_size], symmetric_padding: IndexList[2], ctx: DeviceContext, source_ptr: OptionalReg[UnsafePointer[Scalar[output_type], MutAnyOrigin]] = None, beta: Float32 = 0)

Dispatch to SM100 structured conv2d with filter transpose.

This function gates the SM100 kernel import behind @parameter if on dtype, so the kernel is never compiled for unsupported dtypes.

Parameters:

Args:

Raises:

Error if kernel launch fails.