Mojo module

grouped_matmul_sm100_1d1d

`comptime` values

`WarpRole`

comptime WarpRole = WarpRole[False]

Structs

B200BlockScaledMatmulSmem

Functions

blackwell_block_scaled_matmul_tma_umma_warp_specialized: Launch grouped block-scaled matmul kernel on SM100.
blackwell_block_scaled_tma_umma_warp_specialized_kernel
consumer_main_loop
copy_accum_to_gmem
grouped_matmul_dynamic_scaled_nvfp4: Performs grouped matrix multiplication with NVFP4 quantization.
load_AB
multi_stage_store_C
