Mojo module
grouped_matmul_sm100_1d1d
comptime values
WarpRole
comptime WarpRole = WarpRole[False]
Structs
Functions
- blackwell_block_scaled_matmul_tma_umma_warp_specialized: Launches the grouped block-scaled matmul kernel on SM100.
- blackwell_block_scaled_tma_umma_warp_specialized_kernel
- consumer_main_loop
- copy_accum_to_gmem
- grouped_matmul_dynamic_scaled_nvfp4: Performs grouped matrix multiplication with NVFP4 quantization.
- load_AB
- multi_stage_store_C