Mojo module
profiler
comptime valuesβ
MatmulProfileWarpβ
comptime MatmulProfileWarp[warp_role: UInt32, max_entries_per_warp: UInt32] = BlackwellProfileWarp[BlackwellWarpProfilingWorkspaceManager(), warp_role]
Parametersβ
MatmulWarpSpecializationWorkSpaceManagerβ
comptime MatmulWarpSpecializationWorkSpaceManager[max_entries_per_warp: UInt32] = BlackwellWarpProfilingWorkspaceManager[UInt32(1), UInt32(1), UInt32(1), UInt32(4), max_entries_per_warp]
Parametersβ
- βmax_entries_per_warp (
UInt32):
Structsβ
- β
BlackwellProfileWarp: This struct calculates execution time for a warp/s, and writes a single entry to the workspace. - β
BlackwellWarpProfilingWorkspaceManager: This struct manages the profiling workspace. The workspaces consists of equal sized chunks, the total number of which is equal to the total number of active SMs. Each SM chunk consists of sequences of entries, with a maximum number of entries per warp role.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!