Mojo struct
BlackwellProfileWarp
struct BlackwellProfileWarp[load_warps: UInt32, mma_warps: UInt32, scheduler_warps: UInt32, epilogue_warps: UInt32, max_entries_per_warp: UInt32, //, WorkspaceManager: BlackwellWarpProfilingWorkspaceManager[load_warps, mma_warps, scheduler_warps, epilogue_warps, max_entries_per_warp], warp_role: UInt32 = UInt32(0)]
This struct measures the execution time of one or more warps and writes a single entry to the workspace.
Fields
- timeline (Tuple[UInt64, UInt64])
- workspace (Span[UInt64, MutAnyOrigin])
- entry_idx (UInt32)
Implemented traits
AnyType,
Copyable,
ImplicitlyCopyable,
ImplicitlyDestructible,
Movable
comptime members
enable_profiling
comptime enable_profiling = (max_entries_per_warp > UInt32(0))
Methods
__init__
__init__(out self, workspace: Span[UInt64, MutAnyOrigin], entry_idx: UInt32)
__enter__
__enter__(mut self)
__exit__
__exit__(mut self)
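Because the struct defines __enter__ and __exit__, it is presumably meant to wrap a region of warp code in a with block. The sketch below is a host-side Python model of that pattern, not the Mojo implementation: the class name ProfileWarpModel, the injectable clock, and the layout of two workspace slots per entry (start, end) are all assumptions made for illustration. Only the fields timeline, workspace, and entry_idx and the enable_profiling condition come from the signatures above.

```python
import time


class ProfileWarpModel:
    """Hypothetical host-side model of a scoped warp profiler."""

    def __init__(self, workspace, entry_idx, max_entries_per_warp,
                 clock=time.perf_counter_ns):
        self.workspace = workspace    # flat, mutable buffer of integers
        self.entry_idx = entry_idx    # which entry this scope writes
        # Mirrors: enable_profiling = (max_entries_per_warp > 0)
        self.enable_profiling = max_entries_per_warp > 0
        self.timeline = (0, 0)        # (start, end) timestamps
        self._clock = clock           # assumed timestamp source

    def __enter__(self):
        if self.enable_profiling:
            # Record the start timestamp when the scope opens.
            self.timeline = (self._clock(), 0)
        return self

    def __exit__(self, exc_type, exc, tb):
        if self.enable_profiling:
            # Record the end timestamp, then write exactly one entry.
            self.timeline = (self.timeline[0], self._clock())
            base = 2 * self.entry_idx  # assumed layout: 2 slots per entry
            self.workspace[base] = self.timeline[0]
            self.workspace[base + 1] = self.timeline[1]
        return False  # never suppress exceptions


# Usage: time one region and store it as entry 1 of the buffer.
workspace = [0] * 8
with ProfileWarpModel(workspace, entry_idx=1, max_entries_per_warp=4):
    sum(range(1000))
```

With max_entries_per_warp set to 0 the model does nothing on entry or exit, which matches how a compile-time enable_profiling flag would let the profiling code be dropped entirely.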