Mojo struct
BlackwellProfileWarp
struct BlackwellProfileWarp[load_warps: UInt32, mma_warps: UInt32, scheduler_warps: UInt32, epilogue_warps: UInt32, max_entries_per_warp: UInt32, //, WorkspaceManager: BlackwellWarpProfilingWorkspaceManager[load_warps, mma_warps, scheduler_warps, epilogue_warps, max_entries_per_warp], warp_role: UInt32 = 0]
This struct calculates execution time for a warp/s, and writes a single entry to the workspace.
Fields
- timeline (
Tuple[UInt64, UInt64]
): - workspace (
Span[UInt64, MutableAnyOrigin]
): - entry_idx (
UInt32
):
Implemented traits
AnyType
,
Copyable
,
ImplicitlyCopyable
,
Movable
,
UnknownDestructibility
Aliases
__copyinit__is_trivial
alias __copyinit__is_trivial = False
__del__is_trivial
alias __del__is_trivial = False
__moveinit__is_trivial
alias __moveinit__is_trivial = False
enable_profiling
alias enable_profiling = (max_entries_per_warp > 0)
Methods
__init__
__init__(out self, workspace: Span[UInt64, MutableAnyOrigin], entry_idx: UInt32)
__enter__
__enter__(mut self)
__exit__
__exit__(mut self)
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!