Skip to main content

Mojo struct

BlackwellProfileWarp

struct BlackwellProfileWarp[load_warps: UInt32, mma_warps: UInt32, scheduler_warps: UInt32, epilogue_warps: UInt32, max_entries_per_warp: UInt32, //, WorkspaceManager: BlackwellWarpProfilingWorkspaceManager[load_warps, mma_warps, scheduler_warps, epilogue_warps, max_entries_per_warp], warp_role: UInt32 = 0]

This struct calculates execution time for a warp/s, and writes a single entry to the workspace.

Fields

  • timeline (Tuple[UInt64, UInt64]):
  • workspace (Span[UInt64, MutableAnyOrigin]):
  • entry_idx (UInt32):

Implemented traits

AnyType, Copyable, ImplicitlyCopyable, Movable, UnknownDestructibility

Aliases

__copyinit__is_trivial

alias __copyinit__is_trivial = False

__del__is_trivial

alias __del__is_trivial = False

__moveinit__is_trivial

alias __moveinit__is_trivial = False

enable_profiling

alias enable_profiling = (max_entries_per_warp > 0)

Methods

__init__

__init__(out self, workspace: Span[UInt64, MutableAnyOrigin], entry_idx: UInt32)

__enter__

__enter__(mut self)

__exit__

__exit__(mut self)

Was this page helpful?