
Mojo struct

EpiLoadPipelineStorage

struct EpiLoadPipelineStorage[num_stages: Int]

Storage for the epilogue load pipeline (loading of source tensor C).

Synchronizes the EpilogueLoad warp with the Epilogue warps. The epilogue load warp loads the source tensor C into shared memory (SMEM), and the epilogue warps consume it for residual operations.

  • Producer: EpilogueLoad warp (1 warp, 32 threads)
  • Consumer: Epilogue warps (4 warps, 128 threads)

Parameters

  • num_stages (Int): Number of epilogue load pipeline stages (typically 2).
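
To make the role of num_stages concrete, here is a minimal sketch of how a multi-stage producer/consumer ring typically cycles through its slots. It assumes the common convention that each tile uses slot `iteration % num_stages` and that a phase bit flips every time the ring wraps around. This page does not confirm that ProducerConsumerPipeline works this way; the helpers `stage_of` and `phase_of` are hypothetical names introduced only for illustration.

```mojo
# Illustrative sketch only: these helpers are hypothetical and are not part
# of EpiLoadPipelineStorage or ProducerConsumerPipeline.


def stage_of(iteration: Int, num_stages: Int) -> Int:
    # Slots are reused round-robin, so the slot index wraps at num_stages.
    return iteration % num_stages


def phase_of(iteration: Int, num_stages: Int) -> Int:
    # Assumed convention: the phase bit flips each time the ring wraps around.
    return (iteration // num_stages) % 2


def main():
    var num_stages = 2  # the "typically 2" case from the parameter description
    # The storage declares num_stages * 2 barriers (see BarrierArray).
    print("barriers:", num_stages * 2)
    for i in range(5):
        print(
            "tile", i,
            "stage", stage_of(i, num_stages),
            "phase", phase_of(i, num_stages),
        )
```

With num_stages = 2, tiles 0 and 1 occupy stages 0 and 1 in phase 0, tiles 2 and 3 reuse the same stages in phase 1, and tile 4 returns to stage 0 in phase 0. Under this assumed scheme, a second stage lets the producer warp load the next C tile while the consumer warps are still reading the previous one.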

Fields

  • barriers (BarrierPair[num_stages])

Implemented traits

AnyType, ImplicitlyDestructible

comptime members

__del__is_trivial

comptime __del__is_trivial = True

BarrierArray

comptime BarrierArray = SMemArray[SharedMemBarrier, (num_stages * 2)]

Methods

create_pipeline

create_pipeline(ref[AddressSpace._value._mlir_value] self) -> ProducerConsumerPipeline[num_stages]

Create a runtime pipeline from this storage.

Returns:

ProducerConsumerPipeline

barrier_ptr

barrier_ptr(ref[AddressSpace._value._mlir_value] self) -> MbarPtr

Escape hatch: get the raw barrier pointer.

Returns:

MbarPtr
