Mojo struct
EpiLoadPipelineStorage
struct EpiLoadPipelineStorage[num_stages: Int]
Storage for epilogue load pipeline (source C loading).
For EpilogueLoad warp → Epilogue warps synchronization. The epilogue load warp loads source tensor C into SMEM, and the epilogue warps consume it for residual operations.
Producer: EpilogueLoad warp (1 warp, 32 threads) Consumer: Epilogue warps (4 warps, 128 threads)
Parameters
- num_stages (
Int): Number of epilogue load pipeline stages (typically 2).
Fields
- barriers (
BarrierPair[num_stages]):
Implemented traits
AnyType,
ImplicitlyDestructible
comptime members
__del__is_trivial
comptime __del__is_trivial = True
BarrierArray
comptime BarrierArray = SMemArray[SharedMemBarrier, (num_stages * 2)]
Methods
create_pipeline
create_pipeline(ref[AddressSpace._value._mlir_value] self) -> ProducerConsumerPipeline[num_stages]
Create runtime pipeline from this storage.
Returns:
barrier_ptr
barrier_ptr(ref[AddressSpace._value._mlir_value] self) -> MbarPtr
Escape hatch: Get raw barrier pointer.
Returns:
MbarPtr
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!