Mojo module
layout_tensor
Aliases
-
binary_op_type = fn[DType, Int](lhs: SIMD[$0, $1], rhs: SIMD[$0, $1]) -> SIMD[$0, $1]
:
Structs
-
LayoutTensor
: This is a Tensor type that has a specified memory layout and rank. The following example demonstrate a LayoutTensor of float32 with a row major layout of shape (5, 4). -
LayoutTensorIter
: Iterate through a memory buffer and construct layout tensor. -
ThreadScope
:
Functions
-
copy_dram_to_local
: Used to copy data from DRAM to registers for AMD GPUs. It uses buffer_load intrinsic to load data and can check for bounds. In addition to dst and src, it takes src_base as an argument to construct the buffer descriptor of the src tensor. src_base is the original global memory tensor from which src is derived. -
copy_dram_to_sram
: -
copy_dram_to_sram_async
: -
copy_local_to_dram
: -
copy_local_to_local
: -
copy_local_to_sram
: -
copy_sram_to_dram
: -
copy_sram_to_local
: -
cp_async_k_major
: -
cp_async_mn_major
: -
stack_allocation_like
:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!