Mojo struct
Inner_matmul_neon
struct Inner_matmul_neon
Implemented traits
AnyType,
Copyable,
ImplicitlyCopyable,
ImplicitlyDestructible,
InnerMatmulKernel,
Movable
comptime members
__copyinit__is_trivial
comptime __copyinit__is_trivial = True
__del__is_trivial
comptime __del__is_trivial = True
__moveinit__is_trivial
comptime __moveinit__is_trivial = True
Methods
__inner_matmul__
__inner_matmul__[kernel_rows: Int, kernel_cols: Int, simd_size: Int](
    self,
    c: LayoutTensor[c.dtype, c.layout, c.origin, address_space=c.address_space, element_layout=c.element_layout, layout_int_type=c.layout_int_type, linear_idx_type=c.linear_idx_type, masked=c.masked, alignment=c.alignment],
    a: LayoutTensor[a.dtype, a.layout, a.origin, address_space=a.address_space, element_layout=a.element_layout, layout_int_type=a.layout_int_type, linear_idx_type=a.linear_idx_type, masked=a.masked, alignment=a.alignment],
    b_packed: LayoutTensor[b_packed.dtype, b_packed.layout, b_packed.origin, address_space=b_packed.address_space, element_layout=b_packed.element_layout, layout_int_type=b_packed.layout_int_type, linear_idx_type=b_packed.linear_idx_type, masked=b_packed.masked, alignment=b_packed.alignment],
    global_offset: GemmShape,
    global_bound: GemmShape,
    tile_n_k: IndexList[2],
    skip_boundary_check: Bool
)
Utility function for the inner loop. Runs the inner kernel over the whole (kernel_rows, TileN, TileK) tile.
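As a rough illustration of what running an inner kernel over a (kernel_rows, TileN, TileK) tile means, the sketch below accumulates one tile of C from the matching tiles of A and B in plain Python. It is a language-neutral reference, not the Mojo implementation: the function name `inner_matmul_reference`, its parameters, and the use of an unpacked, row-major B are all assumptions made for illustration. The real kernel reads a packed B, uses NEON SIMD instructions, and takes its offsets and bounds from `global_offset`, `global_bound`, and `tile_n_k`.

```python
def inner_matmul_reference(c, a, b, row0, kernel_rows, n0, tile_n, k0, tile_k):
    """Accumulate one tile: c[rows, cols] += a[rows, ks] @ b[ks, cols].

    Hypothetical helper for illustration only. `b` is an ordinary
    (unpacked) K x N matrix here, which is an assumption; the Mojo
    kernel consumes a packed layout instead.
    """
    m_bound = len(c)      # total rows of C
    n_bound = len(c[0])   # total columns of C
    k_bound = len(b)      # shared reduction dimension

    # Clamp each tile range to the matrix bounds, a stand-in for the
    # boundary handling a real kernel needs on edge tiles.
    for i in range(row0, min(row0 + kernel_rows, m_bound)):
        for j in range(n0, min(n0 + tile_n, n_bound)):
            acc = c[i][j]  # start from the existing partial sum
            for k in range(k0, min(k0 + tile_k, k_bound)):
                acc += a[i][k] * b[k][j]
            c[i][j] = acc
    return c
```

Calling this helper once per (row block, N tile, K tile) combination reproduces a full matrix product, which is the role an inner kernel plays inside the outer tiling loops.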