For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo function

causal_conv1d_varlen_states_cpu

def causal_conv1d_varlen_states_cpu[x_dtype: DType, cu_seqlens_dtype: DType, states_dtype: DType](total_tokens: Int, dim: Int, batch: Int, state_len: Int, x: TileTensor[x_dtype, Storage=x.Storage, address_space=x.address_space, linear_idx_type=x.linear_idx_type], cu_seqlens: TileTensor[cu_seqlens_dtype, Storage=cu_seqlens.Storage, address_space=cu_seqlens.address_space, linear_idx_type=cu_seqlens.linear_idx_type], states: TileTensor[states_dtype, Storage=states.Storage, address_space=states.address_space, linear_idx_type=states.linear_idx_type], x_seqlen_stride: UInt32, x_dim_stride: UInt32, states_batch_stride: UInt32, states_dim_stride: UInt32, states_seqlen_stride: UInt32)

Extract the last state_len elements from each variable length sequence.

For each sequence in the batch, copies the last state_len tokens (or fewer if the sequence is shorter) to the states tensor. If a sequence is shorter than state_len, the earlier positions in states are zero-padded.

This is the CPU reference implementation for causal_conv1d_varlen_states.

Parameters:

x_dtype (DType): Data type of the input tensor.
cu_seqlens_dtype (DType): Data type of the cumulative sequence lengths.
states_dtype (DType): Data type of the output states tensor.

Args:

total_tokens (Int): Total number of tokens across all sequences.
dim (Int): Number of channels/dimensions.
batch (Int): Number of sequences.
state_len (Int): Number of elements to extract per sequence (typically width - 1).
x (TileTensor[x_dtype, Storage=x.Storage, address_space=x.address_space, linear_idx_type=x.linear_idx_type]): Input tensor of shape (total_tokens, dim).
cu_seqlens (TileTensor[cu_seqlens_dtype, Storage=cu_seqlens.Storage, address_space=cu_seqlens.address_space, linear_idx_type=cu_seqlens.linear_idx_type]): Cumulative sequence lengths of shape (batch + 1,).
states (TileTensor[states_dtype, Storage=states.Storage, address_space=states.address_space, linear_idx_type=states.linear_idx_type]): Output states tensor of shape (batch, dim, state_len).
x_seqlen_stride (UInt32): Stride for sequence dimension in x.
x_dim_stride (UInt32): Stride for dimension in x.
states_batch_stride (UInt32): Stride for batch dimension in states.
states_dim_stride (UInt32): Stride for dimension in states.
states_seqlen_stride (UInt32): Stride for sequence dimension in states.