Mojo module
flash_attention
Functions
- `flash_attention`
- `flash_attention_kv_cache`
- `flash_attention_split_kv`: Variant of flash attention that takes the previous KV cache `input_{k,v}_cache_fn` and the current KV tensors `input_k_fn` and `input_v_fn` as separate arguments.
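To illustrate the idea behind a split-KV variant, here is a minimal NumPy sketch (not the Mojo API; all names below are hypothetical). It attends over the cached and current KV blocks separately and merges the two partial softmaxes with the standard flash-attention log-sum-exp correction, so the blocks never need to be concatenated:

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v, scale):
    """Reference: full softmax attention over all keys/values."""
    return softmax((q @ k.T) * scale) @ v

def attention_split_kv(q, k_cache, v_cache, k_new, v_new, scale):
    """Hypothetical sketch: attend over cached and current KV blocks
    separately, combining partial softmaxes with a running max rescale."""
    out = m = l = None
    for k, v in ((k_cache, v_cache), (k_new, v_new)):
        s = (q @ k.T) * scale                      # scores for this block
        m_blk = s.max(axis=-1, keepdims=True)      # block-local max
        p = np.exp(s - m_blk)
        l_blk = p.sum(axis=-1, keepdims=True)      # block-local normalizer
        o_blk = p @ v                              # unnormalized block output
        if out is None:
            out, m, l = o_blk, m_blk, l_blk
        else:
            # Rescale both running and block terms to the shared max.
            m_new = np.maximum(m, m_blk)
            a, b = np.exp(m - m_new), np.exp(m_blk - m_new)
            out = a * out + b * o_blk
            l = a * l + b * l_blk
            m = m_new
    return out / l

rng = np.random.default_rng(0)
q = rng.standard_normal((4, 8))
k_cache, v_cache = rng.standard_normal((16, 8)), rng.standard_normal((16, 8))
k_new, v_new = rng.standard_normal((2, 8)), rng.standard_normal((2, 8))
scale = 1.0 / np.sqrt(8)

split = attention_split_kv(q, k_cache, v_cache, k_new, v_new, scale)
full = attention(q, np.concatenate([k_cache, k_new]),
                 np.concatenate([v_cache, v_new]), scale)
assert np.allclose(split, full)
```

Because the merge is exact, the split computation matches attention over the concatenated keys/values, which is what lets a kernel consume the cache and the current tensors as separate arguments.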