Skip to main content

Mojo module

shard_and_stack

Functions

  • shard_and_stack: Shard weight tensors across multiple devices for tensor parallelism.

Was this page helpful?