Mojo module
shard_and_stack
Functions
- shard_and_stack: Shard weight tensors across multiple devices for tensor parallelism.
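This page does not show the function's signature, so the following is only a conceptual sketch of what sharding a weight tensor for tensor parallelism means, written in plain Python rather than Mojo. The helper name `shard_columns`, its parameters, and the choice to split along columns are assumptions made for illustration; they are not the module's API.

```python
# Conceptual sketch only: `shard_columns` is a hypothetical helper,
# not the signature of the Mojo `shard_and_stack` function.

def shard_columns(weight, num_devices):
    """Split a 2-D weight (a list of rows) into equal column blocks, one per device."""
    cols = len(weight[0])
    if cols % num_devices != 0:
        raise ValueError("column count must divide evenly across devices")
    width = cols // num_devices
    # Shard d holds columns [d * width, (d + 1) * width) of every row.
    return [
        [row[d * width:(d + 1) * width] for row in weight]
        for d in range(num_devices)
    ]


weight = [[1, 2, 3, 4],
          [5, 6, 7, 8]]
shards = shard_columns(weight, 2)

# Concatenating the shards row by row recovers the original weight.
rebuilt = [sum((shard[r] for shard in shards), []) for r in range(len(weight))]
```

Each device would then hold one entry of `shards` and compute its slice of the output independently. An even split is assumed here for simplicity.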