Mojo module
shard_and_stack
Functions
-
shard_and_stack: Shard weight tensors across multiple devices for tensor parallelism.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!
Mojo module
shard_and_stack: Shard weight tensors across multiple devices for tensor parallelism.Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!