Re-exports§
pub use safetensors::Shard;
pub use safetensors::ShardedSafeTensors;
pub use safetensors::ShardedVarBuilder;
Modules§
Structs§
- This layer has a weight that is parallelized along the output dimension, taking the “full” input dimension.
- Device/configurable intelligent matrix multiplication
- This layer has no parallelization
- This layer has a weight that is parallelized along the input dimension, returning the “full” output dimension.
Enums§
Constants§
- Offset for the quant type. UQFF always serializes the version first.
Statics§
Traits§
- Quantized method for a quantized matmul.