Quantization types
ISQ types supported by mistral.rs. For normal CLI usage, use --quant; use --isq only when you want to force runtime ISQ and skip the UQFF lookup. For format selection guidance, see the quantization decision guide. For underlying tradeoffs, see the explanation page.
Numeric shorthands
Section titled “Numeric shorthands”Pass --quant N for the normal CLI path. If it falls back to runtime ISQ, or if you pass --isq N directly, mistral.rs resolves N to a format based on the detected backend.
| Shorthand | Metal resolves to | CUDA / CPU resolves to |
|---|---|---|
2 | AFQ2 | Q2K |
3 | AFQ3 | Q3K |
4 | AFQ4 | Q4K |
5 | Q5K | Q5K |
6 | AFQ6 | Q6K |
8 | AFQ8 | Q8_0 |
Format-specific types
Section titled “Format-specific types”AFQ family (Metal-only)
Section titled “AFQ family (Metal-only)”Adaptive float quantization, Metal backend only.
| Type | Bits |
|---|---|
afq2 | 2 |
afq3 | 3 |
afq4 | 4 |
afq6 | 6 |
afq8 | 8 |
Loading AFQ on CUDA or CPU returns an error.
Q*K family (CUDA and CPU)
Section titled “Q*K family (CUDA and CPU)”GGML K-quant formats. Supported on all backends.
| Type | Bits |
|---|---|
q2k | 2 |
q3k | 3 |
q4k | 4 |
q5k | 5 |
q6k | 6 |
Legacy GGML types
Section titled “Legacy GGML types”Supported for GGUF compatibility:
| Type | Bits |
|---|---|
q4_0, q4_1 | 4 |
q5_0, q5_1 | 5 |
q8_0 | 8 |
GGUF files using these types load correctly.
Native FP8 on NVIDIA GPUs with compute capability 8.9+.
| Type | Bits | Layout |
|---|---|---|
fp8 | 8 | E4M3 (4-bit exponent, 3-bit mantissa) |
f8q8 | 8 | FP8 weights, INT8 activations |
4-bit microscaling format. Native on Blackwell; emulated elsewhere.
| Type | Bits |
|---|---|
mxfp4 | 4 |
Half-quadratic quantization.
| Type | Bits |
|---|---|
hqq4 | 4 |
hqq8 | 8 |
GPTQ and AWQ
Section titled “GPTQ and AWQ”Not ISQ types, pre-quantized formats. Load directly when a Hugging Face model is available as GPTQ or AWQ:
mistralrs run --format plain -m <gptq-or-awq-repo>mistral.rs detects the quantization from the model’s config. No --quant or --isq required.
See the pick-a-quantization guide for format selection.