Auto-tune with mistralrs tune
mistralrs tune recommends a quantization and device-mapping configuration for the host.
Basic usage
Section titled “Basic usage”mistralrs tune -m google/gemma-4-E4B-itOutput is a table with columns: Quant | Est. Size | VRAM % | Context Room | Quality | Status. The status column marks one row as 🚀 Recommended; other rows are marked ✅ Fits, ⚠️ Hybrid, or ❌ Too Large.
Quality tiers: Baseline, Near-lossless, Good, Acceptable, Degraded.
A recommended command line is printed below the table.
Profiles
Section titled “Profiles”mistralrs tune --profile quality -m google/gemma-4-E4B-it--profile accepts quality, balanced (default), or fast.
Saving the recommendation
Section titled “Saving the recommendation”mistralrs tune -m google/gemma-4-E4B-it --emit-config gemma.tomlRun with the recommended settings:
mistralrs from-config -f gemma.tomlMachine-readable output
Section titled “Machine-readable output”mistralrs tune -m google/gemma-4-E4B-it --json > results.json