Skip to content

Auto-tune with mistralrs tune

mistralrs tune recommends a quantization and device-mapping configuration for the host.

Terminal window
mistralrs tune -m google/gemma-4-E4B-it

Output is a table with columns: Quant | Est. Size | VRAM % | Context Room | Quality | Status. The status column marks one row as 🚀 Recommended; other rows are marked ✅ Fits, ⚠️ Hybrid, or ❌ Too Large.

Quality tiers: Baseline, Near-lossless, Good, Acceptable, Degraded.

A recommended command line is printed below the table.

Terminal window
mistralrs tune --profile quality -m google/gemma-4-E4B-it

--profile accepts quality, balanced (default), or fast.

Terminal window
mistralrs tune -m google/gemma-4-E4B-it --emit-config gemma.toml

Run with the recommended settings:

Terminal window
mistralrs from-config -f gemma.toml
Terminal window
mistralrs tune -m google/gemma-4-E4B-it --json > results.json