# Troubleshooting

Common issues and solutions for mistral.rs.
## Debug Mode

Enable debug mode for more information:

```bash
MISTRALRS_DEBUG=1 mistralrs run -m <model>
```
Debug mode causes:

- If loading a GGUF or GGML model, outputs a file containing the names, shapes, and types of each tensor: `mistralrs_gguf_tensors.txt` or `mistralrs_ggml_tensors.txt`
- Increased logging verbosity
## System Diagnostics

Run the built-in diagnostics tool:

```bash
mistralrs doctor
```

This checks your system configuration and reports any issues.
## Common Issues

### CUDA Issues
**Setting the CUDA compiler path:**

- Set the `NVCC_CCBIN` environment variable during build
**Error: `recompile with -fPIE`:**

- Some Linux distributions require compiling with `-fPIE`
- Set during build:

```bash
CUDA_NVCC_FLAGS=-fPIE cargo build --release --features cuda
```
**Error: `CUDA_ERROR_NOT_FOUND` or symbol not found:**

- For non-quantized models, specify the data type to load and run in
- Use one of `f32`, `f16`, `bf16`, or `auto` (`auto` chooses based on device)
- Example:

```bash
mistralrs run -m <model> -d auto
```
**Minimum CUDA compute capability:**

- The minimum supported CUDA compute cap is 5.3
- Set a specific compute cap with:

```bash
CUDA_COMPUTE_CAP=80 cargo build --release --features cuda
```
### Metal Issues (macOS)
**Metal not found (`error: unable to find utility "metal"`):**

- Install the Xcode command line tools:

```bash
xcode-select --install
```

- Set the active developer directory:

```bash
sudo xcode-select --switch /Applications/Xcode.app/Contents/Developer
```
**Error: `cannot execute tool 'metal' due to missing Metal toolchain`:**

- Install the Metal toolchain:

```bash
xcodebuild -downloadComponent MetalToolchain
```
**Disabling Metal kernel precompilation:**

- By default, Metal kernels are precompiled at build time for better performance
- To skip precompilation (useful for CI or when Metal is not needed):

```bash
MISTRALRS_METAL_PRECOMPILE=0 cargo build --release --features metal
```
### Memory Issues
**Disabling mmap loading:**

- Set `MISTRALRS_NO_MMAP=1` to disable memory-mapped file loading
- Forces all tensor data into memory
- Useful if you're seeing mmap-related errors
**Out of memory errors:**

- Try using quantization: `--isq q4k` or `--isq q8_0`
- Use device mapping to offload layers: `-n 0:16;cpu:16`
- Reduce context length with PagedAttention: `--pa-context-len 4096`
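To gauge how much quantization can help before trying the options above, a rough back-of-the-envelope estimate of weight memory from parameter count and bits per weight is useful. This sketch covers weights only; real memory use also includes the KV cache and activations, and the ~4.5 bits/weight figure for 4-bit schemes is an approximation that accounts for quantization scales:

```python
def approx_weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in gigabytes."""
    # bytes = parameters * (bits per weight / 8)
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9

# A 7B-parameter model: f16 (16 bits) vs. a ~4-bit scheme (~4.5 bits
# per weight once quantization scales are included).
print(approx_weight_gb(7, 16))   # 14.0 GB
print(approx_weight_gb(7, 4.5))  # 3.9375 GB
```

If even the quantized estimate exceeds your VRAM, combine quantization with device mapping so the remaining layers run on the CPU.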
### Model Loading Issues
**Model type not auto-detected:**

- If auto-detection fails, please raise an issue
- You can manually specify the architecture if needed
**Chat template issues:**

- Templates are usually auto-detected
- Override with: `-c /path/to/template.jinja`
- See Chat Templates for details
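For orientation, a chat template is a Jinja file that renders the message list into the prompt format the model expects. The fragment below is purely illustrative: the `<|...|>` tokens are made up, and you should use the exact special tokens your model was trained with.

```jinja
{%- for message in messages -%}
<|{{ message['role'] }}|>{{ message['content'] }}
{%- endfor -%}
<|assistant|>
```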
## Getting Help

If you're still stuck:

- Discord - Community support
- Matrix - Alternative chat
- GitHub Issues - Bug reports and feature requests
When reporting issues, please include:

- Output of `mistralrs doctor`
- Full error message
- Command you ran
- Hardware (GPU model, OS)