Guides

Guides answer “how do I…” questions. They assume mistral.rs is installed. Otherwise, start with the Tutorials.

Choose by task

If you need to…	Start here
Install for a specific platform or deployment target	Install and deploy
Run an HTTP server, compatibility API, or web UI	Serve models
Reduce memory use or improve throughput	Performance
Add tools, search, code execution, or MCP	Build agents
Use the Python package	Python SDK
Use the Rust crate	Rust SDK
Work with vision, speech, image generation, or embeddings	Model types
Change model behavior or load adapters	Customize

Install and deploy: platform-specific install steps, Docker images, and pre-production checks.
Serve models: HTTP server configuration, multi-model serving, the web UI, OpenAI-compatible APIs, and the Anthropic Messages API.
Performance: quantization selection, the tune command, Flash and Paged attention, and multi-GPU or multi-machine splits.
Build agents: tool calling, code execution, web search, MCP, and persistent sessions.
Python SDK: streaming completions, image and video input, and the multi-turn session API.
Rust SDK: streaming and embedding mistral.rs in an Axum application.
Model types: vision input, image generation, speech, and embedding models.
Customize: LoRA adapters, AnyMoE, MatFormer, sampling parameters, and TOML config.