Skip to content

Serve models

Tutorial 2 covers basic single-model serving. These guides cover configuration beyond a single local server, including OpenAI-compatible /v1 clients and Anthropic-compatible /v1/messages clients.

If you need to…Start here
Change host, port, CORS, request limits, or authenticationHTTP server configuration
Serve more than one model from one processRunning multiple models
Use the browser chat interfaceUsing the web UI
Use OpenAI-compatible clientsOpenAI-compatible APIs
Use the newer OpenAI Responses endpointOpenAI Responses API
Use Anthropic-compatible clientsAnthropic Messages API
Use Codex or Claude Code with a local serverUse Codex and Claude Code

For operational concerns (reverse proxy, Docker, health checks, TLS), see the deployment guides.