Serve models
Tutorial 2 covers basic single-model serving. These guides cover the configuration needed beyond a single local server.
Choose by task
Section titled “Choose by task”| If you need to… | Start here |
|---|---|
| Change host, port, CORS, request limits, or authentication | HTTP server configuration |
| Serve more than one model from one process | Running multiple models |
| Use the browser chat interface | Using the web UI |
| Use the newer OpenAI Responses endpoint | OpenAI Responses API |
For operational concerns (reverse proxy, Docker, health checks, TLS), see the deployment guides.