Skip to content

Serve models

Tutorial 2 covers basic single-model serving. These guides cover the configuration needed beyond a single local server.

If you need to…Start here
Change host, port, CORS, request limits, or authenticationHTTP server configuration
Serve more than one model from one processRunning multiple models
Use the browser chat interfaceUsing the web UI
Use the newer OpenAI Responses endpointOpenAI Responses API

For operational concerns (reverse proxy, Docker, health checks, TLS), see the deployment guides.