ENTERPILOT · SantiagoDePolonia · Jun 20, 2026
diff --git a/README.md b/README.md
diff --git a/docs/advanced/admin-endpoints.mdx b/docs/advanced/admin-endpoints.mdx
@@ -17,26 +17,10 @@ Both are on by default because observability shouldn't be opt-in. If you don't n
 
 ## Configuration
 
-| Variable                              | Description                                      | Default |
-| ------------------------------------- | ------------------------------------------------ | ------- |
-| `ADMIN_ENDPOINTS_ENABLED`             | Enable the admin REST API                        | `true`  |
-| `ADMIN_UI_ENABLED`                    | Enable the admin dashboard UI                    | `true`  |
-| `DASHBOARD_LIVE_LOGS_ENABLED`         | Stream realtime dashboard audit/usage previews   | `true`  |
-| `DASHBOARD_LIVE_LOGS_BUFFER_SIZE`     | In-memory replay window for live dashboard events | `10000` |
-| `DASHBOARD_LIVE_LOGS_REPLAY_LIMIT`    | Max events replayed to one reconnecting client   | `1000`  |
-| `DASHBOARD_LIVE_LOGS_HEARTBEAT_SECONDS` | Idle stream heartbeat interval in seconds      | `15`    |
-
-Or in YAML:
-
-```yaml
-admin:
-  endpoints_enabled: true
-  ui_enabled: true
-  live_logs_enabled: true
-  live_logs_buffer_size: 10000
-  live_logs_replay_limit: 1000
-  live_logs_heartbeat_seconds: 15
-```
+Admin and dashboard behavior is controlled by environment variables or the
+equivalent `admin:` YAML block. See
+[Admin configuration](/advanced/configuration#admin) for the full table of
+variables and defaults, along with the matching YAML.
 
 <Note>
   The dashboard UI requires the REST API to be enabled. If you set

diff --git a/docs/advanced/api-endpoints.mdx b/docs/advanced/api-endpoints.mdx
@@ -0,0 +1,73 @@
+---
+title: "API Endpoints"
+description: "Reference for GoModel's OpenAI-compatible and Anthropic-compatible endpoints, provider passthrough, and operations routes."
+icon: "list-tree"
+---
+
+GoModel exposes OpenAI-compatible and Anthropic-compatible APIs, provider-native
+passthrough, and a set of operations endpoints. Admin and dashboard routes are
+documented separately in [Admin Endpoints](/advanced/admin-endpoints).
+
+For request and response details, see the dedicated guides:
+[Responses API](/advanced/responses-api), [Conversations API](/advanced/conversations-api),
+[Anthropic Messages API](/advanced/anthropic-messages-api), and
+[Audio API](/advanced/audio-api).
+
+## OpenAI-Compatible API
+
+| Endpoint                          | Method | Description                                                                                                  |
+| --------------------------------- | ------ | ------------------------------------------------------------------------------------------------------------ |
+| `/v1/chat/completions`            | POST   | Chat completions (streaming supported)                                                                       |
+| `/v1/responses`                   | POST   | Create an OpenAI Responses API response                                                                      |
+| `/v1/responses/{id}`              | GET    | Retrieve a stored response                                                                                   |
+| `/v1/responses/{id}`              | DELETE | Delete a stored response (forwards native deletion where supported)                                          |
+| `/v1/responses/{id}/cancel`       | POST   | Cancel an in-progress response (provider-native where supported)                                             |
+| `/v1/responses/{id}/input_items`  | GET    | List the input items of a stored response                                                                    |
+| `/v1/responses/input_tokens`      | POST   | Count input tokens for a Responses request                                                                   |
+| `/v1/responses/compact`           | POST   | Compact a Responses conversation (provider-native where supported)                                           |
+| `/v1/conversations`               | POST   | Create a conversation (gateway-managed)                                                                      |
+| `/v1/conversations/{id}`          | GET    | Retrieve a conversation                                                                                      |
+| `/v1/conversations/{id}`          | POST   | Replace conversation metadata in full                                                                        |
+| `/v1/conversations/{id}`          | DELETE | Delete a conversation                                                                                        |
+| `/v1/embeddings`                  | POST   | Text embeddings                                                                                              |
+| `/v1/models`                      | GET    | List available models                                                                                        |
+| `/v1/audio/speech`                | POST   | Text-to-speech, returning binary audio                                                                       |
+| `/v1/audio/transcriptions`        | POST   | Speech-to-text from a multipart upload                                                                       |
+| `/v1/realtime`                    | GET    | Realtime speech-to-speech websocket upgrade (when `REALTIME_ENABLED`)                                        |
+| `/v1/files`                       | POST   | Upload a file (OpenAI-compatible multipart)                                                                  |
+| `/v1/files`                       | GET    | List files                                                                                                   |
+| `/v1/files/{id}`                  | GET    | Retrieve file metadata                                                                                       |
+| `/v1/files/{id}`                  | DELETE | Delete a file                                                                                                |
+| `/v1/files/{id}/content`          | GET    | Retrieve raw file content                                                                                    |
+| `/v1/batches`                     | POST   | Create a native provider batch (OpenAI-compatible schema; inline `requests` supported where provider-native) |
+| `/v1/batches`                     | GET    | List stored batches                                                                                          |
+| `/v1/batches/{id}`                | GET    | Retrieve one stored batch                                                                                    |
+| `/v1/batches/{id}/cancel`         | POST   | Cancel a pending batch                                                                                       |
+| `/v1/batches/{id}/results`        | GET    | Retrieve native batch results when available                                                                 |
+
+## Anthropic-Compatible API
+
+| Endpoint                    | Method | Description                                                                   |
+| --------------------------- | ------ | ----------------------------------------------------------------------------- |
+| `/v1/messages`              | POST   | Anthropic Messages API through translated model routing (streaming supported) |
+| `/v1/messages/count_tokens` | POST   | Heuristic Anthropic Messages input token estimate                             |
+
+## Provider Passthrough
+
+| Endpoint            | Method                                       | Description                                                |
+| ------------------- | -------------------------------------------- | ---------------------------------------------------------- |
+| `/p/{provider}/...` | GET, POST, PUT, PATCH, DELETE, HEAD, OPTIONS | Provider-native passthrough with opaque upstream responses |
+
+## Admin Endpoints
+
+Admin REST and dashboard routes (`/admin/*`) are covered in
+[Admin Endpoints](/advanced/admin-endpoints).
+
+## Operations Endpoints
+
+| Endpoint              | Method | Description                                                                        |
+| --------------------- | ------ | ---------------------------------------------------------------------------------- |
+| `/health`             | GET    | Liveness check (always 200 while the process serves)                               |
+| `/health/ready`       | GET    | Readiness check: pings storage (503 if down) and Redis cache (degraded, still 200) |
+| `/metrics`            | GET    | Prometheus metrics (experimental, when enabled)                                    |
+| `/swagger/index.html` | GET    | Swagger UI (when enabled)                                                          |
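
The `/health` and `/health/ready` rows above describe status-code semantics that a probe or deploy script has to interpret. The following is a minimal sketch of that interpretation, assuming only what the table states: 503 means the storage ping failed, and 200 can also cover a degraded Redis cache. The function name `classify_readiness` and the labels it returns are hypothetical, and the response body format is not shown in this diff, so the sketch looks at the status code alone.

```python
def classify_readiness(status_code: int) -> str:
    """Map a /health/ready status code to a coarse label.

    The labels ("ready", "not_ready", "unknown") are hypothetical names
    chosen for this sketch; they are not part of GoModel.
    """
    if status_code == 200:
        # Per the table, 200 covers both a fully healthy gateway and one
        # whose Redis cache is degraded; the code alone cannot tell them apart.
        return "ready"
    if status_code == 503:
        # Per the table, 503 is returned when the storage ping fails.
        return "not_ready"
    # Any other code is outside what the table documents.
    return "unknown"
```

Because a degraded cache still returns 200, a caller that needs to tell the two states apart would have to inspect something other than the status code.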
diff --git a/docs/docs.json b/docs/docs.json
@@ -60,6 +60,7 @@
                             "advanced/configuration",
                             "advanced/config-yaml",
                             "advanced/cli",
+                            "advanced/api-endpoints",
                             "advanced/resilience",
                             "advanced/responses-api",
                             "advanced/responses-compatibility",

diff --git a/docs/providers/overview.mdx b/docs/providers/overview.mdx
@@ -13,30 +13,69 @@ quirks.
 
 ## Supported providers
 
-| Provider | Credential | Guide |
-| -------- | ---------- | ----- |
-| OpenAI | `OPENAI_API_KEY` | — |
-| Anthropic | `ANTHROPIC_API_KEY` | [Anthropic](/providers/anthropic) |
-| Google Gemini | `GEMINI_API_KEY` | [Google Gemini](/providers/gemini) |
-| Google Vertex AI | `VERTEX_PROJECT` + `VERTEX_LOCATION` + GCP credentials | [Google Vertex AI](/providers/vertex) |
-| DeepSeek | `DEEPSEEK_API_KEY` | [DeepSeek](/providers/deepseek) |
-| Groq | `GROQ_API_KEY` | — |
-| OpenRouter | `OPENROUTER_API_KEY` | — |
-| Z.ai | `ZAI_API_KEY` (`ZAI_BASE_URL` optional) | — |
-| xAI (Grok) | `XAI_API_KEY` | — |
-| MiniMax | `MINIMAX_API_KEY` (`MINIMAX_BASE_URL` optional) | — |
-| Alibaba Cloud Model Studio (Bailian) | `BAILIAN_API_KEY` (`BAILIAN_BASE_URL` optional) | [Alibaba Cloud Model Studio](/providers/bailian) |
-| Xiaomi MiMo | `XIAOMI_API_KEY` (`XIAOMI_BASE_URL` optional) | [Xiaomi MiMo](/providers/xiaomi) |
-| OpenCode Go | `OPENCODE_GO_API_KEY` (`OPENCODE_GO_BASE_URL` optional) | [OpenCode Go](/providers/opencode-go) |
-| Azure OpenAI | `AZURE_API_KEY` + `AZURE_BASE_URL` (`AZURE_API_VERSION` optional) | [Azure OpenAI](/providers/azure) |
-| Amazon Bedrock | `BEDROCK_BASE_URL` (region or endpoint) + AWS credentials | [Amazon Bedrock](/providers/bedrock) |
-| Oracle GenAI | `ORACLE_API_KEY` + `ORACLE_BASE_URL` | [Oracle GenAI](/providers/oracle) |
-| Ollama | `OLLAMA_BASE_URL` | [Ollama](/providers/multiple-ollama) |
-| vLLM | `VLLM_BASE_URL` (`VLLM_API_KEY` optional) | [vLLM](/providers/vllm) |
+Example model identifiers are illustrative and subject to change; consult
+provider catalogs for current models. Feature columns reflect gateway API
+support, not every individual model capability exposed by an upstream provider.
 
-See the [README provider table](https://github.com/ENTERPILOT/GoModel#supported-llm-providers)
-for per-provider feature support (chat, Responses, embeddings, files, batches,
-passthrough).
+| Provider | Credential | Example Model | Chat | `/responses` | Embed | Files | Batches | Passthru | Guide |
+| -------- | ---------- | ------------- | :--: | :----------: | :---: | :---: | :-----: | :------: | ----- |
+| OpenAI | `OPENAI_API_KEY` | `gpt-5.5` | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | — |
+| Anthropic | `ANTHROPIC_API_KEY` | `claude-sonnet-4-20250514` | ✅ | ✅ | ❌ | ❌ | ✅ | ✅ | [Anthropic](/providers/anthropic) |
+| Google Gemini | `GEMINI_API_KEY` | `gemini-2.5-flash` | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | [Google Gemini](/providers/gemini) |
+| Google Vertex AI | `VERTEX_PROJECT` + `VERTEX_LOCATION` + GCP credentials | `google/gemini-2.5-flash` | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ | [Google Vertex AI](/providers/vertex) |
+| DeepSeek | `DEEPSEEK_API_KEY` | `deepseek-v4-pro` | ✅ | ✅ | ❌ | ❌ | ❌ | ✅ | [DeepSeek](/providers/deepseek) |
+| Groq | `GROQ_API_KEY` | `llama-3.3-70b-versatile` | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | — |
+| OpenRouter | `OPENROUTER_API_KEY` | `google/gemini-2.5-flash` | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | — |
+| Z.ai | `ZAI_API_KEY` (`ZAI_BASE_URL` optional) | `glm-5.1` | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ | — |
+| xAI (Grok) | `XAI_API_KEY` | `grok-4` | ✅ | ✅ | ✅ | ✅ | ✅ | ❌ | — |
+| Alibaba Cloud Model Studio (Bailian) | `BAILIAN_API_KEY` (`BAILIAN_BASE_URL` optional) | `qwen3-max` | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | [Alibaba Cloud Model Studio](/providers/bailian) |
+| MiniMax | `MINIMAX_API_KEY` (`MINIMAX_BASE_URL` optional) | `MiniMax-M3` | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ | — |
+| Xiaomi MiMo | `XIAOMI_API_KEY` (`XIAOMI_BASE_URL` optional) | `mimo-v2.5-pro` | ✅ | ✅ | ❌ | ❌ | ❌ | ✅ | [Xiaomi MiMo](/providers/xiaomi) |
+| OpenCode Go | `OPENCODE_GO_API_KEY` (`OPENCODE_GO_BASE_URL` optional) | `glm-5.1` | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ | [OpenCode Go](/providers/opencode-go) |
+| Azure OpenAI | `AZURE_API_KEY` + `AZURE_BASE_URL` (`AZURE_API_VERSION` optional) | `gpt-5` | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | [Azure OpenAI](/providers/azure) |
+| Oracle GenAI | `ORACLE_API_KEY` + `ORACLE_BASE_URL` | `openai.gpt-oss-120b` | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ | [Oracle GenAI](/providers/oracle) |
+| Ollama | `OLLAMA_BASE_URL` | `llama3.2` | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ | [Ollama](/providers/multiple-ollama) |
+| vLLM | `VLLM_BASE_URL` (`VLLM_API_KEY` optional) | `meta-llama/Llama-3.1-8B-Instruct` | ✅ | ✅ | ✅ | ❌ | ❌ | ✅ | [vLLM](/providers/vllm) |
+| Amazon Bedrock | `BEDROCK_BASE_URL` (region or endpoint) + AWS credentials | `anthropic.claude-3-5-haiku-20241022-v1:0` | ✅ | ✅ | ❌ | ❌ | ❌ | ❌ | [Amazon Bedrock](/providers/bedrock) |
+
+✅ Supported ❌ Unsupported
+
+## Provider notes
+
+- **Z.ai GLM Coding Plan** — set
+  `ZAI_BASE_URL=https://api.z.ai/api/coding/paas/v4`.
+- **Xiaomi MiMo** — TTS (`mimo-v2.5-tts*`) and ASR (`mimo-v2.5-asr`) are served
+  through `/v1/audio/speech` and `/v1/audio/transcriptions` (translated to
+  MiMo's chat-completions audio dialect) as well as directly via chat
+  completions; for 1M context append `[1m]` to the model ID and list it in
+  `XIAOMI_MODELS`.
+- **OpenCode Go (OpenCode Zen)** — routes per model: most models use
+  OpenAI-style `/chat/completions`, while `/messages`-only models (default
+  `qwen3.7-max`, override with `OPENCODE_GO_MESSAGES_MODELS`) are sent to the
+  Anthropic-native endpoint. Set `OPENCODE_GO_API_KEY`; the base URL defaults to
+  `https://opencode.ai/zen/go/v1`.
+- **Configured model lists** — available for every provider with
+  `<PROVIDER>_MODELS`, for example
+  `OPENROUTER_MODELS=openai/gpt-oss-120b,anthropic/claude-sonnet-4` or
+  `ORACLE_MODELS=openai.gpt-oss-120b,xai.grok-3`. By default,
+  `CONFIGURED_PROVIDER_MODELS_MODE=fallback` uses those lists only when upstream
+  `/models` is unavailable or empty. Set
+  `CONFIGURED_PROVIDER_MODELS_MODE=allowlist` to expose only configured models
+  for providers that define a list, skipping their upstream `/models` calls.
+- **DeepSeek** — defaults to `https://api.deepseek.com`; set `DEEPSEEK_BASE_URL`
+  only when using a compatible proxy or alternate DeepSeek endpoint.
+- **vLLM** — set `VLLM_API_KEY` only if the upstream server was started with
+  `--api-key`.
+- **Multiple instances of one provider type** — without `config.yaml`, use
+  suffixed env vars such as `OPENAI_EAST_API_KEY` and `OPENAI_EAST_BASE_URL`;
+  add `OPENAI_EAST_MODELS` to configure that instance's model list. This
+  registers provider `openai-east` with type `openai`. Vertex AI follows the
+  same suffix pattern — `VERTEX_US_PROJECT` registers provider `vertex-us`.
+  Vertex project and location env vars must share the same instance suffix: for
+  an instance such as `VERTEX_US_PROJECT`, also set `VERTEX_US_LOCATION`
+  and any other suffixed settings for that instance, rather than the generic
+  `VERTEX_PROJECT` / `VERTEX_LOCATION`. `VERTEX_AUTH_TYPE` defaults to
+  Application Default Credentials (`gcp_adc`).
 
 ## Why some providers have dedicated pages
 

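The `CONFIGURED_PROVIDER_MODELS_MODE` behavior described in the provider notes above can be sketched as a small decision function. This illustrates the documented rules only and is not GoModel's actual implementation: the names `parse_model_list`, `resolve_models`, and `fetch_upstream` are hypothetical, and treating any mode other than `allowlist` as `fallback` is an assumption of this sketch.

```python
from typing import Callable, List, Optional


def parse_model_list(raw: str) -> List[str]:
    """Split a comma-separated <PROVIDER>_MODELS value into model IDs."""
    return [item.strip() for item in raw.split(",") if item.strip()]


def resolve_models(
    mode: str,
    configured: List[str],
    fetch_upstream: Callable[[], Optional[List[str]]],
) -> List[str]:
    """Choose the model list to expose for one provider.

    mode           -- value of CONFIGURED_PROVIDER_MODELS_MODE
    configured     -- parsed <PROVIDER>_MODELS list (empty if unset)
    fetch_upstream -- hypothetical callable returning the upstream /models
                      list, or None when the call fails
    """
    if mode == "allowlist" and configured:
        # Allowlist: a provider that defines a list exposes only that list,
        # and the upstream /models call is skipped.
        return list(configured)
    upstream = fetch_upstream()
    if upstream:
        return list(upstream)
    # Fallback: upstream /models was unavailable or empty, so use the
    # configured list (which may itself be empty).
    return list(configured)
```

For example, `resolve_models("fallback", parse_model_list("openai.gpt-oss-120b,xai.grok-3"), lambda: None)` returns the two configured Oracle model IDs, because the upstream lookup reported a failure.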
diff --git a/docs/providers/xiaomi.mdx b/docs/providers/xiaomi.mdx
@@ -1,7 +1,7 @@
 ---
 title: "Xiaomi MiMo"
 description: "Configure Xiaomi MiMo in GoModel: thinking mode, the [1m] context suffix, and how TTS/ASR map onto the standard audio endpoints."
-icon: "microphone"
+icon: "mic"
 ---
 
 Xiaomi MiMo speaks an OpenAI-compatible chat API with a few dialect quirks: