diff --git a/openhands/usage/llms/llms.mdx b/openhands/usage/llms/llms.mdx
index f9da5fd4..b2270bb4 100644
--- a/openhands/usage/llms/llms.mdx
+++ b/openhands/usage/llms/llms.mdx
@@ -29,7 +29,7 @@ then switch back to a stronger model for planning, debugging, and review.
 |--------|-------------------|--------------|-------------------------|
 | Claude | [claude-opus-4-8](https://github.com/OpenHands/openhands-index-results/tree/main/results/claude-opus-4-8) | Not yet listed | 71.9 |
 | GPT | [GPT-5.5](https://github.com/OpenHands/openhands-index-results/tree/main/results/GPT-5.5) | `openai/gpt-5.5` | 65.9 |
-| Gemini | [Gemini-3.1-Pro](https://github.com/OpenHands/openhands-index-results/tree/main/results/Gemini-3.1-Pro) | `gemini/gemini-3.1-pro-preview` | 57.0 |
+| Gemini | [Gemini-3.5-Flash](https://github.com/OpenHands/openhands-index-results/tree/main/results/Gemini-3.5-Flash) | Not yet listed | 62.6 |
 
 ### Strong Open / Open-Weight Models
 
@@ -38,10 +38,10 @@ These open or open-weight models have good OpenHands Index scores or are recomme
 | Model | Suggested Model String | OpenHands Index Average |
 |-------|------------------------|-------------------------|
 | [GLM-5.1](https://github.com/OpenHands/openhands-index-results/tree/main/results/GLM-5.1) | `openrouter/z-ai/glm-5.1` | 58.2 |
+| [MiniMax-M3](https://github.com/OpenHands/openhands-index-results/tree/main/results/MiniMax-M3) | `openrouter/minimax/minimax-m3` | 57.2 |
 | [Kimi-K2.6](https://github.com/OpenHands/openhands-index-results/tree/main/results/Kimi-K2.6) | `openrouter/moonshotai/kimi-k2.6` | 57.1 |
 | [GLM-5](https://github.com/OpenHands/openhands-index-results/tree/main/results/GLM-5) | `openrouter/z-ai/glm-5` | 49.4 |
 | [Kimi-K2.5](https://github.com/OpenHands/openhands-index-results/tree/main/results/Kimi-K2.5) | `openrouter/moonshotai/kimi-k2.5` | 49.2 |
-| [DeepSeek-V3.2-Reasoner](https://github.com/OpenHands/openhands-index-results/tree/main/results/DeepSeek-V3.2-Reasoner) | `openrouter/deepseek/deepseek-v3.2-reasoner` | 45.7 |
 
 Hosted model strings can vary by provider and region. If a model string is not accepted, check the provider console and