Feature Request: WebUI add reasoning effort level on a per-message basis

### Prerequisites

- [x] I am running the latest code. Mention the version if possible as well.
- [x] I carefully followed the [README.md](https://github.com/ggml-org/llama.cpp/blob/master/README.md).
- [x] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
- [x] I reviewed the [Discussions](https://github.com/ggml-org/llama.cpp/discussions), and have a new and useful enhancement to share.

### Feature Description

I would like to propose a feature enhancement for our WebUI: the ability to dynamically change a model's reasoning effort level on a per-message basis at runtime.

Currently, for models like GPT OSS that support it, we can pass a reasoning_effort parameter (e.g., low, medium, high) via kwargs or directly in the API request. 

Proposed Solution:
Add a dedicated button or dropdown selector within the message input pane. This would allow users to select the reasoning effort level (e.g., Low, Medium, High, Off, On) for each new message they send, without needing to alter the overall session configuration.

Extended Use Case & Models:
This functionality would be highly valuable for other models that support similar runtime parameters, such as: Qwen3  ,NVIDIA Nemotron toggling reasoning. 

This feature would provide greater flexibility and control during conversations, enabling users to optimize for speed or depth as needed for each query.

### Motivation

It is already implemented in Cherry Studio and some other clients. 

<img width="987" height="514" alt="Image" src="https://github.com/user-attachments/assets/c4bee64a-6f04-4fe6-a1df-3f8405e17488" />

### Possible Implementation

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: WebUI add reasoning effort level on a per-message basis #18405

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Feature Request: WebUI add reasoning effort level on a per-message basis #18405

Description

Prerequisites

Feature Description

Motivation

Possible Implementation

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions