Providers only power the optional chat and planning interface. The core runtime, direct CLI tools, validation commands, and the first MCP slice do not require any remote model provider.
Master Control currently supports these planning providers:
- `auto`: prefer local Ollama, then OpenAI, then the local heuristic planner
- `openai`: use the OpenAI Responses API explicitly
- `ollama`: use the Ollama `/api/chat` endpoint with schema-constrained JSON output
- `heuristic`: local rules-only planner for offline bootstrap and tests
- `noop`: disabled provider that only returns static guidance
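Because selection is a single environment variable, a provider can be pinned per invocation. A minimal example using only the selection values above:

```
# Force the rules-only planner for one command; nothing leaves the host:
MC_PROVIDER=heuristic mc chat --once "mostre o uso de memoria"   # "show the memory usage"

# Disable planning entirely and get static guidance back:
MC_PROVIDER=noop mc chat --once "o host esta lento"              # "the host is slow"
```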
The planning layer is now explicit about turn state. A provider response must declare one of:
- `needs_tools`: one or more typed tool steps are required before the answer is complete
- `complete`: the current context is already sufficient and no more tools are needed
- `blocked`: the request cannot continue safely with the available tools
That decision now also carries a typed kind, for example:
- `refresh_required`
- `inspection_request`
- `diagnostic_step`
- `evidence_sufficient`
- `unsupported_request`
- `missing_safe_tool`
This is carried in `plan_decision` at the chat-interface boundary and recorded in `plan_generated` audit events.
After execution, MC also derives a `turn_decision` for the final payload and audit trail. This lets the operator distinguish planner intent from turn outcome, including cases such as:
- `awaiting_confirmation`
- `execution_failed`
- `evidence_sufficient`
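As a purely illustrative sketch (the field names are assumptions, not MC's exact payload schema), a single turn might therefore record:

```
{
  "plan_decision": { "status": "needs_tools", "kind": "refresh_required" },
  "turn_decision": { "kind": "awaiting_confirmation" }
}
```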
The final assistant message also consumes that `turn_decision`. This gives the operator deterministic next-step guidance even if the provider message is vague, for example:
- explicit confirmation commands when the turn is waiting on approval
- a clear `mc tools` hint when the runtime lacks the safe tool needed for the request
- a direct interruption note when execution failed before completion
- recommendation evidence summaries and next-step commands when the turn surfaces proactive guidance
The OpenAI integration uses the Responses API with a single required function tool named `submit_plan`.
Why this shape:
- the model returns a structured plan instead of free-form reasoning
- the plan is restricted to the currently registered MC tools
- execution still happens inside MC, through policy and audit
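For reference, a hand-rolled request with the same shape would look roughly like this; the plan schema shown is a simplified stand-in, since the real `submit_plan` parameter schema is internal to MC:

```
curl -s https://api.openai.com/v1/responses \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.4",
    "input": "o host esta lento",
    "tools": [{
      "type": "function",
      "name": "submit_plan",
      "description": "Submit a structured tool plan for this turn.",
      "parameters": {
        "type": "object",
        "properties": {
          "status": {"type": "string", "enum": ["needs_tools", "complete", "blocked"]},
          "steps": {"type": "array", "items": {"type": "object"}}
        },
        "required": ["status"]
      }
    }],
    "tool_choice": {"type": "function", "name": "submit_plan"}
  }'
```

Forcing `tool_choice` to the single function is what guarantees a structured plan instead of a free-form answer.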
The Ollama integration uses the `/api/chat` endpoint with a strict JSON schema in the `format` field.
Why this shape:
- the local model stays on the same planning contract as the other providers
- planning remains structured even without remote function-calling infrastructure
- execution still happens inside MC, through policy and audit
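A minimal hand-rolled equivalent, again with a simplified schema standing in for MC's real planning contract:

```
curl -s http://localhost:11434/api/chat -d '{
  "model": "qwen2.5:7b",
  "stream": false,
  "messages": [{"role": "user", "content": "o host esta lento"}],
  "format": {
    "type": "object",
    "properties": {
      "status": {"type": "string", "enum": ["needs_tools", "complete", "blocked"]},
      "steps": {"type": "array", "items": {"type": "object"}}
    },
    "required": ["status"]
  }
}'
```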
`mc doctor` probes the local Ollama endpoint through `/api/tags` and reports whether the configured model is already installed. The same probe is what drives `MC_PROVIDER=auto`.
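The probe can be reproduced by hand when diagnosing a failing `mc doctor` check:

```
# List locally installed models; the probe looks for the configured model here.
curl -s http://localhost:11434/api/tags
# A healthy response includes an entry whose "name" matches MC_OLLAMA_MODEL,
# e.g. "qwen2.5:7b".
```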
MC persists provider state per chat session. For providers that support it, the active session can store the last provider response id and reuse it on the next turn.
For OpenAI, this means the chat interface can pass `previous_response_id` when the same session is resumed.
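On the wire this is the standard Responses API chaining parameter, roughly as follows (the response id is a placeholder; chaining also requires the earlier response to have been stored, which is plausibly what `MC_OPENAI_STORE` governs):

```
curl -s https://api.openai.com/v1/responses \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.4",
    "previous_response_id": "resp_...",
    "input": "e agora me mostre a memoria"
  }'
```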
MC also persists recent conversation messages locally and passes them back to the provider when no remote response chain is available. This keeps follow-up requests useful across local and remote providers.
To avoid depending only on a short message window, MC also maintains a compact deterministic session summary and derives a first-class `SessionContext` before each planning or recommendation pass. That structured context carries tracked entities and recent findings such as:
- tracked unit or service
- tracked service scope
- tracked filesystem path
- last intent
- recent memory, disk, service, log, and process observations with freshness-backed state
All providers receive the compact summary text and the structured session context. The heuristic provider consumes the structured context directly for high-risk follow-ups and diagnostic summaries. The LLM providers receive the same structured context as an explicit planning hint, while execution still remains local.
The summary remains the compact carry-forward artifact, but the intended steady state is for summary text to stay useful for rendering and debugging, not to be the primary safety boundary.
MC also persists session-scoped observations with TTL metadata. Before each planning pass, the runtime computes freshness for the latest host observations and the chat interface passes that to the active provider. This gives all planners the same rule:
- fresh observations may be summarized and reused
- stale observations should be refreshed through the matching typed tool before the planner relies on them
The local heuristic planner already uses this actively for performance diagnosis, so repeated requests such as "o host esta lento" ("the host is slow") can refresh memory, processes, or service state when the previous observation window expired.
With the closeout utility additions, that same planner can also use a dedicated `process_to_unit` step before `service_status` when it needs typed evidence for a process -> systemd relationship.
For service-oriented follow-ups, the current trust rule is explicit:
- performance diagnosis may refresh service state only when a service target is explicit in the request or already tracked in session context
- hot-process observations alone do not create executable service recommendations
- tracked `scope=user|system` is part of service identity and must survive follow-up, recommendation, and execution flows
Operators can inspect the same freshness state directly through:
```
mc observations --session-id <id>
mc observations --session-id <id> --stale-only
```
Recommendations also consume this same freshness state together with the structured session context. If an alert depends on stale service, process, memory, or disk data, MC now prefers a refresh-oriented recommendation such as `service_status` or `top_processes` before it suggests a riskier follow-up action.
The recommendation listing also exposes that confidence explicitly, so operators can inspect whether an item is backed by a fresh signal, a stale signal, or no current observation.
That same confidence now affects ordering: recommendations backed by fresh signals are shown before stale ones in both `mc recommendations` and the chat-side session highlights.
When a recommendation is actionable, the operator now also sees the next safe command directly in both CLI and chat-oriented render paths.
Operators can also reconcile that queue explicitly without sending a new natural-language request:
```
mc reconcile --session-id <id>
mc reconcile --all
```
This recomputes insights and recommendation state from the persisted summary, current observations, and the derived structured session context, and records an audit event for the reconciliation pass.
MC also persists those suggestions as explicit session recommendations, so follow-up operations can track recommendation lifecycle instead of recomputing meaning from raw chat alone.
When a recommendation includes an executable action, that action remains provider-independent metadata. The provider proposes observations and plans; MC decides whether a recommendation can expose a typed action such as `restart_service`, and any execution still goes through local policy, confirmation, and audit.
For service recommendations, executable actions now require explicit service evidence from the current request, tracked context, or a matching service observation. MC does not expose a restart action from a hot-process signal alone.
The local heuristic provider also supports a small set of approval-gated actions directly, such as `restart_service`, `reload_service`, and safe reads of managed config paths. Those plans still execute through the same local confirmation boundary as every other tool.
The chat interface can also call a provider multiple times inside one user turn. Earlier tool results are summarized and reassembled into structured session context so the next planning pass can continue the diagnosis or stop and summarize.
For `openai` and `ollama`, MC now also performs a dedicated final response synthesis step after tool execution. This keeps the planning contract strict while still letting the active model produce the operator-facing explanation from the actual observed results.
That synthesis layer follows the same safety rules:
- it receives only the executed-tool evidence and local rendered summaries
- it cannot trigger tools directly
- if synthesis fails, MC falls back to the local deterministic rendering path and records the provider error in audit
Core selection:
- `MC_PROVIDER=auto|openai|ollama|heuristic|noop`
- `MC_PROVIDER_PROBE_TIMEOUT_S` defaults to `0.75`
OpenAI credentials and endpoint:
- `OPENAI_API_KEY`
- `OPENAI_BASE_URL` defaults to `https://api.openai.com/v1`
- `OPENAI_ORGANIZATION` optional
- `OPENAI_PROJECT` optional
OpenAI behavior:
- `MC_OPENAI_MODEL` defaults to `gpt-5.4`
- `MC_OPENAI_REASONING_EFFORT` defaults to `none`
- `MC_OPENAI_TIMEOUT_S` defaults to `20`
- `MC_OPENAI_STORE` defaults to `false`
Ollama behavior:
- `MC_OLLAMA_BASE_URL` defaults to `http://localhost:11434/api`
- `MC_OLLAMA_MODEL` defaults to `qwen2.5:7b`
- `MC_OLLAMA_TIMEOUT_S` defaults to `60`
- `MC_OLLAMA_KEEP_ALIVE` optional
- `OLLAMA_API_KEY` optional
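As a tuning example, a slower local host might raise the request timeout and keep the model resident between turns (assuming `MC_OLLAMA_KEEP_ALIVE` is passed through as Ollama's `keep_alive` duration):

```
export MC_OLLAMA_TIMEOUT_S=120
export MC_OLLAMA_KEEP_ALIVE=10m
```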
Explicit OpenAI setup:

```
export OPENAI_API_KEY=...
export MC_PROVIDER=openai
mc doctor
mc chat --new-session --once "o host esta lento"              # "the host is slow"
mc sessions --limit 5
mc chat --session-id 1 --once "e agora me mostre a memoria"   # "and now show me the memory"
```

If you want automatic local fallback when no key is configured:
```
export MC_PROVIDER=auto
mc doctor
```

When `MC_PROVIDER=auto`, MC resolves the backend in this order:

1. `ollama` if the local endpoint is reachable and the configured model is installed
2. `openai` if `OPENAI_API_KEY` is configured
3. `heuristic` otherwise
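The probe budget is tunable. A known-fast local endpoint can use a tighter value than the `0.75` default (assuming the `_S` suffix means seconds):

```
export MC_PROVIDER_PROBE_TIMEOUT_S=0.25
mc doctor
```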
Direct local planning with Ollama:
```
ollama serve
ollama pull qwen2.5:7b
export MC_PROVIDER=ollama
export MC_OLLAMA_MODEL=qwen2.5:7b
mc doctor
mc chat --once "o host esta lento"   # "the host is slow"
```

If the local server is listening on a different port or host, point MC to it explicitly:
```
export MC_PROVIDER=ollama
export MC_OLLAMA_BASE_URL=http://127.0.0.1:11435/api
export MC_OLLAMA_MODEL=qwen2.5:7b
mc doctor
mc chat --once "mostre o uso de memoria"   # "show the memory usage"
```

One operational finding from the current alpha work: MC's Ollama integration has now been validated against a real local endpoint and installed model inventory, not only mocked transports. The current alpha default is `qwen2.5:7b`, but validated host setups can still override it through `MC_OLLAMA_MODEL`.