From 3e672fe6cecc53ecc1e18f88c851af7bf0489c04 Mon Sep 17 00:00:00 2001
From: Matthew Betancourt <mattbeta1@gmail.com>
Date: Sun, 7 Jun 2026 16:57:46 -0400
Subject: [PATCH 1/3] feat(subagents): propose Model parameter for
 invoke_subagent / define_subagent
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Add a proposal document describing the desired API changes to support
per-subagent model selection in invoke_subagent and define_subagent.

Motivation:
- Cost-optimised orchestration (orchestrator on pro, subagents on flash)
- Capability-differentiated pipelines (extended thinking for specific tasks)
- Model benchmarking within a single session

The proposal covers:
- Schema diffs for both invoke_subagent and define_subagent
- Override chain semantics (session → define_subagent.Model → invoke_subagent.Model)
- Behaviour spec table for all edge cases
- Backwards compatibility (Model is optional, no regressions)
- Example JSON for concurrent mixed-model subagent invocation
---
 docs/proposals/subagent-model-selection.md | 150 +++++++++++++++++++++
 1 file changed, 150 insertions(+)
 create mode 100644 docs/proposals/subagent-model-selection.md

diff --git a/docs/proposals/subagent-model-selection.md b/docs/proposals/subagent-model-selection.md
new file mode 100644
index 000000000..6cb0bf2a1
--- /dev/null
+++ b/docs/proposals/subagent-model-selection.md
@@ -0,0 +1,150 @@
+# Proposal: Per-Subagent Model Selection
+
+**Status:** Draft  
+**Author:** MattBetancourt  
+**Related issue:** (link after opening)
+
+---
+
+## Summary
+
+Add an optional `Model` string parameter to the `invoke_subagent` and
+`define_subagent` tools, allowing callers to specify which model a subagent
+should run on rather than always inheriting the session-level model.
+
+---
+
+## Motivation
+
+Since v1.0.5, users can choose a model at session start (`--model`) or switch
+it mid-session (`/model`). Subagents, however, have no equivalent — they always
+inherit whatever model the parent session is currently using.
+
+This creates friction for several practical agent architectures:
+
+1. **Cost-optimised orchestration.** An orchestrator reasonably runs on a
+   capable model (e.g. `gemini-2.5-pro`), but research and file-scanning
+   subagents only need a fast, cheap model (e.g. `gemini-2.0-flash`). There is
+   currently no way to express this split without the user manually switching
+   models between invocations.
+
+2. **Capability-differentiated pipelines.** Some subtasks benefit from extended
+   thinking (e.g. adversarial security review, complex planning). Spawning one
+   subagent with `claude-sonnet-4-6-thinking` while keeping others on a standard
+   model requires either separate sessions or awkward workarounds.
+
+3. **Model benchmarking.** Running the same prompt against two subagents using
+   different models — for comparison or quality-assurance purposes — is
+   impossible today in a single session.
+
+---
+
+## Proposed API Changes
+
+### `invoke_subagent`
+
+Add an optional `Model` field (same identifiers accepted by `--model` / `/model`):
+
+```diff
+ {
+   "TypeName": "string (required)",
+   "Role": "string (required)",
+   "Prompt": "string (required)",
+-  "Workspace": "inherit | branch | share  (optional, default: inherit)"
++  "Workspace": "inherit | branch | share  (optional, default: inherit)",
++  "Model": "string (optional) — model identifier, e.g. \"gemini-2.0-flash\""
+ }
+```
+
+### `define_subagent`
+
+Add an optional `Model` field that sets the default model for all invocations of
+that subagent type:
+
+```diff
+ {
+   "name": "string (required)",
+   "description": "string (required)",
+   "system_prompt": "string (required)",
+   "enable_mcp_tools": "bool (optional)",
+   "enable_subagent_tools": "bool (optional)",
+-  "enable_write_tools": "bool (optional)"
++  "enable_write_tools": "bool (optional)",
++  "Model": "string (optional) — default model for this subagent type"
+ }
+```
+
+---
+
+## Behaviour Spec
+
+| Scenario | Expected result |
+|---|---|
+| `Model` omitted from both `define_subagent` and `invoke_subagent` | Subagent inherits the parent session's current model (no regression) |
+| `Model` set only in `define_subagent` | All invocations of that type use the specified model |
+| `Model` set only in `invoke_subagent` | That specific invocation uses the specified model |
+| `Model` set in both | `invoke_subagent.Model` takes precedence over `define_subagent.Model` |
+| Invalid/unavailable `Model` value | Fail-fast with a clear error before the subagent starts |
+| Parent session model changes mid-conversation | Subagents without an explicit `Model` inherit the updated session model |
+| `self`-type subagent with `Model` set | Spawns with the specified model, not the parent's |
+
+**Override chain (lowest → highest precedence):**
+
+```
+session default  →  define_subagent.Model  →  invoke_subagent.Model
+```
+
+---
+
+## Example Usage
+
+```jsonc
+// Orchestrator (running on gemini-2.5-pro) spawns two concurrent subagents:
+// one cheap and fast, one with extended thinking.
+{
+  "Subagents": [
+    {
+      "TypeName": "research",
+      "Role": "Quick File Scanner",
+      "Prompt": "List all Python files that import the requests library",
+      "Workspace": "inherit",
+      "Model": "gemini-2.0-flash"
+    },
+    {
+      "TypeName": "self",
+      "Role": "Security Reviewer",
+      "Prompt": "Perform an adversarial OWASP-style security review of the auth module",
+      "Workspace": "branch",
+      "Model": "claude-sonnet-4-6-thinking"
+    }
+  ]
+}
+```
+
+---
+
+## Statusline
+
+No changes are needed to the statusline schema. The existing
+`.model.display_name` field in the statusline payload is already populated
+per-conversation. Subagent conversations will naturally reflect their own model
+in that field when the feature is implemented.
+
+---
+
+## Backwards Compatibility
+
+`Model` is entirely optional. All existing `invoke_subagent` and
+`define_subagent` calls without it continue to behave exactly as today. There
+are no breaking changes.
+
+---
+
+## Out of Scope (follow-up candidates)
+
+- **Capability validation**: checking that the chosen model supports tool-calling
+  before assigning it to a subagent with `enable_write_tools: true`.
+- **TUI surface**: a picker UI for subagent model selection inside the
+  interactive session.
+- **Cost estimation**: pre-launch cost hints when a non-default model is
+  requested.

From f8bec90fd69b90df09aaa4f2770b6fb97f3d054d Mon Sep 17 00:00:00 2001
From: Matthew Betancourt <mattbeta1@gmail.com>
Date: Sun, 7 Jun 2026 17:12:31 -0400
Subject: [PATCH 2/3] docs: update example to use real model identifiers from
 agy models

---
 docs/proposals/subagent-model-selection.md | 8 ++++----
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git a/docs/proposals/subagent-model-selection.md b/docs/proposals/subagent-model-selection.md
index 6cb0bf2a1..b47f31c59 100644
--- a/docs/proposals/subagent-model-selection.md
+++ b/docs/proposals/subagent-model-selection.md
@@ -99,8 +99,8 @@ session default  →  define_subagent.Model  →  invoke_subagent.Model
 ## Example Usage
 
 ```jsonc
-// Orchestrator (running on gemini-2.5-pro) spawns two concurrent subagents:
-// one cheap and fast, one with extended thinking.
+// Orchestrator (running on Gemini 3.1 Pro) spawns two concurrent subagents:
+// one fast and cheap, one with extended thinking for security analysis.
 {
   "Subagents": [
     {
@@ -108,14 +108,14 @@ session default  →  define_subagent.Model  →  invoke_subagent.Model
       "Role": "Quick File Scanner",
       "Prompt": "List all Python files that import the requests library",
       "Workspace": "inherit",
-      "Model": "gemini-2.0-flash"
+      "Model": "Gemini 3.5 Flash (Low)"
     },
     {
       "TypeName": "self",
       "Role": "Security Reviewer",
       "Prompt": "Perform an adversarial OWASP-style security review of the auth module",
       "Workspace": "branch",
-      "Model": "claude-sonnet-4-6-thinking"
+      "Model": "Claude Sonnet 4.6 (Thinking)"
     }
   ]
 }

From a467f423a2c211b394fe6365fd71361fbfe5b932 Mon Sep 17 00:00:00 2001
From: Matthew Betancourt <mattbeta1@gmail.com>
Date: Sun, 7 Jun 2026 18:22:39 -0400
Subject: [PATCH 3/3] docs: address adversarial PR review feedback

---
 docs/proposals/subagent-model-selection.md | 22 +++++++++++++++-------
 1 file changed, 15 insertions(+), 7 deletions(-)

diff --git a/docs/proposals/subagent-model-selection.md b/docs/proposals/subagent-model-selection.md
index b47f31c59..f015117af 100644
--- a/docs/proposals/subagent-model-selection.md
+++ b/docs/proposals/subagent-model-selection.md
@@ -52,7 +52,7 @@ Add an optional `Model` field (same identifiers accepted by `--model` / `/model`
    "Prompt": "string (required)",
 -  "Workspace": "inherit | branch | share  (optional, default: inherit)"
 +  "Workspace": "inherit | branch | share  (optional, default: inherit)",
-+  "Model": "string (optional) — model identifier, e.g. \"gemini-2.0-flash\""
++  "Model": "string (optional) — model identifier (e.g. \"gemini-2.0-flash\") or \"inherit\""
  }
 ```
 
@@ -84,8 +84,9 @@ that subagent type:
 | `Model` set only in `define_subagent` | All invocations of that type use the specified model |
 | `Model` set only in `invoke_subagent` | That specific invocation uses the specified model |
 | `Model` set in both | `invoke_subagent.Model` takes precedence over `define_subagent.Model` |
-| Invalid/unavailable `Model` value | Fail-fast with a clear error before the subagent starts |
-| Parent session model changes mid-conversation | Subagents without an explicit `Model` inherit the updated session model |
+| `Model` set to `"inherit"` in `invoke_subagent` | Forcefully bypasses any default set by `define_subagent` and falls back to the parent session model |
+| Invalid/unavailable `Model` value | Fail-fast with a strict allowlist validation before the subagent starts |
+| Parent session model changes mid-conversation | Only *newly spawned* subagents inherit the updated session model; already running subagents are unaffected |
 | `self`-type subagent with `Model` set | Spawns with the specified model, not the parent's |
 
 **Override chain (lowest → highest precedence):**
@@ -96,6 +97,15 @@ session default  →  define_subagent.Model  →  invoke_subagent.Model
 
 ---
 
+## Security & Validation Requirements
+
+1. **Boundary Enforcement**: If the parent session uses a local or offline model, spawning a cloud-based model via `Model` must be blocked or require explicit user confirmation to prevent data exfiltration.
+2. **Financial Controls**: Escalating to a model tier higher than the session default must enforce cost quotas, rate limits, or trigger user approval to prevent financial exhaustion.
+3. **Strict Allowlisting**: The `Model` parameter must be strictly validated against an allowlist of valid model identifier slugs. Invalid slugs must fail fast before the subagent starts.
+4. **Capability Validation**: The system must validate that the requested model supports tool-calling *before* launching the subagent with `enable_write_tools: true`.
+
+---
+
 ## Example Usage
 
 ```jsonc
@@ -108,14 +118,14 @@ session default  →  define_subagent.Model  →  invoke_subagent.Model
       "Role": "Quick File Scanner",
       "Prompt": "List all Python files that import the requests library",
       "Workspace": "inherit",
-      "Model": "Gemini 3.5 Flash (Low)"
+      "Model": "gemini-3.5-flash-low"
     },
     {
       "TypeName": "self",
       "Role": "Security Reviewer",
       "Prompt": "Perform an adversarial OWASP-style security review of the auth module",
       "Workspace": "branch",
-      "Model": "Claude Sonnet 4.6 (Thinking)"
+      "Model": "claude-sonnet-4-6-thinking"
     }
   ]
 }
@@ -142,8 +152,6 @@ are no breaking changes.
 
 ## Out of Scope (follow-up candidates)
 
-- **Capability validation**: checking that the chosen model supports tool-calling
-  before assigning it to a subagent with `enable_write_tools: true`.
 - **TUI surface**: a picker UI for subagent model selection inside the
   interactive session.
 - **Cost estimation**: pre-launch cost hints when a non-default model is