Sandbox tool views, llamacpp /props context, structured worker errors by andersonleal · Pull Request #190 · iii-hq/workers

andersonleal · 2026-05-26T16:39:58Z

Summary

Console: new chat/sandbox/ module renders sandbox::* tool calls as rich, tool-specific views (exec, run, stop, list, create, and 10 fs.* operations) with ANSI terminal output, code highlighting, and a unified error view. FunctionCallMessage shows a sandbox preview while a call is pending/running, and switches to terminal/raw-json tabs once it completes.
Harness (llamacpp): discovery now reads n_ctx from the server-wide GET /props endpoint to populate context_window, with graceful fallback to the embedded default on offline / non-2xx / malformed responses.
Harness (agent-trigger): structured worker error envelopes (S-code, docs_url, fix, retryable) are extracted from IIIInvocationError and forwarded via errorResult instead of being buried in gate_unavailable.reason — agents now see the full error contract from sandbox::* handlers.

Test plan

cd console/web && pnpm test — unit tests including new sandbox/__tests__/parsers.test.ts pass
cd harness && pnpm test — covers updated provider-llamacpp/discover.test.ts and turn-orchestrator/agent-trigger.test.ts
Console Examples page: every sandbox tool fixture renders without errors (pending preview + completed terminal/json tabs)
Live llama-server: discovery surfaces real n_ctx instead of the fallback default
Trigger a sandbox::* handler error in chat: agent receives structured envelope with code + docs_url, not a gate_unavailable wrapper

Summary by CodeRabbit

Release Notes

New Features
- Added comprehensive sandbox tool UI with support for exec, run, create, stop, list, and filesystem operations (ls, stat, read, write, mkdir, rm, mv, chmod, grep, sed).
- Added syntax highlighting for code display in sandbox operations.
- Added tabbed view for terminal output and raw JSON in sandbox function calls.
- Enabled auto-accept for sandbox commands via allow-list policy support.
Improvements
- Enhanced model discovery for llama.cpp providers with context window detection.
- Improved error handling for function invocation failures.

Console: - New chat/sandbox module: parsers for the sandbox::* wire shapes plus per-tool renderers (exec, run, stop, list, create, fs.{read,write, grep,ls,mkdir,mv,rm,sed,stat,chmod}), an ANSI terminal output, code highlight pane, and a unified error view. - FunctionCallMessage shows a sandbox preview while pending/running and switches to terminal/raw-json tabs once the call completes. - Examples + Playground get sandbox fixtures and scenario updates so the new views can be exercised in isolation. Harness: - llamacpp discovery now reads n_ctx from the server-wide /props endpoint to populate context_window, falling back to the embedded default on failure / non-2xx / malformed JSON. - turn-orchestrator agent-trigger extracts and forwards structured worker error envelopes (S-code, docs_url, fix, retryable) intact via errorResult instead of burying them under gate_unavailable.reason. Misc: small refactors in providers, auto-accept, conversations context, trace transform, and assorted UI cleanups.

vercel · 2026-05-26T16:40:05Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
workers	Ready	Preview, Comment	May 26, 2026 4:48pm

coderabbitai · 2026-05-26T16:40:17Z

Warning

Review limit reached

@andersonleal, we couldn't start this review because you've reached your PR review rate limit.

More reviews will be available in 47 minutes and 45 seconds. Learn how PR review limits work.

Your organization has run out of usage credits. Purchase more in the billing tab.

⌛ How to resolve this issue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans include higher PR review limits than trial, open-source, and free plans. In all cases, reviews become available again over time. During sustained high-volume PR review activity, CodeRabbit may temporarily slow when the next review becomes available.

Please see our Fair Usage Limits Policy for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 52388e44-5c14-4a3e-b0c8-cf4d69b42bf5

📥 Commits

Reviewing files that changed from the base of the PR and between 75a2f56 and 8504a25.

📒 Files selected for processing (1)

harness/tests/provider-llamacpp/discover.test.ts

📝 Walkthrough

Walkthrough

This PR introduces a complete sandbox tool rendering system with schema-driven validation, comprehensive error handling, and terminal UI components for all sandbox operations (exec, run, create, stop, list, filesystem utilities). It also enhances the auto-accept policy with allow-listing before deny checking, improves llama.cpp discovery with /props endpoint support, and adds structured handler error extraction to the harness.

Changes

Sandbox Tool UI System

Layer / File(s)	Summary
Sandbox request/response schemas and parsers `console/web/src/components/chat/sandbox/parsers.ts`	Zod schemas for sandbox lifecycle (exec, run, create, stop, list) and filesystem (ls, stat, mkdir, read, write, rm, mv, chmod, grep, sed) operations; error wire/invocation formats; envelope unwrapping and safe parsing helpers with type guards.
Formatting utilities for sandbox display `console/web/src/components/chat/sandbox/format.ts`	Byte/time/mode formatting, POSIX shell arg quoting, language inference from file paths, stream channel detection, and environment variable normalization for use in sandbox renderers.
Sandbox error view and handling `console/web/src/components/chat/sandbox/ErrorView.tsx`	ErrorView renders wire-style errors; InvocationErrorView renders invocation/denial errors; SandboxErrorView switches between them; extracts and displays stdout/stderr from error payloads when available.
Terminal UI primitives `console/web/src/components/chat/sandbox/terminal/Terminal.tsx`, `console/web/src/components/chat/sandbox/terminal/AnsiOutput.tsx`	Terminal wraps command, chips, running state, and footer; Chip renders mono header pills; FooterPill renders Badge footer; AnsiOutput strips ANSI codes and renders stdout/stderr with tone-aware styling.
Shared sandbox UI components `console/web/src/components/chat/sandbox/shared.tsx`	Chip, MetaRow, StatusPill, ActionLine provide styled wrappers for sandbox content display with tone-aware styling and consistent spacing.
Sandbox view components for all operations `console/web/src/components/chat/sandbox/ExecView.tsx`, `RunView.tsx`, `CreateView.tsx`, `StopView.tsx`, `ListView.tsx`, `Fs*View.tsx`, `CodeHighlight.tsx`	ExecView and RunView render terminal-style command execution with output and footer; CreateView, StopView, ListView render lifecycle operations; FsChmodView through FsWriteView render filesystem operations with request/response data; CodeHighlight renders Prism-highlighted code.
Sandbox tool dispatcher and integration `console/web/src/components/chat/sandbox/index.tsx`	Exports SandboxToolView with isSandboxFunction, SandboxFunctionIdLabel, tryRender, and tryRenderPreview; gates rendering on function ID allowlist and approval status; dispatches to tool-specific view components.
FunctionCallMessage sandbox integration `console/web/src/components/chat/FunctionCallMessage.tsx`	Adds tab UI for terminal vs JSON view when sandbox terminal output is available; uses SandboxFunctionIdLabel for ID display; manages showRequestPaneAbove logic to position request pane conditionally.
Sandbox parser validation tests `console/web/src/components/chat/sandbox/__tests__/parsers.test.ts`	Vitest suite validates unwrapEnvelope, sandboxErrorWireSchema, parseSandboxErrorDisplay, extractFirstJsonObject, and request/response schemas for all sandbox operations.

Auto-Accept Allow-Listing Enhancement

Layer / File(s)	Summary
AutoAcceptPolicy allow-lists and defaults `console/web/src/lib/backend/auto-accept-policy.ts`	AutoAcceptPolicy interface gains allowPatterns and allowExact fields; DEFAULT_ALLOW_PATTERNS and DEFAULT_ALLOW_EXACT whitelist the sandbox:: namespace to allow auto-acceptance.
Allow-first precedence in isAutoAcceptable `console/web/src/lib/backend/auto-accept-policy.ts`, `auto-accept-policy.test.ts`	isAutoAcceptable now evaluates in order: allowPatterns → allowExact → denyExact → denyPatterns; allows override denies; tests verify precedence and default sandbox acceptance.
use-auto-accept-approvals callback refs `console/web/src/hooks/use-auto-accept-approvals.ts`	Stores onAccepted/onDenied in refs and syncs on each render; removes them from effect dependencies to prevent effect re-runs triggered by callback identity changes; invokes current callbacks via refs.

Syntax Highlighting Enhancement

Layer / File(s)	Summary
Prism language registration and CodeHighlight `console/web/src/lib/syntax.tsx`	Registers Python and Bash grammars on shared Prism instance with custom token patterns (guarded to avoid re-registration); exports CodeHighlight component for caller-provided language syntax highlighting.

Example and Fixture System

Layer / File(s)	Summary
Sandbox tool example fixtures `console/web/src/pages/Examples/sections/sandbox-fixtures.ts`	wrapHarness and base factory create standardized FunctionCallMessage fixtures; exports prebuilt scenarios for exec/run/create/stop/list, filesystem operations, and error cases.
Message variants with sandbox examples `console/web/src/pages/Examples/sections/message-variants.tsx`	Imports and renders sandboxFixtures as VariantCard examples for visual testing of all sandbox function IDs.

Harness Provider and Error Handling

Layer / File(s)	Summary
llama.cpp /props endpoint discovery `harness/src/provider-llamacpp/discover.ts`	propsUrl derives /props URL from chat URL; fetchPropsContextWindow fetches n_ctx from /props endpoint or default_generation_settings; discoverLoadedModel concurrently resolves context_window for all discovered models.
Structured handler error extraction `harness/src/turn-orchestrator/agent-trigger.ts`	extractStructuredHandlerError parses S-code JSON envelope from IIIInvocationError message; triggerFunctionCall forwards extracted payload as handler_error instead of gate_unavailable.
Harness discovery and error handling tests `harness/tests/provider-llamacpp/discover.test.ts`, `agent-trigger.test.ts`	Tests validate propsUrl behavior, context_window extraction across /props payload shapes, and structured handler error forwarding precedence.

🐇 Hoppy hop through the sandbox, parsing schemas with care,
Tabs and formatters blooming, error handling everywhere,
Allow-lists dance with deny rules, each function gets its say,
llama.cpp whispers context secrets in the /props endpoint's way!

🎯 3 (Moderate) | ⏱️ ~20 minutes

iii-hq/workers#184: Related auto-accept and stop-reason handling changes in use-auto-accept-approvals and ChatView that align with the auto-accept enhancement in this PR.

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch chat-sandbox-views

github-actions · 2026-05-26T16:40:18Z

skill-check — worker

1 verified, 11 skipped (no docs/).

1 error across the verified workers.

File	Approximate line	Severity	Violation
`shell/docs/leaves/kill.md`	~7	error	[Terminology.SlopMarketing] Avoid marketing/anthropomorphic phrasing 'Reaping'. Use concrete, technical language. (error)

The new /props test introduced a double-quoted string and a multi-line new Response(...) call that biome formats differently. biome ci fails on the formatting diff. No behavior change.

coderabbitai

Actionable comments posted: 9

🧹 Nitpick comments (1)

console/web/src/components/chat/FunctionCallMessage.tsx (1)

96-103: ⚡ Quick win

Avoid eager sandbox rendering when the row is collapsed.

Line 96 and Line 97 compute sandbox preview/terminal unconditionally. This does avoidable parsing/render work for closed rows in long chat lists. Gate these calls by open (or compute inside the open block) so hidden messages stay cheap.

♻️ Suggested change

-  const sandboxPreview = SandboxToolView.tryRenderPreview(message)
-  const sandboxTerminal = !pending ? SandboxToolView.tryRender(message) : null
+  const sandboxPreview =
+    open && pending ? SandboxToolView.tryRenderPreview(message) : null
+  const sandboxTerminal =
+    open && !pending ? SandboxToolView.tryRender(message) : null

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@console/web/src/components/chat/FunctionCallMessage.tsx` around lines 96 -
103, The component eagerly calls SandboxToolView.tryRenderPreview and
SandboxToolView.tryRender for every message (creating sandboxPreview,
sandboxTerminal, hasSandboxTerminal), which does unnecessary work when the row
is collapsed; change this so those calls are only executed when the row is open
(use the open prop) or move their computation inside the block that renders the
expanded view, then recompute hasSandboxTerminal and showRequestPaneAbove based
on those lazy values so collapsed rows avoid parsing/render work.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@console/web/src/components/chat/sandbox/ErrorView.tsx`:
- Around line 71-80: The external link rendering in ErrorView (component
ErrorView.tsx) uses error.docs_url directly; validate that the URL's protocol is
either "http:" or "https:" before rendering the <a> tag to avoid unsafe
protocols. Update the rendering logic that checks error.docs_url to parse or
test the value (e.g., new URL(...) or a startsWith/protocol check) and only
return the anchor when the protocol is http/https; otherwise render null or a
safe fallback, preserving the existing target/rel attributes.

In `@console/web/src/components/chat/sandbox/format.ts`:
- Around line 193-201: isStreamChannelRef currently only validates channel_id
but its type guard promises channel_id, access_key, and direction; update
isStreamChannelRef to also check that value is a non-null object and that
'access_key' and 'direction' are in value and are strings (e.g., add
"'access_key' in value && typeof (value as { access_key: unknown }).access_key
=== 'string' && 'direction' in value && typeof (value as { direction: unknown
}).direction === 'string'") so the narrowed type truly matches the runtime
validation.

In `@console/web/src/components/chat/sandbox/FsLsView.tsx`:
- Line 54: The mode prefix currently uses `${e.is_dir ? 'd' : '-'}` which treats
symlinks as regular files; change the prefix logic in FsLsView (where the
template uses e.is_dir and formatMode) to check e.is_symlink first and emit 'l'
for symlinks, then 'd' for directories, otherwise '-' (e.g. if e.is_symlink ?
'l' : e.is_dir ? 'd' : '-'), leaving formatMode(e.mode) unchanged.

In `@console/web/src/components/chat/sandbox/FsSedView.tsx`:
- Around line 69-75: The TooltipTrigger currently wraps a non-focusable <span>,
preventing keyboard users from opening the tooltip; update the trigger so it is
keyboard-accessible by replacing or modifying the wrapped element referenced by
TooltipTrigger (the inline span around TriangleAlert and "err") to be focusable
and semantics-correct — e.g., use a real button element or add tabIndex=0 plus
role="button" and an accessible name/aria-describedby that references the
tooltip content (r.error) so keyboard users can focus and invoke the tooltip via
TooltipTrigger.

In `@console/web/src/components/chat/sandbox/parsers.ts`:
- Around line 566-576: parseSandboxErrorDisplay currently only inspects
value.error for a function_error envelope so wrapped payloads (e.g., {content,
details}) are skipped; update the parsing to also unwrap common wrappers before
calling functionErrorEnvelopeSchema.safeParse. Specifically, when encountering
an object in the branch using functionErrorEnvelopeSchema.safeParse, check for
nested candidates like value.content, value.details, and value.content?.error or
value.details?.error (falling back to the original value.error), then run
functionErrorEnvelopeSchema.safeParse on that unwrapped object and continue to
use invocationFromFunctionError and the existing { variant: 'invocation', error:
invocation } return path if parsed.success and kind === 'function_error'.

In `@console/web/src/components/chat/sandbox/RunView.tsx`:
- Around line 82-84: The source path label currently renders the function call
literally; inside RunView.tsx (the JSX span in RunView) replace the literal
"/tmp/run.{extFor(req.lang)}" with a proper JSX expression that concatenates or
templates the extension, e.g. use {`/tmp/run.${extFor(req.lang)}`} or
{'/tmp/run.' + extFor(req.lang)} so the extFor(req.lang) result is evaluated and
displayed.

In `@console/web/src/pages/Examples/sections/message-variants.tsx`:
- Around line 319-323: The VariantCard labels for sandboxFixtures use only
fixture.functionId causing duplicate titles; update the label in the
sandboxFixtures map (where VariantCard is rendered) to include a scenario suffix
such as fixture.id or fixture.state so each label is unique (e.g.,
label={`sandbox · ${fixture.functionId} · ${fixture.id}`} or include
fixture.state) to ensure distinct card titles when rendering sandboxFixtures.

In `@harness/src/provider-llamacpp/discover.ts`:
- Around line 147-149: The current isPositiveInteger type-guard allows positive
non-integers (e.g., 4096.5) which is invalid for n_ctx/context_window; change
its check to require an integer (use Number.isInteger(value) or value % 1 === 0)
and update any validation/usage of n_ctx/context_window in discover.ts to use
this corrected isPositiveInteger function so only positive integers pass
validation. Ensure the function signature remains value is number and that
callers that set n_ctx/context_window reject non-integer values.

In `@harness/src/turn-orchestrator/agent-trigger.ts`:
- Around line 185-188: The structured payload returned by
extractStructuredHandlerError can contain its own error key that overwrites the
intended discriminator; change the merge so the handler discriminator is applied
last (e.g., spread structured first then set error: 'handler_error') when
calling errorResult in the agent-trigger flow (see extractStructuredHandlerError
and errorResult usage) to ensure the 'handler_error' value is preserved.

---

Nitpick comments:
In `@console/web/src/components/chat/FunctionCallMessage.tsx`:
- Around line 96-103: The component eagerly calls
SandboxToolView.tryRenderPreview and SandboxToolView.tryRender for every message
(creating sandboxPreview, sandboxTerminal, hasSandboxTerminal), which does
unnecessary work when the row is collapsed; change this so those calls are only
executed when the row is open (use the open prop) or move their computation
inside the block that renders the expanded view, then recompute
hasSandboxTerminal and showRequestPaneAbove based on those lazy values so
collapsed rows avoid parsing/render work.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: fa4489e9-3f3f-4fe7-82ba-0656212c3249

📥 Commits

Reviewing files that changed from the base of the PR and between d41b2f9 and 75a2f56.

⛔ Files ignored due to path filters (4)

console/Cargo.lock is excluded by !**/*.lock
database/Cargo.lock is excluded by !**/*.lock
shell/Cargo.lock is excluded by !**/*.lock
storage/Cargo.lock is excluded by !**/*.lock

📒 Files selected for processing (73)

console/web/src/App.tsx
console/web/src/components/chat/AutoAcceptToggle.tsx
console/web/src/components/chat/ChatView.tsx
console/web/src/components/chat/Composer.tsx
console/web/src/components/chat/FunctionCallGroup.tsx
console/web/src/components/chat/FunctionCallMessage.tsx
console/web/src/components/chat/LexicalShell.tsx
console/web/src/components/chat/MessageList.tsx
console/web/src/components/chat/ModelPicker.tsx
console/web/src/components/chat/sandbox/CodeHighlight.tsx
console/web/src/components/chat/sandbox/CreateView.tsx
console/web/src/components/chat/sandbox/ErrorView.tsx
console/web/src/components/chat/sandbox/ExecView.tsx
console/web/src/components/chat/sandbox/FsChmodView.tsx
console/web/src/components/chat/sandbox/FsGrepView.tsx
console/web/src/components/chat/sandbox/FsLsView.tsx
console/web/src/components/chat/sandbox/FsMkdirView.tsx
console/web/src/components/chat/sandbox/FsMvView.tsx
console/web/src/components/chat/sandbox/FsReadView.tsx
console/web/src/components/chat/sandbox/FsRmView.tsx
console/web/src/components/chat/sandbox/FsSedView.tsx
console/web/src/components/chat/sandbox/FsStatView.tsx
console/web/src/components/chat/sandbox/FsWriteView.tsx
console/web/src/components/chat/sandbox/ListView.tsx
console/web/src/components/chat/sandbox/RunView.tsx
console/web/src/components/chat/sandbox/StopView.tsx
console/web/src/components/chat/sandbox/__tests__/parsers.test.ts
console/web/src/components/chat/sandbox/format.ts
console/web/src/components/chat/sandbox/index.tsx
console/web/src/components/chat/sandbox/parsers.ts
console/web/src/components/chat/sandbox/shared.tsx
console/web/src/components/chat/sandbox/terminal/AnsiOutput.tsx
console/web/src/components/chat/sandbox/terminal/Terminal.tsx
console/web/src/components/providers/ProviderRow.tsx
console/web/src/components/providers/ProviderSettingsDialog.tsx
console/web/src/components/ui/LiveRegion.tsx
console/web/src/components/ui/ModeToggle.tsx
console/web/src/components/ui/Select.tsx
console/web/src/hooks/use-auto-accept-approvals.ts
console/web/src/hooks/use-chat-dock.ts
console/web/src/hooks/use-functions-catalog.ts
console/web/src/hooks/use-hash-route.ts
console/web/src/hooks/use-live-announcer.ts
console/web/src/lib/backend/auto-accept-policy.test.ts
console/web/src/lib/backend/auto-accept-policy.ts
console/web/src/lib/backend/auto-accept.test.ts
console/web/src/lib/backend/auto-accept.ts
console/web/src/lib/backend/history.ts
console/web/src/lib/backend/pending-approvals-store.test.ts
console/web/src/lib/conversations-context.tsx
console/web/src/lib/format-stop-reason.test.ts
console/web/src/lib/functions-catalog.ts
console/web/src/lib/functions.ts
console/web/src/lib/providers.test.ts
console/web/src/lib/providers.ts
console/web/src/lib/syntax.tsx
console/web/src/pages/Configuration/index.tsx
console/web/src/pages/Examples/sections/message-variants.tsx
console/web/src/pages/Examples/sections/sandbox-fixtures.ts
console/web/src/pages/Playground/scenarios/happy-ask.ts
console/web/src/pages/Playground/scenarios/index.ts
console/web/src/pages/Playground/scenarios/multi-function-agent.ts
console/web/src/pages/Traces/components/SessionDetailPanel.tsx
console/web/src/pages/Traces/components/WaterfallChart.tsx
console/web/src/pages/Traces/hooks/useResizablePanels.ts
console/web/src/pages/Traces/lib/traceTransform.test.ts
console/web/src/pages/Traces/lib/traceTransform.ts
console/web/src/types/chat.ts
harness/src/provider-llamacpp/discover.ts
harness/src/runtime/iii.ts
harness/src/turn-orchestrator/agent-trigger.ts
harness/tests/provider-llamacpp/discover.test.ts
harness/tests/turn-orchestrator/agent-trigger.test.ts

coderabbitai · 2026-05-26T16:52:07Z

+        {error.docs_url ? (
+          <a
+            href={error.docs_url}
+            target="_blank"
+            rel="noreferrer noopener"
+            className="font-mono text-[11px] uppercase tracking-[0.06em] text-accent hover:underline w-fit"
+          >
+            docs ↗
+          </a>
+        ) : null}


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Validate docs_url protocol before rendering the external link.

docs_url comes from payload data and is inserted directly into href. Restrict it to http:/https: before rendering.

💡 Suggested fix

+function safeHttpUrl(url: string): string | null { + try { + const parsed = new URL(url) + return parsed.protocol === 'http:' || parsed.protocol === 'https:' ? url : null + } catch { + return null + } +} + export function ErrorView({ error }: ErrorViewProps) { const retryable = error.retryable === true const streams = execStreamsFromFix(error) + const docsUrl = error.docs_url ? safeHttpUrl(error.docs_url) : null return ( @@ - {error.docs_url ? ( + {docsUrl ? ( <a - href={error.docs_url} + href={docsUrl} target="_blank" rel="noreferrer noopener" className="font-mono text-[11px] uppercase tracking-[0.06em] text-accent hover:underline w-fit" > docs ↗ </a> ) : null}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/components/chat/sandbox/ErrorView.tsx` around lines 71 - 80, The external link rendering in ErrorView (component ErrorView.tsx) uses error.docs_url directly; validate that the URL's protocol is either "http:" or "https:" before rendering the <a> tag to avoid unsafe protocols. Update the rendering logic that checks error.docs_url to parse or test the value (e.g., new URL(...) or a startsWith/protocol check) and only return the anchor when the protocol is http/https; otherwise render null or a safe fallback, preserving the existing target/rel attributes.

coderabbitai · 2026-05-26T16:52:07Z

+export function isStreamChannelRef(
+  value: unknown,
+): value is { channel_id: string; access_key: string; direction: string } {
+  return (
+    !!value &&
+    typeof value === 'object' &&
+    'channel_id' in value &&
+    typeof (value as { channel_id: unknown }).channel_id === 'string'
+  )


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Tighten isStreamChannelRef to match its declared narrowed type.

The predicate only checks channel_id, but it narrows to an object that also guarantees access_key and direction. That can misclassify payloads and cause downstream undefined access.

💡 Suggested fix

export function isStreamChannelRef( value: unknown, ): value is { channel_id: string; access_key: string; direction: string } { return ( !!value && typeof value === 'object' && 'channel_id' in value && - typeof (value as { channel_id: unknown }).channel_id === 'string' + typeof (value as { channel_id: unknown }).channel_id === 'string' && + 'access_key' in value && + typeof (value as { access_key: unknown }).access_key === 'string' && + 'direction' in value && + ((value as { direction: unknown }).direction === 'read' || + (value as { direction: unknown }).direction === 'write') ) }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

export function isStreamChannelRef(

value: unknown,

): value is { channel_id: string; access_key: string; direction: string } {

return (

!!value &&

typeof value === 'object' &&

'channel_id' in value &&

typeof (value as { channel_id: unknown }).channel_id === 'string'

)

export function isStreamChannelRef(

value: unknown,

): value is { channel_id: string; access_key: string; direction: string } {

return (

!!value &&

typeof value === 'object' &&

'channel_id' in value &&

typeof (value as { channel_id: unknown }).channel_id === 'string' &&

'access_key' in value &&

typeof (value as { access_key: unknown }).access_key === 'string' &&

'direction' in value &&

((value as { direction: unknown }).direction === 'read' ||

(value as { direction: unknown }).direction === 'write')

)

}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/components/chat/sandbox/format.ts` around lines 193 - 201, isStreamChannelRef currently only validates channel_id but its type guard promises channel_id, access_key, and direction; update isStreamChannelRef to also check that value is a non-null object and that 'access_key' and 'direction' are in value and are strings (e.g., add "'access_key' in value && typeof (value as { access_key: unknown }).access_key === 'string' && 'direction' in value && typeof (value as { direction: unknown }).direction === 'string'") so the narrowed type truly matches the runtime validation.

coderabbitai · 2026-05-26T16:52:07Z

+                    {e.is_dir ? '—' : formatBytes(e.size)}
+                  </td>
+                  <td className="px-2 py-1.5 text-ink-faint tabular-nums">
+                    {`${e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`}


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Render symlink mode prefix correctly.

Line 54 treats symlinks as regular files in the mode prefix. For ls-style output, symlinks should use l.

💡 Suggested fix

- {`${e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`} + {`${e.is_symlink ? 'l' : e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`}

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

{`${e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`}

{`${e.is_symlink ? 'l' : e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/components/chat/sandbox/FsLsView.tsx` at line 54, The mode prefix currently uses `${e.is_dir ? 'd' : '-'}` which treats symlinks as regular files; change the prefix logic in FsLsView (where the template uses e.is_dir and formatMode) to check e.is_symlink first and emit 'l' for symlinks, then 'd' for directories, otherwise '-' (e.g. if e.is_symlink ? 'l' : e.is_dir ? 'd' : '-'), leaving formatMode(e.mode) unchanged.

coderabbitai · 2026-05-26T16:52:08Z

+                    <Tooltip>
+                      <TooltipTrigger asChild>
+                        <span className="inline-flex items-center gap-1 text-warn cursor-help">
+                          <TriangleAlert aria-hidden className="w-3.5 h-3.5" />
+                          err
+                        </span>
+                      </TooltipTrigger>


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Make the error tooltip trigger keyboard-accessible.

On Line 70, TooltipTrigger wraps a non-focusable <span>, so keyboard users can’t open the tooltip and read r.error.

♿ Suggested fix

) : r.error ? ( <Tooltip> <TooltipTrigger asChild> - <span className="inline-flex items-center gap-1 text-warn cursor-help"> + <button + type="button" + className="inline-flex items-center gap-1 text-warn cursor-help focus-visible:outline-none focus-visible:ring-1 focus-visible:ring-accent" + aria-label={`Show error details for ${r.path}`} + > <TriangleAlert aria-hidden className="w-3.5 h-3.5" /> err - </span> + </button> </TooltipTrigger> <TooltipContent>{r.error}</TooltipContent> </Tooltip>

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

<Tooltip>

<TooltipTrigger asChild>

<span className="inline-flex items-center gap-1 text-warn cursor-help">

<TriangleAlert aria-hidden className="w-3.5 h-3.5" />

err

</span>

</TooltipTrigger>

<Tooltip>

<TooltipTrigger asChild>

<button

type="button"

className="inline-flex items-center gap-1 text-warn cursor-help focus-visible:outline-none focus-visible:ring-1 focus-visible:ring-accent"

aria-label={`Show error details for ${r.path}`}

>

<TriangleAlert aria-hidden className="w-3.5 h-3.5" />

err

</button>

</TooltipTrigger>

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/components/chat/sandbox/FsSedView.tsx` around lines 69 - 75, The TooltipTrigger currently wraps a non-focusable <span>, preventing keyboard users from opening the tooltip; update the trigger so it is keyboard-accessible by replacing or modifying the wrapped element referenced by TooltipTrigger (the inline span around TriangleAlert and "err") to be focusable and semantics-correct — e.g., use a real button element or add tabIndex=0 plus role="button" and an accessible name/aria-describedby that references the tooltip content (r.error) so keyboard users can focus and invoke the tooltip via TooltipTrigger.

coderabbitai · 2026-05-26T16:52:08Z

+  if (value && typeof value === 'object' && !Array.isArray(value)) {
+    const parsed = functionErrorEnvelopeSchema.safeParse(
+      (value as Record<string, unknown>).error,
+    )
+    if (parsed.success && parsed.data.kind === 'function_error') {
+      const invocation = invocationFromFunctionError(parsed.data)
+      if (invocation) {
+        return { variant: 'invocation', error: invocation }
+      }
+    }
+  }


⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Handle wrapped function_error envelopes in the same candidate pass.

parseSandboxErrorDisplay only checks value.error for function_error. When the payload is wrapped (for example via { content, details }), this branch is skipped even though candidates already contain unwrapped objects.

💡 Suggested fix

export function parseSandboxErrorDisplay( value: unknown, ): SandboxErrorDisplay | null { const candidates = collectErrorCandidates(value) @@ for (const candidate of candidates) { + if (candidate && typeof candidate === 'object' && !Array.isArray(candidate)) { + const obj = candidate as Record<string, unknown> + const parsed = functionErrorEnvelopeSchema.safeParse(obj.error) + if (parsed.success && parsed.data.kind === 'function_error') { + const invocation = invocationFromFunctionError(parsed.data) + if (invocation) return { variant: 'invocation', error: invocation } + } + } + const denial = tryParseDenial(candidate) if (denial) { return { variant: 'invocation', error: denialToInvocation(denial), } } } - - if (value && typeof value === 'object' && !Array.isArray(value)) { - const parsed = functionErrorEnvelopeSchema.safeParse( - (value as Record<string, unknown>).error, - ) - if (parsed.success && parsed.data.kind === 'function_error') { - const invocation = invocationFromFunctionError(parsed.data) - if (invocation) { - return { variant: 'invocation', error: invocation } - } - } - } return null }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

if (value && typeof value === 'object' && !Array.isArray(value)) {

const parsed = functionErrorEnvelopeSchema.safeParse(

(value as Record<string, unknown>).error,

)

if (parsed.success && parsed.data.kind === 'function_error') {

const invocation = invocationFromFunctionError(parsed.data)

if (invocation) {

return { variant: 'invocation', error: invocation }

}

}

}

export function parseSandboxErrorDisplay(

value: unknown,

): SandboxErrorDisplay | null {

const candidates = collectErrorCandidates(value)

for (const candidate of candidates) {

if (candidate && typeof candidate === 'object' && !Array.isArray(candidate)) {

const obj = candidate as Record<string, unknown>

const parsed = functionErrorEnvelopeSchema.safeParse(obj.error)

if (parsed.success && parsed.data.kind === 'function_error') {

const invocation = invocationFromFunctionError(parsed.data)

if (invocation) return { variant: 'invocation', error: invocation }

}

}

const denial = tryParseDenial(candidate)

if (denial) {

return {

variant: 'invocation',

error: denialToInvocation(denial),

}

}

}

return null

}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/components/chat/sandbox/parsers.ts` around lines 566 - 576, parseSandboxErrorDisplay currently only inspects value.error for a function_error envelope so wrapped payloads (e.g., {content, details}) are skipped; update the parsing to also unwrap common wrappers before calling functionErrorEnvelopeSchema.safeParse. Specifically, when encountering an object in the branch using functionErrorEnvelopeSchema.safeParse, check for nested candidates like value.content, value.details, and value.content?.error or value.details?.error (falling back to the original value.error), then run functionErrorEnvelopeSchema.safeParse on that unwrapped object and continue to use invocationFromFunctionError and the existing { variant: 'invocation', error: invocation } return path if parsed.success and kind === 'function_error'.

coderabbitai · 2026-05-26T16:52:08Z

+        <span className="text-ink-ghost normal-case tracking-normal">
+          · /tmp/run.{extFor(req.lang)} · {lineCount}{' '}
+          {lineCount === 1 ? 'line' : 'lines'}


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Fix literal interpolation in source path label.

Line 83 renders extFor(req.lang) as plain text, so users see /tmp/run.{extFor(req.lang)} instead of the actual extension.

💡 Suggested fix

- <span className="text-ink-ghost normal-case tracking-normal"> - · /tmp/run.{extFor(req.lang)} · {lineCount}{' '} + <span className="text-ink-ghost normal-case tracking-normal"> + {` · /tmp/run.${extFor(req.lang)} · ${lineCount} `} {lineCount === 1 ? 'line' : 'lines'} </span>

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/components/chat/sandbox/RunView.tsx` around lines 82 - 84, The source path label currently renders the function call literally; inside RunView.tsx (the JSX span in RunView) replace the literal "/tmp/run.{extFor(req.lang)}" with a proper JSX expression that concatenates or templates the extension, e.g. use {`/tmp/run.${extFor(req.lang)}`} or {'/tmp/run.' + extFor(req.lang)} so the extFor(req.lang) result is evaluated and displayed.

coderabbitai · 2026-05-26T16:52:08Z

+        {sandboxFixtures.map((fixture) => (
+          <VariantCard
+            key={fixture.id}
+            label={`sandbox · ${fixture.functionId}`}
+          >


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Make sandbox variant labels unique per fixture.

Using only functionId creates duplicate card titles (for example multiple sandbox::exec states), which makes the examples harder to scan. Include a scenario suffix (e.g., fixture.id or state) in the label.

Suggested tweak

- label={`sandbox · ${fixture.functionId}`} + label={`sandbox · ${fixture.functionId} · ${fixture.id}`}

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

{sandboxFixtures.map((fixture) => (

<VariantCard

key={fixture.id}

label={`sandbox · ${fixture.functionId}`}

>

{sandboxFixtures.map((fixture) => (

<VariantCard

key={fixture.id}

label={`sandbox · ${fixture.functionId} · ${fixture.id}`}

>

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@console/web/src/pages/Examples/sections/message-variants.tsx` around lines 319 - 323, The VariantCard labels for sandboxFixtures use only fixture.functionId causing duplicate titles; update the label in the sandboxFixtures map (where VariantCard is rendered) to include a scenario suffix such as fixture.id or fixture.state so each label is unique (e.g., label={`sandbox · ${fixture.functionId} · ${fixture.id}`} or include fixture.state) to ensure distinct card titles when rendering sandboxFixtures.

coderabbitai · 2026-05-26T16:52:08Z

+function isPositiveInteger(value: unknown): value is number {
+  return typeof value === 'number' && Number.isFinite(value) && value > 0;
+}


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Enforce integer validation for n_ctx.

Line 148 accepts positive non-integers (for example 4096.5), but n_ctx/context_window should be an integer.

Suggested fix

function isPositiveInteger(value: unknown): value is number { - return typeof value === 'number' && Number.isFinite(value) && value > 0; + return typeof value === 'number' && Number.isSafeInteger(value) && value > 0; }

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@harness/src/provider-llamacpp/discover.ts` around lines 147 - 149, The current isPositiveInteger type-guard allows positive non-integers (e.g., 4096.5) which is invalid for n_ctx/context_window; change its check to require an integer (use Number.isInteger(value) or value % 1 === 0) and update any validation/usage of n_ctx/context_window in discover.ts to use this corrected isPositiveInteger function so only positive integers pass validation. Ensure the function signature remains value is number and that callers that set n_ctx/context_window reject non-integer values.

coderabbitai · 2026-05-26T16:52:08Z

+    const structured = extractStructuredHandlerError(err);
+    if (structured) {
+      return errorResult({ error: 'handler_error', ...structured });
+    }


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Preserve the handler_error discriminator when merging structured payloads.

At Line 187, ...structured can overwrite error: 'handler_error' if the envelope already has an error key.

Suggested fix

- return errorResult({ error: 'handler_error', ...structured }); + return errorResult({ ...structured, error: 'handler_error' });

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

const structured = extractStructuredHandlerError(err);

if (structured) {

return errorResult({ error: 'handler_error', ...structured });

}

const structured = extractStructuredHandlerError(err);

if (structured) {

return errorResult({ ...structured, error: 'handler_error' });

}

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@harness/src/turn-orchestrator/agent-trigger.ts` around lines 185 - 188, The structured payload returned by extractStructuredHandlerError can contain its own error key that overwrites the intended discriminator; change the merge so the handler discriminator is applied last (e.g., spread structured first then set error: 'handler_error') when calling errorResult in the agent-trigger flow (see extractStructuredHandlerError and errorResult usage) to ensure the 'handler_error' value is preserved.

vercel Bot deployed to Preview May 26, 2026 16:40 View deployment

fix(lint): apply biome formatting to discover.test.ts

8504a25

The new /props test introduced a double-quoted string and a multi-line new Response(...) call that biome formats differently. biome ci fails on the formatting diff. No behavior change.

vercel Bot deployed to Preview May 26, 2026 16:48 View deployment

coderabbitai Bot reviewed May 26, 2026

View reviewed changes

sergiofilhowz approved these changes May 26, 2026

View reviewed changes

andersonleal merged commit 4baefd7 into main May 26, 2026
23 of 24 checks passed

andersonleal deleted the chat-sandbox-views branch May 26, 2026 17:14

coderabbitai Bot mentioned this pull request May 29, 2026

feat(console): custom function-call views, session export, markdown directory bodies #201

Merged

5 tasks

coderabbitai Bot mentioned this pull request Jun 12, 2026

feat(console): friendly chat views for shell::* function calls #253

Merged

4 tasks

	{`${e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`}
	{`${e.is_symlink ? 'l' : e.is_dir ? 'd' : '-'}${formatMode(e.mode)}`}

Conversation

andersonleal commented May 26, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by CodeRabbit

Release Notes

Uh oh!

vercel Bot commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai Bot commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Walkthrough

Changes

Uh oh!

github-actions Bot commented May 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

skill-check — worker

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

andersonleal commented May 26, 2026 •

edited by coderabbitai Bot

Loading

vercel Bot commented May 26, 2026 •

edited

Loading

coderabbitai Bot commented May 26, 2026 •

edited

Loading

github-actions Bot commented May 26, 2026 •

edited

Loading