Z.AI MCP CLI v2 — Design + Implementation Plan

Executive Summary

This document defines the refactor of zai-cli into a fully MCP‑native, LLM‑optimized CLI. The new CLI:

Uses UTCP + MCP for all services, including Vision (stdio).
Supports Code Mode for tool‑chaining via TypeScript.
Provides tool discovery, schemas, and raw tool calls for perfect LLM alignment.
Outputs data‑only by default for token efficiency, with optional JSON wrappers.

The goal is to make every MCP tool discoverable, debuggable, and safe for agent usage while preserving simple human UX.

Goals

Full MCP coverage for Vision, Search, Reader, and ZRead.
LLM‑first ergonomics: data‑only output, tool schemas, and chainable execution.
Zero guesswork: explicit help, tool discovery, and a doctor command.
Minimal surface area: one CLI that is both human‑friendly and agent‑friendly.

Architecture

High‑Level Flow

zai-cli
├── MCP Client (UTCP + MCP plugin)
│   ├── Vision (stdio via @z_ai/mcp-server)
│   ├── Search (HTTP)
│   ├── Reader (HTTP)
│   └── ZRead (HTTP)
└── Code Mode (UTCP Code Mode)
    └── Executes TypeScript with tool calls

Why UTCP + MCP?

Native MCP interoperability.
Tool discovery (listTools) for schema introspection.
Unified tool naming (manual.server.tool).
Reliable session handling for both stdio and HTTP.

Why Code Mode?

Best tool‑chaining workflow for LLMs.
TypeScript interface generation for all tools.
Tool discovery + runtime introspection (__interfaces).

MCP Servers

MCP Manual Name

All tools are registered under a single manual name:

zai.<server>.<tool>

Servers:

vision
search
reader
zread

Vision (stdio)

Command: npx -y @z_ai/mcp-server@latest

Tools:

analyze_image
ui_to_artifact
extract_text_from_screenshot
diagnose_error_screenshot
understand_technical_diagram
analyze_data_visualization
ui_diff_check
analyze_video

Search / Reader / ZRead (HTTP)

Endpoints:

https://api.z.ai/api/mcp/web_search_prime/mcp
https://api.z.ai/api/mcp/web_reader/mcp
https://api.z.ai/api/mcp/zread/mcp

CLI Surface

Core Commands

zai-cli vision <cmd> <source> [prompt] [options]
zai-cli search <query> [options]
zai-cli read <url> [options]
zai-cli repo <cmd> <owner/repo> [args]

LLM‑Focused Meta Commands

zai-cli tools [--full|--typescript]
zai-cli tool <name>
zai-cli call <tool> --json <payload>
zai-cli doctor
zai-cli code <run|eval|interfaces|prompt>

Output Contract

Default (data‑only)

All commands emit the raw tool data to stdout, optimized for LLM consumption.

Optional JSON Wrapper

Use --output-format json or --output-format pretty:

{
  "success": true,
  "data": "...",
  "timestamp": 1234567890
}

Errors are JSON on stderr and include actionable help.

Configuration

Required

Z_AI_API_KEY

Vision MCP Overrides (stdio)

Z_AI_VISION_MCP_COMMAND
Z_AI_VISION_MCP_ARGS
Z_AI_VISION_MCP_CWD
Z_AI_VISION_MCP=0   # disable vision MCP

Output

ZAI_OUTPUT_MODE=data|json|pretty

Implementation Steps (Completed)

Central MCP configuration
- Single call template shared by UTCP + Code Mode.
Vision moved to MCP
- All 8 tools mapped directly to MCP tool calls.
Tool discovery + raw calls
- tools, tool, and call commands.
Code Mode support
- code run, code eval, code interfaces, code prompt.
Output overhaul
- Data‑only output by default, JSON wrapper optional.
Diagnostics
- doctor command validates env + tool discovery.

Testing Plan

Required Smoke Tests (per tool)

Vision tools:
- analyze, ui-to-code, extract-text, diagnose-error, diagram, chart, diff, video
Search:
- search with domain + recency
Reader:
- read with markdown + text
ZRead:
- repo tree, repo read, repo search

Optional Integration

tools + tool (schema discovery)
call with explicit JSON payload
code eval (tool chaining)
doctor (env + tools)

Open Questions (Optional Future Work)

Add --stream option for streaming responses if MCP servers support it.
Add caching layer for search/read.
Add “batch” mode for tool calls.

Summary

This refactor makes zai-cli fully MCP‑native, LLM‑optimized, and discoverable. All tool schemas are accessible, every endpoint is reachable through a stable interface, and the CLI now supports both direct tool calls and tool‑chaining via Code Mode.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Z.AI MCP CLI v2 — Design + Implementation Plan

Executive Summary

Goals

Architecture

High‑Level Flow

Why UTCP + MCP?

Why Code Mode?

MCP Servers

MCP Manual Name

Vision (stdio)

Search / Reader / ZRead (HTTP)

CLI Surface

Core Commands

LLM‑Focused Meta Commands

Output Contract

Default (data‑only)

Optional JSON Wrapper

Configuration

Required

Vision MCP Overrides (stdio)

Output

Implementation Steps (Completed)

Testing Plan

Required Smoke Tests (per tool)

Optional Integration

Open Questions (Optional Future Work)

Summary

FilesExpand file tree

DESIGN_DOCUMENT.md

Latest commit

History

DESIGN_DOCUMENT.md

File metadata and controls

Z.AI MCP CLI v2 — Design + Implementation Plan

Executive Summary

Goals

Architecture

High‑Level Flow

Why UTCP + MCP?

Why Code Mode?

MCP Servers

MCP Manual Name

Vision (stdio)

Search / Reader / ZRead (HTTP)

CLI Surface

Core Commands

LLM‑Focused Meta Commands

Output Contract

Default (data‑only)

Optional JSON Wrapper

Configuration

Required

Vision MCP Overrides (stdio)

Output

Implementation Steps (Completed)

Testing Plan

Required Smoke Tests (per tool)

Optional Integration

Open Questions (Optional Future Work)

Summary