WebAI-MCP/todo.md at main · cpjet64/WebAI-MCP

Rules for AI coding agent

Execution order: perform tasks strictly top to bottom.
Gating: do not start a task until the previous task is implemented, tested, and verified.
Every "Implement" task must include adding unit/integration tests and running them successfully before proceeding.
Use Rust tests (tokio, wiremock) for backend; use a small JS harness only for Chrome extension UX where needed.
After completing a section, run npm run build:all and the targeted tests for that section before advancing.
Keep changes minimal and targeted; avoid broad refactors.
Match existing project style and folder layout.
Prefer Rust for backend: server + MCP use rmcp.
Preserve HTTP/MCP JSON shapes and semantics.
Use feature flags to switch Rust vs legacy paths.
Do not remove JS Chrome extension; test it via small JS harness.
Testing: prefer Rust (tokio, wiremock); keep JS tests only where browser/DevTools is required.
No network in unit tests. Mock with wiremock/httpmock.
In transitional period, a Node test fetch proxy is allowed to keep legacy Jest+nock passing. Remove when Rust tests cover the cases.
Respect licenses. Add THIRD_PARTY_NOTICES for embedded Node/Lighthouse. Do not ship unvetted binaries.
Error handling: no panics in non‑test code; use anyhow or crate error types. Return early on errors.
Security: bind HTTP to 127.0.0.1 by default; warn on non‑loopback binds.
Performance: prefer async Rust (tokio); avoid blocking calls.
Documentation: update convert-rust.md and README when user- visible behavior or flags change.

Conversion checklist

Bootstrap workspace

Create crates/ Rust workspace and top‑level Cargo.toml
Add crates/core with shared types and errors
Add crates/server HTTP/WS service crate
Add crates/mcp MCP stdio server crate (rmcp)
Add crates/browser CDP abstraction crate (scaffold)
Add crates/lighthouse-shim (scaffold; optional feature)
Add xtask/ or scripts for building prebuilt binaries

Core crate

Define DTOs mirroring existing JSON: logs, audits, identity
Implement bounded ring buffer with JSON‑safe truncation
Implement path conversion helper (WSL/UNC -> native)
Implement error model and human messages (parity with TS)
Add serde + schemars derives for future schema docs
Unit tests for ring buffer, truncation, and errors

Server crate: HTTP/WS skeleton

Server crate: screenshots and storage

Define screenshot message contract (base64 payload)
Create folder if missing (Downloads/mcp-screenshots)
Safe filename generation (timestamp + sanitized title)
Atomic write to avoid partial files on crash
Disk quota/size guard; error mapping
Implement cookies/localStorage/sessionStorage handlers
Payload size guards and structured logging
Tests: folder creation, safe filename, atomic write, quota error mapping, payload size rejection

Proxy and network config

Env proxy detection: HTTP(S)_PROXY, lowercase variants
NO_PROXY wildcards, suffixes, and exact matches
Local/private range bypass (127/localhost/10/172/192.168)
HTTP/HTTPS vs SOCKS agent selection
test_connectivity API using reqwest agents
Unit tests: private ranges, NO_PROXY, bad proxy URL

Notes

test_connectivity implemented with reqwest. Integration test binds to localhost and is marked #[ignore] by default to support restricted sandboxes. Run with cargo test -p webai-server -- --ignored in CI/dev.
Linux auto-paste uses xclip + xdotool and requires X11. On Wayland, the fallback may not work; a Wayland-specific helper may be needed.
Windows registry App Paths + BLBeacon implemented via injectable RegProbe trait. Real registry integration can be added behind a windows-reg feature for Windows CI; tests use a FakeReg.

OS integrations (auto‑paste)

macOS: AppleScript via osascript fallback
macOS: optional native APIs (feature flag)
Windows: windows crate clipboard + focus
Windows: PowerShell fallback path
Linux: xdotool/X11; Wayland caveats noted
Endpoint parity and message texts
Per‑OS tests and error mapping

Browser detection and lifecycle

Browser automation (CDP)

Evaluate chromiumoxide vs headless_chrome
Implement headless browser lifecycle (user‑data‑dir)
Executable path detection (Chrome/Edge/Brave; partial FF)
Navigation with options: headers, cookies, viewport, UA, locale, timezone (interfaces only)
Resource blocking via CDP (options scaffold)
Network condition emulation (options scaffold)
Replace Puppeteer calls behind feature flag
Fallback path to legacy JS (env var)

WebSocket flows (extension‑ws)

Lighthouse

Feature audit-lighthouse (default on)
Package embedded Node + Lighthouse assets
First‑run extraction to cache dir
Spawn management and lifecycle
Flags mapping and pure LHR JSON return
--no-audit disables audits entirely
Robust error mapping (Chrome missing, spawn ENOENT)
THIRD_PARTY_NOTICES and license texts
Tests: spawn success, missing Chrome, missing assets

MCP server (rmcp)

CLI and single binary

Node package integration (thin launchers)

@cpjet64/webai-server: JS bin that prefers Rust binary
Postinstall: download prebuilt or build via cargo
Fallback to legacy JS server only if binary missing
@cpjet64/webai-mcp: JS bin that prefers Rust MCP
Keep legacy JS MCP as temporary fallback
Ensure /.identity reports package version

Testing (Rust‑first)

Transitional JS tests

Keep minimal Jest harness for Chrome extension E2E
(Optional) Temporary Node fetch proxy for legacy tests
Document deprecation of nock‑based backend tests

Docs and dev UX

Update convert-rust.md with progress links
Update README with single‑binary usage
Add CLI help and examples
Add troubleshooting for audits and browsers
Add development guide for building from source

Distribution and licensing

Build prebuilt binaries for macOS, Linux, Windows
Code‑sign where applicable (optional)
Include THIRD_PARTY_NOTICES
Bundle Node/Lighthouse assets when feature enabled
Verify licenses for all bundled deps

Security and stability

Bind to 127.0.0.1 by default
Add rate limits or backpressure if needed
Ensure graceful shutdown and cleanup of temp dirs
Fuzz test parsers and JSON inputs (optional)
Warn when binding to non‑loopback host
Temp dir cleanup verification tests
Redaction toggle and tests (PII/tokens)

Cleanup and deprecation

Remove temporary Node fetch proxy once Rust tests pass
Mark legacy JS server paths as deprecated
Keep Chrome extension JS; document its interface

Notes

Track blockers or decisions here during implementation.
macOS native auto-paste: blocked in this sandbox due to restricted network (cannot add Cocoa/objc crates). The feature gate and stubs exist; native implementation will be added once deps can be fetched in CI/dev.

Cross‑browser support

Selector engine

CSS/XPath/ARIA/Text selectors
Shadow DOM piercing
Auto‑wait for visible/stable/attached
Deterministic selection and error texts

Record and replay

Action recorder (WS events -> script)
Export to Rust/TS test scripts
Deterministic waits (network idle, element conditions)
CLI: webai record and webai replay

Network capture and replay

HAR capture per session
HAR redaction rules
HAR replay for deterministic tests
CLI: webai har capture|replay

Artifacts and visual tools

Full‑page screenshots
Region crop
PDF/MHTML export
Visual diff (SSIM/threshold)

Observability

Structured logs with per‑session ids
OpenTelemetry traces + metrics (feature flag)
Sampling controls via config
Timeline view data: network waterfall, console correlation (JSON API)

Crash diagnostics

Auto crash bundle (logs, HAR, traces)
Privacy redaction policies
CLI: webai diag bundle

Security and privacy

Token/PII redaction rules (regex + allowlist)
Sandboxed modes: disable downloads, dialogs
Allow/deny lists for URLs and downloads
Proxy rotation + NO_PROXY handling parity

Developer experience

Single binary UX: webai server|mcp|all
Config layering: CLI > ENV > file
JSON schema for config; validate on load
First‑run diagnostics (browser detection, permissions, env checks)
Browser downloader helper for CI

Scaling and remote workers

Remote worker protocol (gRPC/WS)
Per‑tenant namespaces and quotas
Secure auth for remote execution

MCP enhancements (rmcp)

Progress and cancellation streaming
Tool schema introspection endpoint
Attachment streaming (screenshots, HAR, PDF) with chunking

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rules for AI coding agent

Conversion checklist

Bootstrap workspace

Core crate

Server crate: HTTP/WS skeleton

Server crate: screenshots and storage

Proxy and network config

Notes

OS integrations (auto‑paste)

Browser detection and lifecycle

Browser automation (CDP)

WebSocket flows (extension‑ws)

Lighthouse

MCP server (rmcp)

CLI and single binary

Node package integration (thin launchers)

Testing (Rust‑first)

Transitional JS tests

Docs and dev UX

Distribution and licensing

Security and stability

Cleanup and deprecation

Notes

Cross‑browser support

Selector engine

Record and replay

Network capture and replay

Artifacts and visual tools

Observability

Crash diagnostics

Security and privacy

Developer experience

Scaling and remote workers

MCP enhancements (rmcp)

FilesExpand file tree

todo.md

Latest commit

History

todo.md

File metadata and controls

Rules for AI coding agent

Conversion checklist

Bootstrap workspace

Core crate

Server crate: HTTP/WS skeleton

Server crate: screenshots and storage

Proxy and network config

Notes

OS integrations (auto‑paste)

Browser detection and lifecycle

Browser automation (CDP)

WebSocket flows (extension‑ws)

Lighthouse

MCP server (rmcp)

CLI and single binary

Node package integration (thin launchers)

Testing (Rust‑first)

Transitional JS tests

Docs and dev UX

Distribution and licensing

Security and stability

Cleanup and deprecation

Notes

Cross‑browser support

Selector engine

Record and replay

Network capture and replay

Artifacts and visual tools

Observability

Crash diagnostics

Security and privacy

Developer experience

Scaling and remote workers

MCP enhancements (rmcp)