The Operating System for Autonomous AI Agents.
A systematic implementation of Harness Engineering, enabling agents to work in a stable, controllable, and verifiable environment. It shifts focus from "writing code" to "designing feedback loops".
- Pain: Agents operate in discrete sessions. When a new session starts, the agent is fresh and completely amnesic, unable to continue previous work.
- Solution: Enforce Context Discovery Protocol. Agents recover full project context within 5 seconds by reading standardized
progress.txtandfeature_list.jsonupon wake-up.
- Pain: Agents often write code that "looks right but doesn't run", or hallucinate non-existent APIs.
- Solution: Test-Driven Development (TDD) + Reflexion Loop. Before writing implementation code, a failing test script must be written. No test evidence = code does not exist.
- Pain: As projects grow, agents easily introduce circular dependencies and break layering.
- Solution: Architecture as Law. Custom Linter rules are not just suggestions, but unbreakable red lines. CI pipelines directly block any violating code.
- Pain: Agents code "blindly", unable to see runtime errors or UI anomalies like humans do.
- Solution: Observability First. Integrate log analysis and screenshot verification, forcing agents to "observe" system state before modifying code.
Option 1: Tell Claude directly:
Use the Harness skill from
https://github.com/ldzhouquan/agent-harness-skill
Option 2: Clone and link:
git clone https://github.com/ldzhouquan/agent-harness-skill.git
ln -s agent-harness-skill/Harness ~/.claude/skills/HarnessTell your Agent:
“/Harness Please set up a new project for me."
The Agent should automatically start Module 1: Project Initialization and create AGENTS.md and progress.txt.
Harness/
├── SKILL.md # 🚦 Traffic Control Tower (Entry Point)
├── workflow.md # 📋 Detailed Checklists
├── Tools/ # 🛠️ Tool scripts
├── scripts/ # 📜 Automation scripts
├── references/ # 📚 Reference materials
│ ├── initialization/
│ ├── bugfix/
│ └── development/
└── modules/ # 📚 Knowledge Modules
├── initialization.md # Setup & Golden Spike
├── feature-management.md # Planning & Specs
├── development-workflow.md # The Loop
├── bug-fix-protocol.md # Bug fix protocol (TDD)
├── architecture-enforcement.md # The Law
├── code-merge.md # Review & Merge
├── autonomous-development.md # E2E Autonomy
├── technical-debt.md # Cleanup Protocols
└── progress-tracking.md # Progress tracking
- Init: Setup CI/Lint/Test -> Run "Golden Spike" (Hello World).
- Plan: Deconstruct requirements into
feature_list.json. - Dev: Loop
Locate -> Ground -> Recall -> Verify -> Claim. - Reflexion:
Design -> Code -> Test -> Fix(The Engine). - Merge: Verify clean state -> Merge.