jamieosh
diff --git a/‎CHANGELOG.md‎
Lines changed: 8 additions & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 8 additions & 0 deletions
diff --git a/‎README.md‎
Lines changed: 10 additions & 7 deletions b/‎README.md‎
Lines changed: 10 additions & 7 deletions
diff --git a/‎agents/__init__.py‎
Lines changed: 27 additions & 2 deletions b/‎agents/__init__.py‎
Lines changed: 27 additions & 2 deletions
diff --git a/‎agents/base_agent.py‎
Lines changed: 26 additions & 32 deletions b/‎agents/base_agent.py‎
Lines changed: 26 additions & 32 deletions
diff --git a/‎agents/chat_harness.py‎
Lines changed: 209 additions & 0 deletions b/‎agents/chat_harness.py‎
Lines changed: 209 additions & 0 deletions
@@ -2,6 +2,14 @@
 
 ## 2026-03-15
 
+### Ship Phase 3 Chat Harness Vocabulary And Contracts
+
+- Added a normalized `ChatHarness` contract with serialization-friendly request, result, event, failure, identity, capability, and observability types in `agents/chat_harness.py`, while keeping `BaseAgent` only as a compatibility shim.
+- Refactored the FastAPI startup, readiness, and send-message flow in `main.py`, `utils/diagnostics.py`, and `services/chat_turns.py` so the app layer now talks to harness-level contracts and normalized failures instead of catching OpenAI SDK exceptions directly.
+- Adapted the shipped OpenAI path in `agents/openai_agent.py` to expose explicit harness identity, normalized `run()` behavior, and harness-owned observability metadata without changing the current non-streaming chat behavior.
+- Updated contributor-facing guidance in `README.md` and `plans/PHASE 3 DESIGN.md`, moved `P3-01` out of `plans/PHASE 3 BACKLOG.md`, and recorded the shipped slice in `plans/done/PHASE 3 DONE.md`.
+- Verification passed with `uv run ruff check .`, `uv run mypy .`, and `uv run python -m pytest` (`178 passed`).
+
 ### Ship Phase 2 Test And Documentation Expansion
 
 - Added repository, service, and route regression coverage for replayed `failed` and `conflicted` requests, duplicate `processing` requests, archived target rejection, and archived mid-flight conflict handling.
 
@@ -21,7 +21,7 @@ Those documents define the long-term direction and maturity phases. This README
 - In-flight request locking plus persisted request IDs so duplicate submissions are replayed instead of being processed twice.
 - Lightweight loading feedback while switching chats.
 - Inline failure handling for validation, service-unavailable, and transport-error states.
-- OpenAI-backed agent implementation (`gpt-5-mini` by default).
+- OpenAI-backed chat harness implementation (`gpt-5-mini` by default).
 - SQLite-backed chat storage with per-client chat ownership and transcript persistence across reloads and restarts.
 - Prompt-template-driven system and user prompt construction.
 - Neutral `AI Chat` defaults with no implicit domain context beyond the persisted transcript for the active chat.
@@ -43,7 +43,7 @@ Phase 2 is complete when the default app behaves as a durable, browser-cookie-sc
 The repository has moved well past the original Phase 1 boundary. The notes below stay here as historical context for the startup/configuration baseline that still underpins the current app:
 
 - Startup path resolution is project-root-aware rather than dependent on the shell's current working directory.
-- Runtime behavior is configurable through environment variables instead of route-level or agent-level constants.
+- Runtime behavior is configurable through environment variables instead of route-level or harness-level constants.
 - Default CORS behavior matches the current no-auth posture: wildcard origins are allowed, but credentials stay disabled unless you opt into explicit origins.
 - Prompt/template selection, OpenAI model choice, timeout, and compatible temperature settings can be changed without modifying application code.
 - The browser chat flow prevents duplicate sends, uses a minimal typing indicator during normal requests, and renders degraded-service states inline when the backend is unavailable or a request fails.
@@ -172,9 +172,10 @@ Forks that move beyond trusted local or internal use should plan explicit securi
 
 ```
 basic_chat_app/
-├── agents/                 # AI agent implementations
-│   ├── base_agent.py      # Abstract base agent class
-│   └── openai_agent.py    # OpenAI-specific agent implementation
+├── agents/                 # Chat harness contracts and implementations
+│   ├── base_agent.py      # Legacy compatibility shim and harness re-exports
+│   ├── chat_harness.py    # Core ChatHarness contract and normalized types
+│   └── openai_agent.py    # OpenAI-specific harness implementation
 ├── persistence/           # SQLite bootstrap and chat repository code
 ├── static/                # Static assets
 │   ├── css/              # CSS styles
@@ -245,10 +246,12 @@ uv run pre-commit install --hook-type pre-commit --hook-type pre-push
 
 ### Adding New Features
 
-1. **New Agent Types**: Extend the `BaseAgent` class in `agents/base_agent.py`
+1. **New Harness Types**: Implement `ChatHarness` in `agents/chat_harness.py`. `BaseAgent` remains available only as a compatibility shim for legacy `process_message()` implementations.
 2. **Custom Prompts**: Add new templates in `templates/prompts/<agent_type>/`
 3. **UI Components**: Add new components in `templates/components/`
 
+The application layer should own routing, persistence, idempotent turn lifecycle, and HTML rendering. The harness layer should own normalized request/result/failure contracts, observability metadata, prompt assembly, and provider-facing execution.
+
 ### Configuration
 
 - Logging configuration can be modified in `utils/logging_config.py`
@@ -276,7 +279,7 @@ For the default no-auth baseline, keep `CORS_ALLOW_CREDENTIALS=false`. If you en
 
 - Prompts: edit `templates/prompts/openai/` to change the default system or user prompt behavior.
 - Model and runtime settings: use environment variables first, then `utils/settings.py` if you need to change the supported configuration surface.
-- Provider wiring: edit `agents/openai_agent.py` to change OpenAI-specific request construction or swap in a different agent implementation behind the existing app contract.
+- Provider wiring: edit `agents/openai_agent.py` for OpenAI-specific request construction, or add a new implementation in `agents/` behind the `ChatHarness` contract without changing the route layer.
 - Chat UI behavior: edit `templates/components/chat.html`, `static/js/chat.js`, and `static/css/chat.css`.
 - Visual baselines: update `tests/e2e/snapshots/` only when a deliberate UI change is accepted.
 
 
@@ -1,4 +1,29 @@
-from .base_agent import BaseAgent
+from .base_agent import (
+    BaseAgent,
+    ChatHarness,
+    ChatHarnessCapabilities,
+    ChatHarnessExecutionError,
+    ChatHarnessEvent,
+    ChatHarnessFailure,
+    ChatHarnessIdentity,
+    ChatHarnessObservability,
+    ChatHarnessRequest,
+    ChatHarnessResult,
+    ConversationTurn,
+)
 from .openai_agent import OpenAIAgent
 
-__all__ = ['BaseAgent', 'OpenAIAgent'] 
+__all__ = [
+    "BaseAgent",
+    "ChatHarness",
+    "ChatHarnessCapabilities",
+    "ChatHarnessExecutionError",
+    "ChatHarnessEvent",
+    "ChatHarnessFailure",
+    "ChatHarnessIdentity",
+    "ChatHarnessObservability",
+    "ChatHarnessRequest",
+    "ChatHarnessResult",
+    "ConversationTurn",
+    "OpenAIAgent",
+]
@@ -1,33 +1,27 @@
-from abc import ABC, abstractmethod
-from dataclasses import dataclass
-from typing import Literal, Sequence
+from .chat_harness import (
+    BaseAgent,
+    ChatHarness,
+    ChatHarnessCapabilities,
+    ChatHarnessExecutionError,
+    ChatHarnessEvent,
+    ChatHarnessFailure,
+    ChatHarnessIdentity,
+    ChatHarnessObservability,
+    ChatHarnessRequest,
+    ChatHarnessResult,
+    ConversationTurn,
+)
 
-
-@dataclass(frozen=True)
-class ConversationTurn:
-    role: Literal["user", "assistant"]
-    content: str
-
-class BaseAgent(ABC):
-    """Abstract base class for all agents"""
-    
-    @property
-    @abstractmethod
-    def display_name(self):
-        """Return the display name for the agent to be shown in the header"""
-        pass
-    
-    @property
-    @abstractmethod
-    def model_display_name(self):
-        """Return a user-friendly display name for the model"""
-        pass
-    
-    @abstractmethod
-    def process_message(
-        self,
-        message: str,
-        conversation_history: Sequence[ConversationTurn] | None = None,
-    ) -> str:
-        """Process a user message and return a response"""
-        pass
+__all__ = [
+    "BaseAgent",
+    "ChatHarness",
+    "ChatHarnessCapabilities",
+    "ChatHarnessExecutionError",
+    "ChatHarnessEvent",
+    "ChatHarnessFailure",
+    "ChatHarnessIdentity",
+    "ChatHarnessObservability",
+    "ChatHarnessRequest",
+    "ChatHarnessResult",
+    "ConversationTurn",
+]
@@ -0,0 +1,209 @@
+from __future__ import annotations
+
+from abc import ABC, abstractmethod
+from dataclasses import dataclass, field
+from typing import Literal
+from collections.abc import Iterator, Sequence
+
+
+FailureCode = Literal[
+    "rate_limited",
+    "authentication_failed",
+    "timeout",
+    "connection_error",
+    "invalid_request",
+    "provider_error",
+    "empty_response",
+    "unexpected_error",
+]
+
+EventType = Literal["output_text", "completed", "failed"]
+
+
+@dataclass(frozen=True)
+class ConversationTurn:
+    role: Literal["user", "assistant"]
+    content: str
+
+
+@dataclass(frozen=True)
+class ChatHarnessIdentity:
+    key: str
+    display_name: str
+    model_display_name: str
+    provider_name: str | None = None
+    version: str | None = None
+
+
+@dataclass(frozen=True)
+class ChatHarnessCapabilities:
+    supports_streaming: bool = False
+    supports_tools: bool = False
+    supports_context_builders: bool = False
+
+
+@dataclass(frozen=True)
+class ChatHarnessObservability:
+    model: str | None = None
+    provider: str | None = None
+    request_id: str | None = None
+    tags: dict[str, str] = field(default_factory=dict)
+
+
+@dataclass(frozen=True)
+class ChatHarnessFailure:
+    code: FailureCode
+    message: str
+    retryable: bool
+    detail: str | None = None
+
+
+@dataclass(frozen=True)
+class ChatHarnessRequest:
+    message: str
+    conversation_history: tuple[ConversationTurn, ...] = ()
+    request_id: str | None = None
+    chat_session_id: int | None = None
+    client_id: str | None = None
+    metadata: dict[str, str] = field(default_factory=dict)
+
+    def __post_init__(self) -> None:
+        object.__setattr__(self, "conversation_history", tuple(self.conversation_history))
+        object.__setattr__(self, "metadata", dict(self.metadata))
+
+
+@dataclass(frozen=True)
+class ChatHarnessResult:
+    output_text: str | None = None
+    finish_reason: str = "completed"
+    failure: ChatHarnessFailure | None = None
+    observability: ChatHarnessObservability = field(default_factory=ChatHarnessObservability)
+    metadata: dict[str, str] = field(default_factory=dict)
+
+    def __post_init__(self) -> None:
+        object.__setattr__(self, "metadata", dict(self.metadata))
+        if self.failure is None and not self.output_text:
+            raise ValueError("Successful harness results require output_text.")
+        if self.failure is not None and self.output_text is not None:
+            raise ValueError("Failed harness results cannot include output_text.")
+
+
+@dataclass(frozen=True)
+class ChatHarnessEvent:
+    event_type: EventType
+    output_text: str | None = None
+    failure: ChatHarnessFailure | None = None
+    observability: ChatHarnessObservability = field(default_factory=ChatHarnessObservability)
+    sequence: int = 0
+    metadata: dict[str, str] = field(default_factory=dict)
+
+    def __post_init__(self) -> None:
+        object.__setattr__(self, "metadata", dict(self.metadata))
+
+
+class ChatHarnessExecutionError(RuntimeError):
+    """Raised when a harness fails with a normalized failure."""
+
+    def __init__(self, failure: ChatHarnessFailure):
+        self.failure = failure
+        super().__init__(failure.message)
+
+
+class ChatHarness(ABC):
+    """App-facing contract for harness implementations."""
+
+    @property
+    @abstractmethod
+    def identity(self) -> ChatHarnessIdentity:
+        """Return stable harness identity and display metadata."""
+
+    @property
+    def capabilities(self) -> ChatHarnessCapabilities:
+        return ChatHarnessCapabilities()
+
+    @abstractmethod
+    def run(self, request: ChatHarnessRequest) -> ChatHarnessResult:
+        """Execute one harness request and return the normalized result."""
+
+    def run_events(self, request: ChatHarnessRequest) -> Iterator[ChatHarnessEvent]:
+        result = self.run(request)
+        if result.output_text is not None:
+            yield ChatHarnessEvent(
+                event_type="output_text",
+                output_text=result.output_text,
+                observability=result.observability,
+                metadata=result.metadata,
+                sequence=0,
+            )
+        if result.failure is not None:
+            yield ChatHarnessEvent(
+                event_type="failed",
+                failure=result.failure,
+                observability=result.observability,
+                metadata=result.metadata,
+                sequence=1,
+            )
+            return
+        yield ChatHarnessEvent(
+            event_type="completed",
+            output_text=result.output_text,
+            observability=result.observability,
+            metadata=result.metadata,
+            sequence=1,
+        )
+
+
+class BaseAgent(ChatHarness, ABC):
+    """Compatibility layer for the legacy non-harness agent interface."""
+
+    @property
+    @abstractmethod
+    def display_name(self) -> str:
+        """Return the display name for the agent to be shown in the header."""
+
+    @property
+    @abstractmethod
+    def model_display_name(self) -> str:
+        """Return a user-friendly display name for the model."""
+
+    @property
+    def identity(self) -> ChatHarnessIdentity:
+        return ChatHarnessIdentity(
+            key=self.__class__.__name__.lower(),
+            display_name=self.display_name,
+            model_display_name=self.model_display_name,
+        )
+
+    def run(self, request: ChatHarnessRequest) -> ChatHarnessResult:
+        try:
+            response_text = self.process_message(
+                request.message,
+                request.conversation_history,
+            )
+        except ValueError:
+            raise
+        except Exception as exc:
+            raise ChatHarnessExecutionError(self.normalize_exception(exc)) from exc
+        return ChatHarnessResult(
+            output_text=response_text,
+            observability=ChatHarnessObservability(
+                model=self.model_display_name,
+                request_id=request.request_id,
+            ),
+        )
+
+    def normalize_exception(self, exc: Exception) -> ChatHarnessFailure:
+        return ChatHarnessFailure(
+            code="unexpected_error",
+            message="Harness execution failed.",
+            retryable=False,
+            detail=str(exc),
+        )
+
+    @abstractmethod
+    def process_message(
+        self,
+        message: str,
+        conversation_history: Sequence[ConversationTurn] | None = None,
+    ) -> str:
+        """Process a user message and return a response."""