Loupe

Auditor-credible AI lenses for code review.

Threat models, hazard analyses, privacy reviews. Same platform, different lens.

Docs · Quickstart · Architecture · Verification · Decisions · Changelog

Loupe pipeline diagram.
_{Source code is inspected by capability lenses (SBOM, CVE, Secret, SAST, ThreatLens); findings flow into a hash-chained .loupe/ artefact pack; loupe verify confirms each block.}

On every pull request, Loupe runs a Pydantic-typed agent over a composable stack of SBOM, CVE, secret, and SAST scanners. Findings land as Git-tracked YAML in .loupe/, the run record is hash-chained, and a sticky PR comment posts the summary.

loupe verify replays the whole chain from in-repo state alone. No SaaS, no telemetry, no hidden cache. You pick the LLM, you pick the tools; the configuration is the evidence.

Table of contents

The problem
The bet
Compared to what's out there
The mental model
Architecture at a glance
Quickstart · GitHub Action
What lives in .loupe/
Sample output
You pick the LLM. You pick the tools.
Defence in depth (four layers)
Status
Why now: the CRA timeline
Where to find things
Built on
Contributing · Security · Licence

The problem

Software-engineering risk activities (threat modelling, hazard analysis, privacy review) share a structural failure mode. Artefacts drift out of sync with the code they describe. The work needed to keep them current is the slow, careful, evidence-shaped work most engineers will not do without external pressure.

Two failure modes show up everywhere.

Stale documents. A wiki page from 2023 still claims the service uses session cookies, two years after the OAuth migration. An auditor reads it. The auditor is satisfied. The document is wrong.

Compliance theatre. A scanner emits a 200-page PDF on every commit. Nobody reads it. The process runs. No actual analysis happens.

The EU Cyber Resilience Act (fully applicable December 2027) makes both worse. Manufacturers must maintain a current risk assessment, a current secure-by-design rationale, a current SBOM, and current per-vulnerability impact statements. Living documents, not point-in-time PDFs filed after release.

The bet

Claim	Shape
AI agents can do the slow, careful, evidence-shaped work	… provided their outputs are structured, auditable, and human-reviewed at the boundaries that matter
A plugin platform with separate domain lenses is the right shape	Threat modelling, safety analysis, privacy review have different methods but share infrastructure (diff parsing, SBOM generation, artefact storage, enforcement, MCP exposure)
The platform must be auditor-credible from day one	Standards-conformant outputs (CycloneDX, OpenVEX, STRIDE), Git-versioned artefacts, hash-chained run records, write-boundary enforcement in code (not policy)
The bet is falsifiable	Hand a Loupe-produced threat model and a senior-engineer-produced one for the same PR to a third-party auditor blinded to authorship. If they can distinguish them on artefact quality, the bet fails. The benchmark methodology is recorded in D-19.

Compared to what's out there

	Loupe	StrideGPT	IriusRisk	Threat Dragon
In-repo artefacts	✓ Git-tracked	✗ web UI	✗ SaaS DB	~ JSON export
Hash-chained audit trail	✓	✗	✗	✗
Multi-LLM	✓ 8 providers	~ OpenAI-only	n/a	n/a
Auditor-replayable	✓ `loupe verify`	✗	✗	✗
OSS, no SaaS	✓ Apache 2.0	~ partial	✗ commercial	✓ OWASP

_{Row-by-row breakdown with version-pinned sources, snapshot date, and methodological caveats: docs/comparison.md.}

The mental model

The shortest correct answer to "what is Loupe?" is a medical-clinic metaphor that holds all the way down.

A pull request rolls in (the patient). The clinic decides which specialists should see this patient. The specialists order the diagnostic tests they need. The clinic writes up a chart an external auditor can read.

Role	What it is	What it does	In Loupe
Clinic	Platform	Orchestrates, files paperwork, enforces boundaries	`loupe-core`
Specialist	Lens	Brings domain expertise, decides what to look for	`ThreatLens` (security), `SafetyLens` (future), `PrivacyLens` (future)
Diagnostic test	Capability	A tool category specialists can order	`sbom`, `cve`, `secret_detect`, `static_analysis`
Test machine	Backend	The specific tool that performs the test	`syft`, `cdxgen`, `grype`, `trufflehog`, `semgrep`, …
Chart	Artefact	The patient's medical record	Files in `.loupe/`

Architecture at a glance

flowchart TB
    PR["Pull request"]
    Action["loupe-action<br/>(GitHub Action wrapper)"]
    Core["loupe-core<br/>(coordinator · enforcement · run records)"]

    subgraph lenses["Lenses (domain plugins)"]
        ThreatLens["ThreatLens<br/>(v1, shipped)"]
        Future["SafetyLens · PrivacyLens · AIRiskLens<br/>(planned)"]
    end

    subgraph caps["Capabilities"]
        SBOM["sbom"]
        CVE["cve"]
        Secret["secret_detect"]
        SAST["static_analysis"]
    end

    Backends["10 bundled backends<br/>syft · cdxgen · grype · osv-scanner ·<br/>gitleaks · trufflehog · detect-secrets ·<br/>semgrep · codeql · bandit"]

    LLM["LLM (you pick)<br/>Anthropic · OpenAI · Google · Mistral ·<br/>Groq · Ollama · Bedrock · …"]

    Artefacts[(".loupe/<br/>threats · mitigations ·<br/>SBOM · VEX · run records ·<br/>decisions · knowledge graph")]

    PR --> Action --> Core
    Core --> lenses
    Core --> caps
    caps --> Backends
    ThreatLens -.-> LLM
    Core --> Artefacts

    classDef shipped fill:#d4edda,stroke:#155724,stroke-width:2px
    classDef planned fill:#fff3cd,stroke:#856404,stroke-width:1px,stroke-dasharray:5 5
    class ThreatLens,Action,Core,Backends shipped
    class Future planned

Each layer is replaceable without disturbing the others. Swap syft for trivy and lenses do not notice. Add AutoCyberLens for TARA and the platform does not change. Change the GitHub Action wrapper for a GitLab one and the core stays the same.

Quickstart

Warning

Pre-alpha · pre-PyPI. Loupe is wired end-to-end but the API will move before v0.1 tags. Install from source for now; the artefact schemas are the stable surface.

git clone https://github.com/fadi-labib/loupe.git
cd loupe
uv sync --all-packages

Pre-PyPI, the CLI lives in the workspace venv. Invoke it against your project with uv run --project /path/to/loupe; post-v0.1 the prefix collapses to bare loupe.

cd ~/projects/your-repo
uv run --project /path/to/loupe loupe init                       # scaffold .loupe/
$EDITOR .loupe/context.md                                        # describe your product (anti-hallucination anchor)
export ANTHROPIC_API_KEY="sk-ant-…"                              # or OPENAI_API_KEY / GOOGLE_API_KEY / etc.
uv run --project /path/to/loupe loupe doctor                     # preflight: keys, binaries, config
git diff main... > /tmp/pr.diff
uv run --project /path/to/loupe loupe ci --diff-file /tmp/pr.diff

loupe doctor reads one line per check. Before loupe init you see the missing-scaffold case:

[✓] git binary: found on PATH
[✗] .loupe/ directory: .loupe not found — run `loupe init` first
[-] config.yaml parse: skipped — depends on .loupe/ existing
[-] context.md: skipped — depends on .loupe/ existing
[✓] provider key (anthropic): ANTHROPIC_API_KEY is set
[-] required capabilities: skipped — config.yaml didn't parse

2 ok · 0 warn · 3 skip · 1 fail

After loupe init and filling context.md, all skipped rows resolve to [✓]. Each [✗] row carries the actual fix in its detail message; the Troubleshooting recipe walks through the failure modes for each line.

A full walk-through with screenshots is in the Quickstart.

GitHub Action

Report-only — Loupe runs on every PR and posts a sticky comment; nothing is committed back to the repo:

# .github/workflows/loupe.yml
on:
  pull_request:

jobs:
  loupe:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: fadi-labib/loupe/packages/loupe-action@main   # post-v0.1: fadi-labib/loupe-action@v0.1
        with:
          pr: ${{ github.event.pull_request.number }}
          comment_mode: sticky                # or 'new' or 'none'
        env:
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

Recommended for production — opt into Layer 2 enforcement. The Action commits .loupe/ artefacts to a loupe/proposal-<pr-id> side branch and opens a sub-PR for review. The LOUPE_PAT is a fine-grained GitHub PAT scoped contents: write only on refs matching loupe/proposal-*, so the Action cannot push anywhere else; missing PAT is a fail-fast (exit 64), not a fallback to GITHUB_TOKEN:

      - uses: fadi-labib/loupe/packages/loupe-action@main
        with:
          pr: ${{ github.event.pull_request.number }}
          auto_commit_loupe_dir: true                       # push .loupe/ to a side branch; open sub-PR
          commit_author: "Loupe Bot <loupe-bot@example.com>"
        env:
          GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
          ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
          LOUPE_PAT: ${{ secrets.LOUPE_PAT }}               # required when auto_commit_loupe_dir is true

One-time PAT setup (~3 min) including the exact scope to grant and a gh api curl test for the scope: docs/reference/data-handling.md#layer-2-pat-setup. Mechanics and rationale: D-08 and D-27.

Exit code	Meaning
`0`	Clean run; no severity in `ci.fail_on` triggered
`1`	Gate failure: a threat at a `ci.fail_on` severity was reported
`64`	Usage / configuration error (BSD `sysexits.h` `EX_USAGE`); also raised when `auto_commit_loupe_dir: true` is set without `LOUPE_PAT`

The Action publishes eight outputs (findings_count, severity-specific counts, run_id, run_hash, exit_code). See action.yml for the full schema.

What lives in `.loupe/`

Note

The audit pack is plain files, Git-tracked, no SaaS, no hidden state, no telemetry. An auditor can replay every run from in-repo state alone; the runs/*.json hash chain detects history rewrites.

File	Who writes it	What it is
`context.md`	Human	Product brief; the anti-hallucination anchor every lens reads
`threats.yaml` + `threat-model.md`	`ThreatLens`	STRIDE threats, machine-readable + narrative
`mitigations.yaml`	`ThreatLens`	Mitigations cross-referencing threats
`sbom.cdx.json`	Syft (or cdxgen, or your pick)	CycloneDX 1.6 SBOM
`vex.json`	`ThreatLens` + human approval	OpenVEX vulnerability statements
`runs/*.json`	`loupe-core`	Hash-chained audit trail of every invocation
`decisions/D-*.md`	Human	ADR-style risk acceptances
`knowledge.yaml`	`loupe-core`	Persistent cross-run knowledge graph
`config.yaml`	Human	Operator settings: which lenses, which backends, which gates

Full schemas in docs/reference/schemas/.

Sample output

What ThreatLens actually writes to .loupe/threats.yaml:

schema_version: 1
threats:
  - id: T-001
    element_id: E-001
    stride_category: I            # Information disclosure
    title: API endpoint leaks user IDs in error messages
    description: |
      The /api/users endpoint returns stack traces in 500 responses,
      revealing internal user IDs and database schema names.
    severity: high
    status: proposed              # human moves it through accepted / mitigated / accepted_risk / rejected
    mitigation_ids: [M-005]
    cwe_refs: [CWE-209]
    introduced_in_pr: "1234"
    last_reviewed: 2026-05-15
    review_due: 2026-08-15
    rationale: |
      Reviewing the diff for /api/users, I noticed the new exception
      handler raises Exception directly without sanitisation.
    proposed_by: threatlens

Every threat carries a stable T-NNN ID, an element (E-NNN), a STRIDE category, severity, lifecycle status, mitigation cross-references (M-NNN), CWE references, and a rationale the LLM is required to produce. Mitigations, VEX statements, and run records share the same shape: Pydantic-validated, stable IDs, no prose-only fields. Schemas: docs/reference/schemas/.

You pick the LLM. You pick the tools.

Tip

Multi-LLM by design. PydanticAI ships Anthropic, OpenAI, Google, Mistral, Groq, Cohere, Ollama, Bedrock natively. Set THREATLENS_MODEL=openai:gpt-5 and you are done. No code change.

No tool lock-in either. Every non-LLM tool is a Capability Protocol. Five composition modes (single, fallback, union, consensus, pipeline) let you say "run TruffleHog and gitleaks and merge" or "require two of three SAST scanners to agree" in config.yaml. The configuration is the evidence.

What that looks like in .loupe/config.yaml

capabilities:
  sbom:
    mode: single                                       # any SBOM tool is fine
    backends: [syft, cdxgen]
  cve:
    mode: union                                        # different DBs catch different CVEs
    backends: [grype, osv-scanner]                     # merge results, dedupe by CVE ID
  secret_detect:
    mode: union                                        # belt and braces; false negatives are catastrophic
    backends: [trufflehog, gitleaks, detect-secrets]
  static_analysis:
    mode: consensus                                    # require two analysers to agree
    consensus_threshold: 2
    backends: [semgrep, codeql, bandit]

An auditor reads this and knows the team's posture without grepping any code.

Defence in depth (four layers)

Loupe commits to eleven principles, each numbered, non-negotiable, and paired with a mechanical verification recipe in docs/reference/verification.md. The four enforcement layers below are how those principles are kept.

Layer	Status	What it protects
1 Tool surface	Shipped	Agent has only `write_agent_artifact` (allow-list) + `propose_patch` (writes to `.proposed/`). `PathBoundary` enforced in Python, not in a prompt. Parent-symlink-safe via `dir_fd` + `O_NOFOLLOW`.
2 Branch namespace	Shipped	`LOUPE_PAT` scoped `contents: write` only on refs matching `loupe/proposal-*`; CODEOWNERS gates `.loupe/context.md`, `.loupe/decisions/`, `.loupe/config.yaml`, `.loupe/knowledge.yaml`. Auto-commit opt-in via `auto_commit_loupe_dir: true`.
3 `loupe verify`	Shipped	Four checks: hash chain, artefact schemas, cross-references, protected-path authorship (`--strict`)
4 Interactive UX gate	Shipped	`loupe chat` walks each staged `.loupe/.proposed/<...>.patch` through a `[y/N/edit/skip]` prompt (default-N); `.proposed/` → `.applied/` / `.skipped/` via `git mv`. No `--auto-confirm`. Full lifecycle: `docs/concepts/proposals.md`

See docs/reference/verification.md for the mechanical recipes an auditor runs against each principle.

Status

Shipped today

Every row in this table is a claim. The third column is the command that proves it.

Surface	What's in it	Verified by
Platform	`loupe-core`: coordinator, dispatcher, run context, prompt builder, MCP server, pricing, enforcement	`uv run pytest packages/loupe-core/ -q`
CLI	`loupe init`, `ci`, `verify [--strict]`, `scan`, `mcp`, `chat`, `lens list`, `cap list`, `doctor`	`uv run loupe --help`
GitHub Action	PR fetch, `loupe ci` runner, sticky-comment poster (sticky / new / none), retry + rate-limit handling	`packages/loupe-action/action.yml`
ThreatLens	PydanticAI agent wired to a live LLM. User runs hit the configured provider; the project's own test suite uses VCR cassettes to keep CI deterministic. STRIDE threats, mitigations, cross-references	`uv run pytest packages/loupe-threatlens/ -q`
MCP server (stdio, 11 tools)	Core read: `list_threats`, `get_threat`, `list_mitigations`, `get_mitigation`, `list_elements`, `latest_run`. ThreatLens read: `threatlens_query_by_stride`, `threatlens_query_by_severity`, `threatlens_summary`. ThreatLens write (Layer-1 gated): `threatlens_propose_threat`, `threatlens_propose_mitigation`	`uv run loupe mcp`
Capability backends	10 bundled: Syft + cdxgen (SBOM), Grype + osv-scanner (CVE), gitleaks + TruffleHog + detect-secrets (secret), Semgrep + CodeQL + Bandit (SAST)	`uv run loupe cap list`
Composition modes	`single`, `fallback`, `union`, `consensus`, `pipeline`	`docs/reference/config.md`
Audit trail	Hash-chained run records, `loupe verify` Layer 3 checks (chain, schema, cross-refs, authorship)	`uv run loupe verify --strict .loupe/`
Cost discipline	Stable-prefix prompt caching, blackboard, dispatch skipping, RunRecord token/cost telemetry	`test_cost_regression.py`

In flight (before v0.1 tags)

HTTP+SSE remote MCP transport (stdio works today)
Benchmarks against Cesanta Mongoose (Tier 1) and Eclipse Mosquitto (Tier 2). See D-19

Anticipated

SafetyLens: functional-safety hazard analysis (ISO 26262, IEC 61508, IEC 62304)
PrivacyLens: data-protection review (GDPR DPIA, LINDDUN)
AIRiskLens: AI/ML risk (NIST AI RMF, EU AI Act high-risk; possibly MAESTRO)

Why now: the CRA timeline

Up to €15 million or 2.5% of worldwide annual turnover, whichever is higher. That is the maximum administrative fine under Article 64 of Regulation 2024/2847 for non-compliance with the essential cybersecurity requirements. It applies to every manufacturer of "products with digital elements" placed on the EU market.

11 September 2026 — manufacturers must report actively-exploited vulnerabilities and severe incidents via ENISA's Single Reporting Platform, coordinated through their Member State CSIRT.
11 December 2027 — the full regulation applies.

Both regimes require living risk assessments, living SBOMs, and living per-CVE impact statements. Loupe produces all three as in-repo artefacts, on every PR, with audit-grade provenance.

Where to find things

Docs render in-tree on GitHub today; a hosted MkDocs site will go up once the repo is public. In-tree map:

If you want to …	Read
Get something running	Quickstart
Understand the mental model	Architecture · Capabilities · Proposals
Know the principles Loupe will not compromise on	Principles
Verify the principles mechanically	Verification
Look up a CLI command or flag	CLI reference
Configure `.loupe/config.yaml`	Config reference
See each artefact's schema	Schemas
Run the MCP server	How-to: MCP server · MCP tools reference
Understand cost estimation	Pricing reference
Know why a decision was made	Decisions log (D-01 through D-27)
Compare against StrideGPT / IriusRisk / Snyk / others	Comparison
Check what data leaves your repo	Data handling
Look up a term	Glossary
Contribute code or a lens	Contributing
See what changed	CHANGELOG

By role, if you'd rather start from who you are than what you want to do:

If you are …	Read first
An auditor checking Loupe's outputs	Verification → Decisions log → Loupe's own threat model
A maintainer evaluating Loupe vs alternatives	Comparison → The bet → Status → Pricing
A security engineer about to use it	Quickstart → CLI reference → How-to: MCP server → Troubleshooting
A contributor adding a lens or capability	Contributing → Principles → Architecture → Capabilities

Built on

Role	Project	Why this one
Multi-provider agent	PydanticAI	Typed tool calls and multi-provider switch via a single env var — see D-03
Typed I/O	Pydantic 2.x	Same Pydantic shape across agent calls, artefact schemas, and run records
Tool protocol	Model Context Protocol	Stable surface for editor / IDE / agent clients — see D-09 and D-21 for SDK choice
SBOM format	CycloneDX 1.6	ISO/IEC 5962:2021; widest auditor familiarity
Vulnerability statements	OpenVEX 0.2	The format CISA pins for VEX in BOD 23-01
Threat-modelling method	STRIDE	Categorical, mechanically classifiable, well-supported in literature — lineage in D-16
Scanner backends	Syft, cdxgen, Grype, osv-scanner, TruffleHog, gitleaks, detect-secrets, Semgrep, CodeQL, Bandit	Composable via the Capability protocol — see `docs/concepts/capabilities.md`
Workspace + packaging	uv	Hermetic resolver, fast enough that CI overhead is rounding error
Docs site	MkDocs Material	Markdown-first, no JS framework, auditor-readable on GitHub before the site is up — see D-20
Prose linter	Vale	Catches AI-vocabulary drift, em-dash overuse, and bold-prefixed sentence openers in the docs themselves

Full attribution, version pins, and the decision record behind each choice: docs/built-on.md.

Contributing

Workspace layout, test discipline (hermetic, VCR-cassetted), commit conventions, and CI gates: docs/contributing.md. Pre-alpha private; PR contributions welcomed once the repo opens.

Security

The threat model for Loupe itself is at docs/reference/loupe-threat-model.md. Same STRIDE shape Loupe would emit for any other repo. Dogfood. Disclosure policy in SECURITY.md.

Licence

Apache 2.0. Patent grant included (§ 3); attribution preserved under § 4(c). See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 320 Commits
.github		.github
.vale/styles/Loupe		.vale/styles/Loupe
docs		docs
includes		includes
overrides		overrides
packages		packages
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.vale.ini		.vale.ini
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
lychee.toml		lychee.toml
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Loupe

The problem

The bet

Compared to what's out there

The mental model

Architecture at a glance

Quickstart

GitHub Action

What lives in `.loupe/`

Sample output

You pick the LLM. You pick the tools.

Defence in depth (four layers)

Status

Shipped today

In flight (before v0.1 tags)

Anticipated

Why now: the CRA timeline

Where to find things

Built on

Contributing

Security

Licence

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Loupe

The problem

The bet

Compared to what's out there

The mental model

Architecture at a glance

Quickstart

GitHub Action

What lives in .loupe/

Sample output

You pick the LLM. You pick the tools.

Defence in depth (four layers)

Status

Shipped today

In flight (before v0.1 tags)

Anticipated

Why now: the CRA timeline

Where to find things

Built on

Contributing

Security

Licence

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

What lives in `.loupe/`

Packages