research-scout

research-scout turns Claude Code into a private research analyst. Arm your own research materials (PDFs, YouTube transcripts, articles) to control the evidence base, then run a single-domain pipeline that scouts primary sources by modality, chases each one down, and adversarially verifies it before it survives. Point it at a domain (Finance, Healthcare, Energy) with optional focus keywords and you get a structured, primary-sourced findings summary entirely under your control — weak findings get killed, not padded.

Prerequisites

Python 3.11+ and Node 18+
Poppler — required for the armed-uploads PDF workflow. Claude Code reads PDFs via pdftoppm (part of poppler); without it, uploaded PDFs cannot be mined and the uploads scout reports DEGRADED: PDF unreadable — poppler not installed. Markdown/text/URL uploads work without it.

OS Install

Windows winget install poppler (or choco install poppler), then reopen the terminal so pdftoppm is on PATH

macOS brew install poppler

Linux sudo apt install poppler-utils

Verify with pdftoppm -v.

Quickstart

git clone https://github.com/shiviancodes/research-scout
cd research-scout

First time only:

.\setup.ps1     # Windows
./setup.sh      # Mac / Linux

Every time:

.\start.ps1     # Windows
./start.sh      # Mac / Linux

Or with make:

make setup   # first time only
make start   # every time
make stop    # when done

Open the URL printed by the start script (default: http://localhost:5173)

Research runs are driven by the /research slash command in Claude Code — one domain per run.

(Optional) Arm uploads — drop materials into inputs/<domain>/ and set that domain's flag to true in inputs/settings.json (or use the dashboard Sources panel). Armed material becomes an extra scout modality for the next run, and the flag resets once consumed.
Open Claude Code at the project root and run: claude
Run the pipeline:
```
/research energy "grid storage, municipal billing"
```
- First argument is the domain — finance | healthcare | energy.
- Second argument (optional) is focus keywords that steer scouts and triage.
- Append dry for a cheap 2-scout / 2-candidate end-to-end test.
Watch the stages report — scouts (parallel, one per modality) → triage → deep-dive → red-team (kills weak candidates) → assemble → synthesis. The orchestrator prints candidate counts per modality, kills with reasons, and the survivor count.
View results — open the dashboard at http://localhost:5173 and refresh the History tab, or read outputs/<domain>/<YYYY-WNN>-findings.md directly.
(Optional) Edit agents — the Config page lets you adjust the stage agent definitions, edit Standards, and restore factory defaults.

Architecture

flowchart TB
    subgraph input["Input"]
        User["You<br/>Arm uploads (inputs/settings.json)<br/>Run /research domain focus"]
    end
    
    subgraph frontend["Frontend"]
        Dashboard["React Dashboard @ :5173<br/>Home | Config | History | Research"]
    end
    
    subgraph backend["Backend"]
        API["FastAPI @ :8766<br/>/api/inputs | /api/agents | /api/run"]
    end
    
    subgraph processing["AI Processing — /research pipeline (single domain)"]
        direction TB
        Scouts["Phase A · Scouts<br/>parallel, one per modality<br/>regulatory · earnings · audit · tenders · armed-uploads"]
        Triage["Phase B · Triage<br/>dedup → rank → top 10"]
        DeepDive["Phase C · Deep-dive<br/>one subagent per candidate"]
        RedTeam["Phase D · Red-team<br/>adversarial verify · KILL by default"]
        Assemble["Phase E · Assemble<br/>findings + registry + run log"]
        Synthesis["Synthesis<br/>aggregate summary"]
        Scouts --> Triage --> DeepDive --> RedTeam --> Assemble --> Synthesis
    end
    
    subgraph storage["Storage"]
        Files["inputs/ - armed uploads<br/>outputs/ - findings + summaries<br/>findings-registry.json - cross-run memory<br/>.claude/agents/ - stage definitions"]
    end
    
    subgraph output["Output"]
        Findings["Findings<br/>Problem | Source | Why now | Tags"]
    end
    
    User -->|Arm uploads| Dashboard
    User -->|Run /research| Scouts
    Dashboard -->|API calls| API
    API -->|Serves| Dashboard
    Files -->|Exclusion list| Scouts
    Scouts -->|Read armed sources + web| Files
    Assemble -->|Write findings + registry| Files
    Files -->|Serve| API
    Assemble -->|Generate| Findings
    Dashboard -->|View| Findings

Customising your quality bar

prompts/STANDARDS.md defines what makes a finding credible:

Non-negotiables: Real named problem, primary source, defensible why-now
Disqualifiers: No primary source, pure trend chasing, already solved
Positive signals: sa-angle, regulatory-shift, contrarian, data-moat (help readers prioritise)

Edit directly or use Config → Standards in the dashboard. Changes take effect on the next run.

Key features

Single-domain pipeline

One run researches one domain through six stages: parallel scouts (one per source modality) → triage (dedup + rank to a shortlist) → deep-dive (one subagent per candidate, chasing the primary source) → red-team (adversarial verification) → assemble → synthesis. Driven by the /research slash command.

Adversarial verification

The red-team stage independently tries to kill every candidate and defaults to KILL when uncertain. Thin weeks (1–2 survivors) are expected output, not failure — better empty than generic. Weak findings are dropped, never padded.

Cross-run memory

Every candidate that reaches the red-team — survived or killed — is recorded in outputs/findings-registry.json. The next run builds an exclusion list from it so scouts don't resurface the same problems week over week.

Focus keywords

The optional second argument to /research steers scouts and triage toward a theme (e.g. "grid storage, municipal billing") without hard-filtering — a strong off-focus signal still beats a weak on-focus one.

Armed-Source Workflow

Arm your own research materials (PDFs, YouTube transcripts, markdown) per domain via inputs/settings.json (or the dashboard Sources panel). When armed, your uploads become an extra scout modality for the next run, and the flag resets once consumed.

Config Page

Edit agent definitions in-browser:

Standards — view/edit the quality bar reference
Scout, Deep-dive, Red-team, Synthesis — edit the stage agent definitions
Save posts changes; Restore-to-Default pulls factory copies

Findings Format & lint hook

Standardised four-field findings (Problem, Source, Why now, Tags) make output actionable and machine-readable. A PostToolUse hook (scripts/lint_findings.py) checks the format mechanically on every findings-file write.

Run Summary

The synthesis agent aggregates findings by tag. No scoring, no tiers — humans decide what to investigate.

Project structure

.claude/agents/             Stage agent definitions (scout, deep-dive, red-team, synthesis)
.claude/commands/           /research pipeline orchestrator
.claude/settings.json       PostToolUse lint hook on findings writes
backend/                    FastAPI - serves outputs/ + agent/source endpoints
frontend/                   React + Vite dashboard
scripts/                    Pipeline helpers (lint_findings, update_registry, log_run,
                            check_links, validate_domain) 
outputs/                    Generated artefacts
  ├─ energy/                Per-domain weekly findings (+ .run/ scratch dirs)
  ├─ finance/
  ├─ healthcare/
  ├─ summary/               Aggregated summaries per run
  ├─ findings-registry.json Cross-run finding memory (survivors + kills)
  └─ run-log.jsonl          Per-run telemetry
prompts/
  ├─ STANDARDS.md           Quality bar
  └─ sources/               Per-domain source packs, by scout modality
defaults/agents/            Factory copies of stage agents (restore-to-default)
inputs/                     Armed research materials per domain (+ settings.json arming)
  ├─ energy/
  ├─ finance/
  └─ healthcare/
docs/                       Reference materials

Port configuration

Service	Default	Variable
Backend	8766	`BACKEND_PORT`
Frontend	5173	`FRONTEND_PORT`

Both are set in .env at the project root (copy .env.example to get started). Vite runs with strictPort: true - if a port is taken it fails loudly rather than silently shifting.

License

MIT - see LICENSE.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

research-scout

Prerequisites

Quickstart

Architecture

Customising your quality bar

Key features

Single-domain pipeline

Adversarial verification

Cross-run memory

Focus keywords

Armed-Source Workflow

Config Page

Findings Format & lint hook

Run Summary

Project structure

Port configuration

License

About

Uh oh!

Releases 4

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.claude		.claude
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
backend		backend
defaults/agents		defaults/agents
docs		docs
frontend		frontend
inputs		inputs
outputs		outputs
prompts		prompts
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
setup.ps1		setup.ps1
setup.sh		setup.sh
start.ps1		start.ps1
start.sh		start.sh
stop.ps1		stop.ps1
stop.sh		stop.sh

OS	Install
Windows	`winget install poppler` (or `choco install poppler`), then reopen the terminal so `pdftoppm` is on `PATH`
macOS	`brew install poppler`
Linux	`sudo apt install poppler-utils`

Folders and files

Latest commit

History

Repository files navigation

research-scout

Prerequisites

Quickstart

Architecture

Customising your quality bar

Key features

Single-domain pipeline

Adversarial verification

Cross-run memory

Focus keywords

Armed-Source Workflow

Config Page

Findings Format & lint hook

Run Summary

Project structure

Port configuration

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Uh oh!

Contributors

Uh oh!

Languages