LinuxIR Agent

A multi-agent Linux DFIR (digital forensics & incident response) triage system for the SANS "FIND EVIL!" challenge. An analyst opens a browser, gives plain-language case context and evidence paths, and watches specialized agents investigate a mounted evidence tree in parallel — finding persistence, reconstructing the intrusion timeline, analyzing memory and network captures (when the tools are present), enriching indicators with threat intel, and answering the 12 mandatory IR questions — producing a cross-referenced report in an Obsidian vault with honest, audited confidence levels.

Its defining property is that evidence-integrity guardrails are architectural, not prompt-based. A Python ConstraintEnforcer vets every tool call before any subprocess or filesystem write runs. The model never gets the chance to spoliate evidence — the restriction is code at the dispatch layer, not an instruction the model could ignore, jailbreak, or hallucinate past. See docs/architecture.svg for the component diagram with trust-zone labels.

 web GUI ─▶ Coordinator ─▶ disk / log / memory / network specialists   (parallel, own gateway)
                  │ iterate (--max-iterations)   │ every tool_use
                  │                               ▼
                  │                    ToolGateway.dispatch()   ◀── the one chokepoint
                  │                               │
                  │            ConstraintEnforcer → AuditLogger → adapter (real binary / fallback)
                  ▼                               ▼
   auditor (verify vs cited output) ─▶ IR expert (intel + MITRE + re-analysis) ─▶
   persona builder ─▶ reporter (12 IR answers) ─▶ Obsidian vault + JSONL audit

What it does (capabilities)

Web intake GUI — browser form → case-state note in an Obsidian vault (linuxir serve).
Read-only tool surface — persistence (cron/systemd/ssh-keys/setuid/rc/ld.so.preload/ passwd/bash-history/wtmp), logs (auth/lastb/syslog/timeline/gaps), memory (volatility3 + kernel-banner), network (pcap summary/beaconing/dns/http/exfil/creds/tor), threat-intel.
Deterministic self-correction — failed/empty/unavailable tool results yield a logged recovery hint fed back to the model (vol3 retry · empty-result pivot · contradiction reconciliation).
Orchestrator — parallel specialists (each in its own gateway), inter-agent messages logged to agent-messages.jsonl, --max-iterations with graceful partial reports.
Auditor — drops findings unsupported by their cited tool output (anti-hallucination).
IR expert — local-first threat-intel enrichment + MITRE mapping; can request one bounded re-analysis (closing the self-learning loop).
Reporter — the 12 mandatory IR answers (each evidence-cited), plus attacker profile, timeline, IOC/TTP, and recommendations.

Why a hand-rolled tool loop

The agents use the Anthropic SDK with an explicit create → tool_use → dispatch → tool_result loop (linuxir/agents/loop.py) rather than a higher-level tool runner. That is deliberate: it forces every tool call through ToolGateway.dispatch (linuxir/gateway.py), where ConstraintEnforcer.check (linuxir/guardrails/constraints.py) runs first. There is no code path from a model tool request to a subprocess that bypasses it.

The enforcer blocks a call when any of these hold:

the tool name denotes mutation (write_, delete_, rm_, chmod_, truncate_, …);
the tool is not in the read-only registry;
a path argument resolves (via realpath, so .. is neutralized) outside evidence scope;
the bash_readonly escape hatch uses a non-allowlisted binary, a redirect (>/>>), an in-place edit, or a destructive flag;
an output flag (--output-file, -o, of=) appears on a read-only tool.

Proof: the spoliation test (the headline claim)

Reproduces the report's ten write/delete/modify attempts and asserts 10/10 blocked, 10/10 raised as exceptions, 10/10 logged to audit/spoliation-attempts.jsonl:

uv run python -m linuxir.guardrails.spoliation_test
uv run pytest tests/test_spoliation.py -q

Setup

uv sync --extra dev --extra web   # core + pytest + FastAPI/uvicorn (the web GUI)

Optional forensic binaries (the system runs without them — adapters fall back gracefully): volatility3 (pip install volatility3), tshark, sleuthkit, last/lastb/utmpdump, geoiplookup.

Web GUI

uv run linuxir serve              # http://127.0.0.1:8080

Open the page, enter client/context + evidence paths, submit → a case-state.md note lands in the Obsidian vault (via the Local REST API if configured, else a local-file fallback). Endpoints: GET /, POST /case/new, GET /case/{id}/status, GET /cases, GET /healthz.

Run

Three auth modes, selected with --auth / --offline:

Mode	Flag	Cost	Needs
Subscription (default)	`--auth subscription`	$0 per-token (uses your Claude Pro/Max plan limits)	`claude` CLI + `CLAUDE_CODE_OAUTH_TOKEN`
Offline demo	`--offline`	$0, no network	nothing
Billed API	`--auth api`	paid per token	`ANTHROPIC_API_KEY`

Offline demo — full pipeline, scripted client against the bundled evidence fixture (great for proving the flow with zero setup):

uv run linuxir analyze --case cases/sample-case.yaml --offline

Subscription ($0) — the hackathon path. Runs on the Claude Agent SDK authenticated by your Pro/Max subscription, so there is no API key and no per-token billing (just your plan's usage limits). The forensic tools run as an in-process MCP server, built-in Bash/Read/Write/Edit are disabled, and the ConstraintEnforcer still gates every call.

uv run linuxir analyze --case cases/sample-case.yaml                 # --auth subscription is the default
uv run linuxir analyze --case cases/sample-case.yaml --model opus --effort high

Billed API — raw Messages API with a hand-rolled gated loop:

export ANTHROPIC_API_KEY=sk-ant-...
uv run linuxir analyze --case cases/sample-case.yaml --auth api

Setting up the $0 subscription path on a VM (e.g. the SANS DFIR VM in GNOME Boxes)

The Python Agent SDK shells out to the Claude Code CLI, so the VM needs it plus a subscription OAuth token. Browser login can't happen on a headless VM, so mint the token on your normal machine and copy it over:

# 1. On a machine WITH a browser (your laptop), logged into Claude Pro/Max:
npm install -g @anthropic-ai/claude-code // you can also use the curl command on claude's site
claude setup-token          # opens a browser → prints sk-ant-oat01-...  (valid ~1 year)

# 2. On the SANS VM:
npm install -g @anthropic-ai/claude-code          # needs Node 18+
export CLAUDE_CODE_OAUTH_TOKEN=sk-ant-oat01-...    # the token from step 1
unset ANTHROPIC_API_KEY                            # IMPORTANT: it silently overrides the token <- important if you DO NOT want to get charged for API costs.
uv sync --extra dev                                # or: pip install -e .
uv run linuxir analyze --case cases/sample-case.yaml

Notes:

ANTHROPIC_API_KEY takes precedence over the OAuth token and would bill you — the CLI unsets it for you when --auth subscription, but keep it out of your shell to be safe.
Subscription auth is licensed for personal use — run it yourself for the competition; don't ship it as a multi-user service on subscription credentials.
Point evidence_scope in the case file at the mounted evidence (read-only).

Output lands in the case workspace:

vault/report.md + vault/analysis-<agent>.md — Obsidian-style notes (cross-linked).
audit/tool-calls.jsonl — every tool call (allowed/blocked), with its hypothesis and outcome, plus findings and phase events.
audit/spoliation-attempts.jsonl — blocked evidence-mutation attempts.
Corrections/self-learning-log.md — distilled self-corrections (dropped findings, etc.).

A case file

case_id: demo-001
evidence_scope:          # READ-ONLY roots; paths resolve relative to this file
  - ../tests/fixtures/evidence
workspace: ../out/demo-001   # writable: vault, audit, Corrections

Memory images (*.lime/*.raw/…) and pcaps (*.pcap/…) found inside the evidence scope automatically activate the memory and network agents.

How findings stay honest

Hypothesis before execution: every tool call carries a required hypothesis field — what the agent expects to find — recorded to the audit log before the tool runs and compared against the outcome, so surprises surface instead of being rationalized.
Each finding must cite the verbatim tool output it rests on (source_tool_output).
A separate auditor pass (Haiku) judges every finding against that cited output, not against the agent's prose, and drops anything it can't substantiate — caught before the final report. (The demo plants a "meterpreter" claim with no supporting evidence to show this working.)
LOW-confidence or elevated-risk findings are flagged requires_human_review.
The report includes a transparency section listing what the auditor dropped and why.

Layout

linuxir/
  guardrails/constraints.py   ConstraintEnforcer + SpoliationViolation  (the safety core)
  guardrails/spoliation_test.py   10-attack harness
  gateway.py                  ToolGateway.dispatch — the chokepoint (+ self-correction)
  selfcorrect.py              deterministic recovery hints (vol3 / empty-pivot / reconcile)
  adapters/                   base.run_binary + disk / logs / memory / network / intel / geoip
  tools.py                    read-only tool schemas → gateway handlers
  agents/                     loop, base, coordinator, auditor, linux_ir_expert,
                              persona_builder, reporter, {disk,log,memory,network}_agent
  agentsdk_runtime.py         $0 subscription runtime (Claude Agent SDK + in-process MCP)
  web/                        FastAPI intake GUI (server.py + static/index.html)
  obsidian.py casestore.py    vault writer (REST + local fallback) + case intake/state
  findings.py audit.py report.py corrections.py config.py llm.py demo.py cli.py
knowledge/                    linux-techniques · mitre-attack · known-hashes · threat-intel-sources
cases/sample-case.yaml
docs/                         architecture.svg · accuracy-report.md · evidence-dataset.md
tests/                        spoliation · adapters · pipeline · subscription · web · persistence ·
                              logs/memory · network · self-correction · hypothesis · orchestrator ·
                              intel · expert · reporter  =  113 tests

See docs/accuracy-report.md (spoliation + two real-evidence runs) and docs/evidence-dataset.md (what was tested against).

The same gateway, enforcer, tools, prompts, auditor, correlation, and reports are shared by both transports — only how the model is reached differs (raw Messages API loop vs the Agent SDK driving in-process MCP tools).

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
cases		cases
docs		docs
knowledge		knowledge
linuxir		linuxir
out		out
tests		tests
.env.template		.env.template
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LinuxIR Agent

What it does (capabilities)

Why a hand-rolled tool loop

Proof: the spoliation test (the headline claim)

Setup

Web GUI

Run

Setting up the $0 subscription path on a VM (e.g. the SANS DFIR VM in GNOME Boxes)

A case file

How findings stay honest

Layout

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LinuxIR Agent

What it does (capabilities)

Why a hand-rolled tool loop

Proof: the spoliation test (the headline claim)

Setup

Web GUI

Run

Setting up the $0 subscription path on a VM (e.g. the SANS DFIR VM in GNOME Boxes)

A case file

How findings stay honest

Layout

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages