A command-line, terminal-user-interface and MCP Australian & NZ legal research tool, built local-first. jurisd gives an AI assistant a fast, offline-capable recall layer over installed legal data modules — deterministic provision lookup, local semantic search, and a citation graph — and falls back to live AustLII search and an Open Australian Legal Corpus (OALC) layer when the answer is not in a local module.
The design tenet is degrade visibly, never silently: a missing optional dependency, an absent API key, or an uninstalled module disables only the feature that needs it and is reported back, never swallowed. With no key and no network, the local-module recall path still answers.
Status: pre-1.0, day-0 release candidate. 15 MCP tools across live research, citation/bibliography, and local data modules.
jurisd answers Australian (and New Zealand) legal-research questions from an AI assistant. It has three answer sources, tried in precedence order:
- Local data modules (Layer 1) — installed parquet bundles holding legislation and decisions with provision-level structure, citation edges, and chunk embeddings. Answered offline, no network, no key. This is the local-first core: deterministic provision lookup, an Act containment tree, an offline citation graph, and local semantic search.
- Live AustLII (Layer 2) — natural-language case and legislation search over AustLII, with authority-based ranking, paragraph-pinpoint extraction, full-text fetch (HTML + PDF), and AGLC4 citation formatting.
- OALC fallback (Layer 3) — an Open Australian Legal Corpus layer that backs the live layer when a direct fetch is blocked.
jade.io is supported as an optional runtime citation-enhancement source — the citator (citing cases), citation/article resolution, and cross-referencing of live results — when you supply your own session cookie.
npx -y github:russellbrenner/jurisdnpx clones the repository, installs dependencies, builds, and launches the
server over stdio in one step.
claude mcp add jurisd -- npx -y github:russellbrenner/jurisdOr add it to your client config directly:
{
"mcpServers": {
"jurisd": {
"command": "npx",
"args": ["-y", "github:russellbrenner/jurisd"]
}
}
}All environment variables are optional — with none set, the live AustLII layer and the local-module recall layer both work. See docs/INSTALL.md for the local-clone path, every config option, and the offline/baseline guarantee.
A bundled Claude Code skill teaches the agent expert jurisd usage from day 0 — which of the 15 tools to reach for, the local-first/live-fallback rule, AGLC4 citation workflows, and a worked research session. Install it by copying the skill folder into your skills directory:
cp -r skills/jurisd-research ~/.claude/skills/(or your plugin's skills path). The skill activates automatically on legal-research
and AGLC4 prompts once the jurisd MCP server is registered.
15 tools in three groups. Operation variants are selected via a
mode / op / action / by discriminator on the relevant tool.
| Tool | What it does |
|---|---|
search_cases |
Natural-language case-law search across all AU/NZ jurisdictions; authority ranking; title/phrase/boolean methods; pagination. |
search_legislation |
Search AU/NZ legislation with the same method/jurisdiction/sort controls. |
fetch_document_text |
Fetch full text from an AustLII or jade.io URL (HTML, PDF, jade.io via GWT-RPC). |
| Tool | What it does |
|---|---|
format_citation |
Format an AGLC4 citation. mode: full (default), short, ibid, subsequent, pinpoint. |
resolve_citation |
Resolve a citation to its source. mode: auto (default), validate (AustLII existence check), search. |
jade_lookup |
Look up jade.io. by: article_id (resolve metadata) or citation (build a lookup URL). |
search_citing_cases |
Find cases citing a target via the jade.io citator (requires JADE_SESSION_COOKIE). |
cite |
Write to the local citation cache. action: add (default) or refresh_source (conditional-HEAD freshness check). |
bibliography |
Read the local citation cache (no network). op: get, list (default), export (.bib), cited_by. |
cache_cited_by |
Fetch a cached citation's citing cases from jade.io and store them locally (requires JADE_SESSION_COOKIE). |
These five tools serve installed offline data modules. They require the optional
@duckdb/node-api dependency and at least one installed module;
semantic_search_local additionally needs @huggingface/transformers. Every
answer carries metadata.source = "local_module" with the module name, version,
and snapshot date (plus a staleness advisory when the snapshot is old).
| Tool | What it does |
|---|---|
get_provision |
Deterministic provision lookup (e.g. s 18 of an Act). No embedding, no ranking; typed not-found so the router can fall through. |
get_act_structure |
Containment tree of an Act (Act → Part → Division → section/schedule/clause) over act_provision edges, closed-world. |
find_citing |
Offline twin of search_citing_cases: documents in installed modules that cite a target, with each citation's provenance span. |
semantic_search_local |
Vector recall: the query is embedded locally (bge-small, offline, no key) and ranked by cosine over chunk embeddings, with optional facet pre-filters. |
list_data_modules |
Introspect installed modules: coverage, doc/chunk counts, embedding descriptor, load status, snapshot date and staleness. |
Full parameter tables for every tool are in docs/AGENT-GUIDE.md.
jurisd keeps the MCP server as the compatibility surface while the CLI is being reorganised around task-oriented command contracts.
- CLI guide: docs/CLI.md
- MCP compatibility reference: docs/MCP-COMPATIBILITY.md
- Security and authority model: docs/SECURITY-AUTHORITY.md
Existing flat CLI commands remain available during the foundation work.
A data module is a self-describing parquet bundle (documents, chunks, edges,
unmatched citations, plus a manifest.json) published as a GitHub release asset
on the jurisd-data repository. Everything needed to load and query a module —
schema version, coverage, embedding descriptor, file hashes, and licence posture
— is in its manifest. No out-of-band config.
Status: no modules published yet. The
jurisd-datapublishing repo and its first release are still being built, sojurisd fetch-modulehas nothing to download today (it resolves the release and fails fast with a404). The server runs without any module — the live AustLII layer and citation tools work standalone, and the five local-recall tools report "no modules" (degrade visibly). The CLI flow below is implemented and ready for the first publish; this section documents the intended install once modules land.
Modules are queried in place: DuckDB scans the parquet on disk and never materialises a whole table into memory, so a host can install many modules (Commonwealth legislation + per-state + decisions) and stay flat in RSS.
Modules are operator-installed via the CLI (kept off the tool surface so an LLM never triggers a large download mid-conversation):
jurisd fetch-module <name> [--version X.Y.Z] # download + sha256-verify + atomic install
jurisd verify-module <name> # re-verify installed files against the manifest
jurisd list-modules # list installed modules (incl. refused)The default install root is ~/.jurisd/modules/ (override with
JURISD_MODULES_DIR or --modules-dir). fetch-module validates the manifest
and checks the schema version before downloading any parquet, sha256-verifies
every file against the manifest, installs atomically (temp-then-rename, so a
half-written module never appears), and prints the licence attribution lines at
install time.
A module's identity is (name, module_version). The module_version handle
distinguishes a module's variant — a baseline module is the standard
build (deterministic structure, citation edges, bge-small embeddings); a
domain-specialised variant is a build tuned for a particular corpus or task.
Use list_data_modules to see the variant, coverage, and embedding descriptor of
each installed module, and pin a specific one with the module argument on any
recall tool.
semantic_search_local has two optional enhancement slots that operate over
the locally-retrieved top-k results — they never replace local recall, they
refine it:
- rerank — reorder the local top-k by a stronger relevance model.
- extractive-QA — return the best answer span within a retrieved chunk.
Both are expressed through one vendor-neutral DomainAdapter interface. The
distinction is capability presence, framed as baseline vs
domain-specialised with a provider-interpolated display label:
- Baseline (always present): pure local cosine order. No network, no key.
- Domain-specialised (slot): selected only if a provider is configured and
reachable via a BYOK key. With
ISAACUS_API_KEYset and the endpoint reachable, the capability probe reportsdomain_adapter: { label: "Isaacus-enhanced", canRerank: true, canExtractiveQA: true }and responses carrymetadata.enhancement = "Isaacus-enhanced".
If the key is unset, or set-but-unreachable, the adapter degrades to baseline and the tool still returns local cosine results — reported by the probe, never thrown into a tool result.
jurisd's local data layer is built and scored honestly against a gold set. The
jurisd-data gold-set evaluation (to be published alongside the first module
release; the jurisd-data repo is still being built) measures the local enricher
(segments, defined terms, citation crossrefs) against 90 Open Australian Legal
Corpus / Kanon ILDGS documents, under two parallel metrics:
- strict — the conservative audit metric: every typed prediction unmatched within its type is a false positive.
- aligned — the decision metric: a strict false positive whose span co-locates an untyped gold sub-span at IoU ≥ 0.9 is credited as a granularity agreement (a vocabulary disagreement with the silver standard, not an extraction error) rather than penalised.
The current baseline does not yet pass all four gate thresholds (segment F1, citation precision, citation recall, defined-term F1). Headline segment F1 is 0.44 strict / 0.64 aligned against a 0.85 gate. The report localises every gap to a specific rule (the residual segment gap is genuine over-segmentation, chiefly an endnotes-boundary flood; citation precision is internal-ref over-firing on structural lines). It is published in full, both metrics, as the honest current state, not a marketing number.
- Code: Apache-2.0 (see LICENSE). Third-party dependency licences are catalogued in LICENSE-THIRD-PARTY.md.
- Module data: licensed per source, declared in each module's
manifest.jsonlicenceblock, and surfaced atfetch-moduleinstall time. The aggregate is CC-BY-4.0 (Open Australian Legal Corpus), but redistribution is decided per source, not in aggregate:- AustLII-sourced rows are excluded from published modules by default — the AustLII Terms of Service is restrictive, and re-importing it is exactly what the live transport layer routes around. They remain available recipe-only (rebuild locally).
- VIC and NT legislation are not redistributable (Government Printer / Crown copyright, no open licence) — recipe-only.
- Commonwealth (FRL), NSW, QLD, SA, TAS, WA legislation and HCA/FCA/NSW case-law sources are redistributable under the CC-BY-4.0 aggregate, subject to per-source confirmation before each module publishes.
The full per-source verdict table ships as jurisd-data/LICENSING.md with the
first module release (the jurisd-data repo is still being built); each
published module also carries its own licence block in manifest.json, surfaced
at fetch-module install time.
| Document | Description |
|---|---|
| INSTALL.md | Day-0 install paths, Claude Code config, env vars, module flow |
| CLI.md | CLI command shape, compatibility aliases, output rules, exit codes |
| MCP-COMPATIBILITY.md | Compatibility reference for the current MCP tool surface |
| SECURITY-AUTHORITY.md | Command authority, side-effect classes, terminal safety, credential rules |
| jurisd-research skill | Claude Code skill: tool decision guidance, AGLC4 workflows, worked example |
| AGENT-GUIDE.md | Agent-facing usage guide with full tool catalog and examples |
| ARCHITECTURE.md | System architecture, deployment topology, CI/CD |
| DOCKER.md | Docker deployment guide |
| ROADMAP.md | Development history and future plans |
| Code | Jurisdiction |
|---|---|
cth |
Commonwealth of Australia |
federal |
Federal courts (alias for cth) |
vic |
Victoria |
nsw |
New South Wales |
qld |
Queensland |
sa |
South Australia |
wa |
Western Australia |
tas |
Tasmania |
nt |
Northern Territory |
act |
Australian Capital Territory |
nz |
New Zealand |
other |
All jurisdictions (no filter) |
Once connected, ask natural-language questions:
- "Find the High Court decision in Mabo v Queensland (No 2) and explain native title."
- "Search for recent NSW cases about defamation on social media."
- "What does section 18 of the Australian Consumer Law say?" (answered offline if the module is installed)
- "Find cases that cite Mabo v Queensland (No 2)."
- "Format
Mabo v Queensland (No 2) [1992] HCA 23 (1992) 175 CLR 1per AGLC4 at [64]." - "Compare how Victoria and NSW courts have treated non-compete clauses."
git clone https://github.com/russellbrenner/jurisd.git
cd jurisd
npm install
npm run dev # hot reload
npm run build # TypeScript compile
npm start # run the built server
npm test # unit + integration + perf (integration hits live services)
npm run lint # ESLint (flat config)./build.sh # build the image
docker-compose up # run locallySee docs/DOCKER.md for details.
./build.sh
# import the image to k3s nodes (see k8s/README.md)
./deploy-k8s.shSee k8s/README.md.
See CONTRIBUTING.md and SECURITY.md for responsible disclosure.
Key principles:
- Primary sources only (no journal articles)
- Citation accuracy is paramount
- Degrade visibly, never silently
- All unit tests must pass before committing
This tool is for legal research purposes only and does not constitute legal advice.
- Search results may not be comprehensive and should not be relied upon as a complete statement of the law.
- Source databases may not include all decisions or the most recent updates.
- Always verify citations and check for subsequent treatment of cases.
- Legal advice should be sought from a qualified legal practitioner for any specific legal matter.
- The authors and contributors accept no liability for any loss or damage arising from use of this tool.
Apache-2.0