Skip to content

russellbrenner/jurisd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

281 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

jurisd

A command-line, terminal-user-interface and MCP Australian & NZ legal research tool, built local-first. jurisd gives an AI assistant a fast, offline-capable recall layer over installed legal data modules — deterministic provision lookup, local semantic search, and a citation graph — and falls back to live AustLII search and an Open Australian Legal Corpus (OALC) layer when the answer is not in a local module.

The design tenet is degrade visibly, never silently: a missing optional dependency, an absent API key, or an uninstalled module disables only the feature that needs it and is reported back, never swallowed. With no key and no network, the local-module recall path still answers.

Status: pre-1.0, day-0 release candidate. 15 MCP tools across live research, citation/bibliography, and local data modules.

What jurisd is

jurisd answers Australian (and New Zealand) legal-research questions from an AI assistant. It has three answer sources, tried in precedence order:

  1. Local data modules (Layer 1) — installed parquet bundles holding legislation and decisions with provision-level structure, citation edges, and chunk embeddings. Answered offline, no network, no key. This is the local-first core: deterministic provision lookup, an Act containment tree, an offline citation graph, and local semantic search.
  2. Live AustLII (Layer 2) — natural-language case and legislation search over AustLII, with authority-based ranking, paragraph-pinpoint extraction, full-text fetch (HTML + PDF), and AGLC4 citation formatting.
  3. OALC fallback (Layer 3) — an Open Australian Legal Corpus layer that backs the live layer when a direct fetch is blocked.

jade.io is supported as an optional runtime citation-enhancement source — the citator (citing cases), citation/article resolution, and cross-referencing of live results — when you supply your own session cookie.

Quick start

Run with npx (no clone)

npx -y github:russellbrenner/jurisd

npx clones the repository, installs dependencies, builds, and launches the server over stdio in one step.

Register with Claude Code

claude mcp add jurisd -- npx -y github:russellbrenner/jurisd

Or add it to your client config directly:

{
  "mcpServers": {
    "jurisd": {
      "command": "npx",
      "args": ["-y", "github:russellbrenner/jurisd"]
    }
  }
}

All environment variables are optional — with none set, the live AustLII layer and the local-module recall layer both work. See docs/INSTALL.md for the local-clone path, every config option, and the offline/baseline guarantee.

Claude Code skill

A bundled Claude Code skill teaches the agent expert jurisd usage from day 0 — which of the 15 tools to reach for, the local-first/live-fallback rule, AGLC4 citation workflows, and a worked research session. Install it by copying the skill folder into your skills directory:

cp -r skills/jurisd-research ~/.claude/skills/

(or your plugin's skills path). The skill activates automatically on legal-research and AGLC4 prompts once the jurisd MCP server is registered.

Tools

15 tools in three groups. Operation variants are selected via a mode / op / action / by discriminator on the relevant tool.

Live research (AustLII)

Tool What it does
search_cases Natural-language case-law search across all AU/NZ jurisdictions; authority ranking; title/phrase/boolean methods; pagination.
search_legislation Search AU/NZ legislation with the same method/jurisdiction/sort controls.
fetch_document_text Fetch full text from an AustLII or jade.io URL (HTML, PDF, jade.io via GWT-RPC).

Citation + bibliography (AGLC4)

Tool What it does
format_citation Format an AGLC4 citation. mode: full (default), short, ibid, subsequent, pinpoint.
resolve_citation Resolve a citation to its source. mode: auto (default), validate (AustLII existence check), search.
jade_lookup Look up jade.io. by: article_id (resolve metadata) or citation (build a lookup URL).
search_citing_cases Find cases citing a target via the jade.io citator (requires JADE_SESSION_COOKIE).
cite Write to the local citation cache. action: add (default) or refresh_source (conditional-HEAD freshness check).
bibliography Read the local citation cache (no network). op: get, list (default), export (.bib), cited_by.
cache_cited_by Fetch a cached citation's citing cases from jade.io and store them locally (requires JADE_SESSION_COOKIE).

Local data modules (offline recall)

These five tools serve installed offline data modules. They require the optional @duckdb/node-api dependency and at least one installed module; semantic_search_local additionally needs @huggingface/transformers. Every answer carries metadata.source = "local_module" with the module name, version, and snapshot date (plus a staleness advisory when the snapshot is old).

Tool What it does
get_provision Deterministic provision lookup (e.g. s 18 of an Act). No embedding, no ranking; typed not-found so the router can fall through.
get_act_structure Containment tree of an Act (Act → Part → Division → section/schedule/clause) over act_provision edges, closed-world.
find_citing Offline twin of search_citing_cases: documents in installed modules that cite a target, with each citation's provenance span.
semantic_search_local Vector recall: the query is embedded locally (bge-small, offline, no key) and ranked by cosine over chunk embeddings, with optional facet pre-filters.
list_data_modules Introspect installed modules: coverage, doc/chunk counts, embedding descriptor, load status, snapshot date and staleness.

Full parameter tables for every tool are in docs/AGENT-GUIDE.md.

CLI foundation and compatibility

jurisd keeps the MCP server as the compatibility surface while the CLI is being reorganised around task-oriented command contracts.

Existing flat CLI commands remain available during the foundation work.

Data modules

A data module is a self-describing parquet bundle (documents, chunks, edges, unmatched citations, plus a manifest.json) published as a GitHub release asset on the jurisd-data repository. Everything needed to load and query a module — schema version, coverage, embedding descriptor, file hashes, and licence posture — is in its manifest. No out-of-band config.

Status: no modules published yet. The jurisd-data publishing repo and its first release are still being built, so jurisd fetch-module has nothing to download today (it resolves the release and fails fast with a 404). The server runs without any module — the live AustLII layer and citation tools work standalone, and the five local-recall tools report "no modules" (degrade visibly). The CLI flow below is implemented and ready for the first publish; this section documents the intended install once modules land.

Modules are queried in place: DuckDB scans the parquet on disk and never materialises a whole table into memory, so a host can install many modules (Commonwealth legislation + per-state + decisions) and stay flat in RSS.

Installing modules

Modules are operator-installed via the CLI (kept off the tool surface so an LLM never triggers a large download mid-conversation):

jurisd fetch-module <name> [--version X.Y.Z]   # download + sha256-verify + atomic install
jurisd verify-module <name>                     # re-verify installed files against the manifest
jurisd list-modules                             # list installed modules (incl. refused)

The default install root is ~/.jurisd/modules/ (override with JURISD_MODULES_DIR or --modules-dir). fetch-module validates the manifest and checks the schema version before downloading any parquet, sha256-verifies every file against the manifest, installs atomically (temp-then-rename, so a half-written module never appears), and prints the licence attribution lines at install time.

Baseline vs domain-specialised variants

A module's identity is (name, module_version). The module_version handle distinguishes a module's variant — a baseline module is the standard build (deterministic structure, citation edges, bge-small embeddings); a domain-specialised variant is a build tuned for a particular corpus or task. Use list_data_modules to see the variant, coverage, and embedding descriptor of each installed module, and pin a specific one with the module argument on any recall tool.

BYOK provider adapter

semantic_search_local has two optional enhancement slots that operate over the locally-retrieved top-k results — they never replace local recall, they refine it:

  • rerank — reorder the local top-k by a stronger relevance model.
  • extractive-QA — return the best answer span within a retrieved chunk.

Both are expressed through one vendor-neutral DomainAdapter interface. The distinction is capability presence, framed as baseline vs domain-specialised with a provider-interpolated display label:

  • Baseline (always present): pure local cosine order. No network, no key.
  • Domain-specialised (slot): selected only if a provider is configured and reachable via a BYOK key. With ISAACUS_API_KEY set and the endpoint reachable, the capability probe reports domain_adapter: { label: "Isaacus-enhanced", canRerank: true, canExtractiveQA: true } and responses carry metadata.enhancement = "Isaacus-enhanced".

If the key is unset, or set-but-unreachable, the adapter degrades to baseline and the tool still returns local cosine results — reported by the probe, never thrown into a tool result.

Quality

jurisd's local data layer is built and scored honestly against a gold set. The jurisd-data gold-set evaluation (to be published alongside the first module release; the jurisd-data repo is still being built) measures the local enricher (segments, defined terms, citation crossrefs) against 90 Open Australian Legal Corpus / Kanon ILDGS documents, under two parallel metrics:

  • strict — the conservative audit metric: every typed prediction unmatched within its type is a false positive.
  • aligned — the decision metric: a strict false positive whose span co-locates an untyped gold sub-span at IoU ≥ 0.9 is credited as a granularity agreement (a vocabulary disagreement with the silver standard, not an extraction error) rather than penalised.

The current baseline does not yet pass all four gate thresholds (segment F1, citation precision, citation recall, defined-term F1). Headline segment F1 is 0.44 strict / 0.64 aligned against a 0.85 gate. The report localises every gap to a specific rule (the residual segment gap is genuine over-segmentation, chiefly an endnotes-boundary flood; citation precision is internal-ref over-firing on structural lines). It is published in full, both metrics, as the honest current state, not a marketing number.

Licensing

  • Code: Apache-2.0 (see LICENSE). Third-party dependency licences are catalogued in LICENSE-THIRD-PARTY.md.
  • Module data: licensed per source, declared in each module's manifest.json licence block, and surfaced at fetch-module install time. The aggregate is CC-BY-4.0 (Open Australian Legal Corpus), but redistribution is decided per source, not in aggregate:
    • AustLII-sourced rows are excluded from published modules by default — the AustLII Terms of Service is restrictive, and re-importing it is exactly what the live transport layer routes around. They remain available recipe-only (rebuild locally).
    • VIC and NT legislation are not redistributable (Government Printer / Crown copyright, no open licence) — recipe-only.
    • Commonwealth (FRL), NSW, QLD, SA, TAS, WA legislation and HCA/FCA/NSW case-law sources are redistributable under the CC-BY-4.0 aggregate, subject to per-source confirmation before each module publishes.

The full per-source verdict table ships as jurisd-data/LICENSING.md with the first module release (the jurisd-data repo is still being built); each published module also carries its own licence block in manifest.json, surfaced at fetch-module install time.

Documentation

Document Description
INSTALL.md Day-0 install paths, Claude Code config, env vars, module flow
CLI.md CLI command shape, compatibility aliases, output rules, exit codes
MCP-COMPATIBILITY.md Compatibility reference for the current MCP tool surface
SECURITY-AUTHORITY.md Command authority, side-effect classes, terminal safety, credential rules
jurisd-research skill Claude Code skill: tool decision guidance, AGLC4 workflows, worked example
AGENT-GUIDE.md Agent-facing usage guide with full tool catalog and examples
ARCHITECTURE.md System architecture, deployment topology, CI/CD
DOCKER.md Docker deployment guide
ROADMAP.md Development history and future plans

Jurisdictions

Code Jurisdiction
cth Commonwealth of Australia
federal Federal courts (alias for cth)
vic Victoria
nsw New South Wales
qld Queensland
sa South Australia
wa Western Australia
tas Tasmania
nt Northern Territory
act Australian Capital Territory
nz New Zealand
other All jurisdictions (no filter)

Example queries for AI assistants

Once connected, ask natural-language questions:

  • "Find the High Court decision in Mabo v Queensland (No 2) and explain native title."
  • "Search for recent NSW cases about defamation on social media."
  • "What does section 18 of the Australian Consumer Law say?" (answered offline if the module is installed)
  • "Find cases that cite Mabo v Queensland (No 2)."
  • "Format Mabo v Queensland (No 2) [1992] HCA 23 (1992) 175 CLR 1 per AGLC4 at [64]."
  • "Compare how Victoria and NSW courts have treated non-compete clauses."

Development

git clone https://github.com/russellbrenner/jurisd.git
cd jurisd
npm install
npm run dev        # hot reload
npm run build      # TypeScript compile
npm start          # run the built server
npm test           # unit + integration + perf (integration hits live services)
npm run lint       # ESLint (flat config)

Docker

./build.sh         # build the image
docker-compose up  # run locally

See docs/DOCKER.md for details.

Kubernetes (k3s)

./build.sh
# import the image to k3s nodes (see k8s/README.md)
./deploy-k8s.sh

See k8s/README.md.

Contributing

See CONTRIBUTING.md and SECURITY.md for responsible disclosure.

Key principles:

  • Primary sources only (no journal articles)
  • Citation accuracy is paramount
  • Degrade visibly, never silently
  • All unit tests must pass before committing

Disclaimer

This tool is for legal research purposes only and does not constitute legal advice.

  • Search results may not be comprehensive and should not be relied upon as a complete statement of the law.
  • Source databases may not include all decisions or the most recent updates.
  • Always verify citations and check for subsequent treatment of cases.
  • Legal advice should be sought from a qualified legal practitioner for any specific legal matter.
  • The authors and contributors accept no liability for any loss or damage arising from use of this tool.

License

Apache-2.0

About

jurisd (formerly auslaw-mcp) is a local‑first agentic legal recall for Australian and New Zealand law tool, with GraphRAG, semantic search over locally embedded corpora, deterministic provision lookup, offline citation graph, fast AGLC4 linting, and live AustLII and OALC fallback that keeps future queries local.

Topics

Resources

License

Apache-2.0, Unknown licenses found

Licenses found

Apache-2.0
LICENSE
Unknown
LICENSE-THIRD-PARTY.md

Contributing

Security policy

Stars

Watchers

Forks

Packages

 
 
 

Contributors