ToolTrust Scoring Methodology

Version: 1.2 | Effective Date: 2026-03-21 | Scanner: ToolTrust Scanner

Overview

ToolTrust grades every MCP server on a scale of A → F using a deterministic risk score produced by ToolTrust Scanner. This document defines how we discover tool definitions, the scoring formula, severity weights, grade boundaries, and the full check catalog.

1. Tool Definition Discovery

Each MCP server is cloned at its latest release tag. The scanner then looks for tool definitions using a 3-tier discovery strategy:

Tier 1 — Explicit manifest files

The scanner checks for these files in order (first valid match wins):

Path	Format
`tools.json`	MCP tools array
`mcp.json`	MCP tools array
`server.json`	MCP server manifest
`testdata/tools.json`	Test fixtures
`.mcp/tools.json`	Hidden config
`src/tools.json` / `src/mcp.json`	Source-embedded
`test/tools.json`	Test fixtures
`tools/tools.json`	Tools subdirectory

A file is accepted only if it contains a non-empty .tools[] array (validated with jq).

Tier 2 — Recursive search

If no manifest is found, every *.json in the repo (excluding node_modules/, .git/, package*.json, tsconfig*.json) is inspected for a .tools[] array. This catches tools defined in skills/, tools/, or custom subdirectories.

Tier 3 — Grep-based synthetic extraction (fallback)

When no JSON manifest exists, the scanner parses the source code directly to extract tool names and descriptions:

Language	Pattern matched
TypeScript / JavaScript	`server.tool("name", "description", ...)`
Python	`@mcp.tool`, `name="tool_name"`

A synthetic tools.json is generated from the extracted names and passed to the scanner. This ensures servers like chrome-devtools-mcp — where tool definitions like evaluate_script are embedded in TypeScript source — are scanned at the individual tool level and not missed.

If no tool definitions are found after all three tiers, only the AS-004 supply-chain CVE scan runs against the repo's dependency manifests (package.json, go.mod).

2. Risk Score Formula

The composite risk score for a server is defined as:

$$ \text{RiskScore} = \sum_{i=1}^{n} \left( \text{SeverityWeight}_{i} \times \text{FindingCount}_{i} \right) $$

where:

$n$ is the total number of distinct finding categories detected.
$\text{SeverityWeight}_{i}$ is the weight assigned to severity level $i$ (see §3).
$\text{FindingCount}_{i}$ is the number of individual findings at severity level $i$.

2.1 Severity Weights

Severity	Weight ($w$)	Example trigger
Critical	25	Prompt injection (AS-001), arbitrary code execution (AS-006)
High	15	exec/network permission (AS-002), scope mismatch (AS-003), privilege escalation (AS-005)
Medium	8	Insecure secret handling (AS-010)
Low	2	Missing rate-limit (AS-011)
Info	0	Informational only; no exploitability

2.2 Worked Example

A tool with 1 Critical finding (AS-006) and 1 Low finding (AS-011):

$$ \text{RiskScore} = (25 \times 1) + (2 \times 1) = 27 \quad \Rightarrow \text{Grade C} $$

3. Grade Boundaries

Grade	RiskScore Range	Gateway Action	Meaning
S	(Reserved)	ALLOW	S grade is reserved for future dynamic runtime analysis. Static-only scans cap at A.
A	0 – 9	ALLOW	Minimal risk. Safe for production agents.
B	10 – 24	ALLOW + rate limit	Low risk. Minor issues; review findings.
C	25 – 49	REQUIRE_APPROVAL	Moderate risk. Remediation recommended before production use.
D	50 – 74	REQUIRE_APPROVAL	High risk. Use only in isolated/sandboxed environments.
F	75+	BLOCK	Critical risk. Do not use in agentic pipelines.

4. Check Catalog

All active scanner rules as of ToolTrust Scanner v0.1.12, plus the directory-only historical drift check AS-012:

ID	Category	Severity	What it detects
🛡️ AS‑001	Critical	Tool Poisoning	Hidden adversarial prompts in tool descriptions (`ignore previous instructions`, `system:`, `<INST>`)
🔑 AS‑002	High / Low	Permission Surface	Tools declaring `exec`, `network`, `db`, or `fs` beyond their stated purpose; unnecessarily broad input schema
📐 AS‑003	High	Scope Mismatch	Tool names that contradict their permissions (e.g. `read_config` secretly holding `exec`)
📦 AS‑004	High / Critical	Supply Chain (CVE)	Third-party dependencies with known CVEs — queried live from OSV database
🔓 AS‑005	High	Privilege Escalation	OAuth/token scopes broader than stated purpose (`admin`, `:write` wildcards); escalation signals in description (`sudo`, `impersonate`)
⚡ AS‑006	Critical	Arbitrary Code Execution	Tool name or description implies arbitrary script/code execution (`evaluate_script`, `execute javascript`, `_evaluate` suffix, `page.evaluate()` patterns)
ℹ️ AS‑007	Info	Insufficient Tool Data	Tool lacks a valid description or schema, preventing agents from understanding its capabilities or limitations
🚨 AS‑008	Critical	Known Compromised Package	Dependency or binary matches an embedded offline blacklist of confirmed supply-chain attacks — zero-latency, no network required
🔤 AS‑009	Medium	Typosquatting	Tool name within edit-distance 2 of a well-known MCP tool name, suggesting impersonation of a trusted tool
🗝️ AS‑010	Medium	Secret Handling	Input parameters accepting API keys/passwords/tokens; credentials logged or stored insecurely
⚡ AS‑011	Low	DoS Resilience	Network/execution tools with no rate-limit, timeout, or retry configuration
🔄 AS‑012	High	Rug-Pull / Silent Update	Tool set changed between scans of the same version without a version bump — directory pipeline only; requires historical scan data
ℹ️ AS‑014	Info	Dependency Inventory Unavailable	MCP tool exposed neither `metadata.dependencies` nor a `repo_url`, so supply-chain coverage is explicitly incomplete
⚠️ AS‑015	Medium / High	Suspicious NPM Lifecycle Script	npm dependency publishes install-time lifecycle scripts; severity rises for remote-fetch or inline-execution patterns
🚨 AS‑016	Critical	Suspicious NPM IOC Dependency	npm package metadata or install-time scripts reference a known malicious IOC dependency, domain, URL, or reviewed script pattern such as `plain-crypto-js`, helping catch compromised publishes beyond version blacklists
⚠️ AS‑017	Medium	Suspicious Data Exfiltration Description	tool description explicitly suggests forwarding user data, content, or conversation history to external / remote endpoints, distinct from prompt-injection wording
ℹ️ AS‑018	Info	Embedded MCP Server Detected	source-level MCP SDK imports and server initialization were found, but tool enumeration was not possible without running the server
👥 AS‑013	High / Medium	Tool Shadowing	Duplicate or near-duplicate tool name registered across servers hijacks calls intended for a trusted tool

AS-001

Tool Poisoning (Prompt Injection) · Severity: Critical

Detects adversarial instructions embedded in a tool's description field — e.g. ignore previous instructions, system: prefixes, <INST> tags, or jailbreak language intended to override the model's normal guardrails.

MCP tool descriptions are read by the LLM at runtime. A malicious server can use this field to override the agent's system prompt, exfiltrate data, or escalate privileges without the user's knowledge.

Fix: Remove adversarial instructions from tool descriptions. Validate all tool-definition strings against a safe-pattern allowlist before registration.

AS-002

Excessive Permission Surface · Severity: High / Medium / Low

Detects tools that declare broad permission categories (exec, fs, network) beyond what their stated purpose requires, or whose input schema accepts parameters implying wide access (e.g. arbitrary shell commands, unrestricted file paths).

Over-privileged tools increase blast radius if the agent is manipulated or the tool is misused.

Fix: Validate input parameters using Enums where possible. Restrict file-system operations to explicit allowed directories. Scope network access to known hosts.

AS-003

Scope Mismatch · Severity: High

Detects inconsistency between a tool's name, description, and declared permissions — e.g. a tool named read_file that also declares exec permission, or a description that understates actual capabilities.

Fix: Use explicit naming conventions that fully reflect actual capabilities.

AS-004

Supply Chain Vulnerability (CVE) · Severity: High / Critical

Detects known CVEs in the tool's dependencies via OSV / Google OSV-Scanner.

Fix: Upgrade or replace the vulnerable dependency. Pin all dependency versions and enable automated CVE scanning (Dependabot or OSV Scanner).

AS-005

Privilege Escalation · Severity: High

Detects OAuth/token scopes that include admin or wildcard write access, or description-level signals suggesting impersonation or privilege escalation (sudo, impersonate, act as admin).

Fix: Restrict OAuth/token scopes to the minimum necessary. Remove admin, :write wildcards, and any description-level escalation signals.

AS-006

Arbitrary Code Execution · Severity: Critical

Detects tools whose description or input schema indicate they can execute arbitrary shell commands, scripts, or code — e.g. parameters named command, script, eval, or descriptions containing "run any command".

Arbitrary code execution tools are the highest-risk category. A single prompt injection on an ACE tool can fully compromise the host.

Fix: If not strictly needed, remove the tool. If required, set approval_required: true in your MCP client config to ensure human-in-the-loop confirmation.

AS-008

Known Compromised Package Version · Severity: Critical (BLOCK) / High (WARN)

Detects dependencies whose exact version — or version range — has been confirmed as malicious or compromised, using an embedded offline blacklist compiled from public security advisories. This check runs before AS-004 (OSV live query), requires no network access, and returns results in O(1) time.

Current blacklist entries (as of 2026-03-31):

Advisory	Package	Affected versions	Action
SNYK-PYTHON-LITELLM-15762713	`litellm` (PyPI)	1.82.7, 1.82.8	BLOCK
GHSA-69fq-xp46-6x23	`trivy` (binary)	v0.69.4, v0.69.5, v0.69.6	BLOCK
GHSA-69fq-xp46-6x23	`trivy-action` (GitHub Actions)	< v0.35.0	WARN
GHSA-69fq-xp46-6x23	`setup-trivy` (GitHub Actions)	< v0.2.6	WARN
CVE-2026-33017	`langflow` (PyPI)	< 1.9.0	BLOCK
AXIOS-NPM-COMPROMISE-2026-03-31	`axios` (npm)	1.14.1, 0.30.4	BLOCK

Threat context — TeamPCP supply chain attack (March 2026):

The litellm 1.82.7 and 1.82.8 releases were injected with a malicious .pth file that executes automatically on every Python startup. It harvests SSH private keys, AWS/GCP credentials, .env files, and Kubernetes tokens, exfiltrates them to a C2 server (scan[.]aquasecurtiy[.]org · 45.148.10.212), and establishes systemd persistence. The same threat actor (TeamPCP) compromised trivy's CI/CD pipeline and force-pushed malicious binaries to GHCR, ECR, Docker Hub, and get.trivy.dev.

Severity mapping:

SUPPLY_CHAIN_BLOCK (confirmed malicious code) → CRITICAL → contributes 25 pts → Grade F guaranteed
SUPPLY_CHAIN_WARN (elevated risk, no confirmed payload) → HIGH → contributes 15 pts

Fix: Remove the affected package immediately. Rotate all credentials (SSH keys, AWS/GCP tokens, .env secrets). Check for systemd user services and files under ~/.config/sysmon/. Upgrade to a clean version.

Additional npm incident coverage (March 31, 2026):

ToolTrust now also flags the malicious axios npm publish (axios@1.14.1, axios@0.30.4) and related IOC evidence. For npm-backed MCP servers, this includes:

direct or transitive dependency recovery from package-lock.json, pnpm-lock.yaml, and yarn.lock
install-time lifecycle script review
metadata-level IOC matching for helper packages, malicious domains, URLs, and reviewed script patterns

AS-010

Insecure Secret Handling · Severity: Medium / High

Detects input parameters whose names suggest they accept raw secrets or credentials — e.g. api_key, password, secret, token, private_key.

Secrets passed as plain input parameters appear in agent traces, logs, and LLM context windows. A compromised agent or leaking trace exposes the credential.

Fix: Avoid accepting raw credentials as input parameters. Use secret managers (e.g. 1Password CLI, AWS Secrets Manager) and ensure credentials are never logged or stored in agent traces.

AS-009

Typosquatting · Severity: Medium

Detects tool names that are within edit-distance 2 of a curated list of well-known MCP tool names (e.g. list_files, read_file, brave_search). A tool named read_fille or list_filles could impersonate a trusted tool to intercept agent calls.

Typosquatting is a supply-chain attack vector: a malicious server registers a slightly misspelled version of a popular tool hoping an agent or user selects it by mistake.

Fix: Rename the tool to a unique, clearly differentiated name. If the tool genuinely implements the same interface as the popular tool (e.g. a fork), document this explicitly and distinguish it with a vendor prefix.

AS-011

DoS Resilience — Missing Rate Limit / Timeout · Severity: Low

Detects network or execution tools that declare no rate-limit, timeout, or retry configuration in their description or schema.

An agent in a loop can hammer an unthrottled tool, exhausting API quotas, causing cascading failures, or incurring unexpected costs.

Fix: Declare explicit rate-limit, timeout, and retry configuration for all network and execution tools. Implement exponential back-off and surface resource state to the calling agent.

AS-012

Rug-Pull / Silent Update · Severity: High · Directory pipeline only

Detects when the set of tools exposed by a server changes between two scans of the same version without a version bump. A server that silently adds or removes tools after installation is a supply-chain red flag — commonly called a "rug-pull" attack.

Note: This check requires historical scan data (the previous scan report for the same tool) and therefore runs only in the ToolTrust Directory CI pipeline, not in the standalone tooltrust-scanner CLI.

Example: vsmithery previously exposed ig_get_media, ig_publish_photo, etc. A later scan of the same version revealed those 22 tools had been silently replaced with 17 new INSTAGRAM_* tools — a complete interface swap with no version bump.

Fix: Pin your MCP server to a specific commit hash rather than a floating version tag. Audit the changelog and all tool definitions before updating. Enable the ToolTrust Directory daily re-scan to be notified of silent changes.

AS-014

Dependency Inventory Unavailable · Severity: Info

Flags MCP tools that do not expose metadata.dependencies and do not provide a repo_url. In this case ToolTrust can still scan the tool definition itself, but supply-chain analysis coverage is limited because the dependency inventory is incomplete or unavailable.

This is intentionally informational rather than punitive. The goal is to make missing supply-chain visibility explicit so users do not mistake a clean result for comprehensive dependency coverage.

Fix: Prefer MCP metadata that includes a direct dependency list and a repository URL. For local scans, keep lockfiles checked into the repo so ToolTrust can recover verified dependency versions.

AS-015

Suspicious NPM Lifecycle Script · Severity: Medium / High

Flags npm dependency versions that publish install-time lifecycle scripts such as preinstall, install, postinstall, or prepare. These scripts execute automatically during installation and are a common supply-chain attack primitive.

Severity remains Medium for ordinary lifecycle scripts, but is raised to High when the script includes riskier execution patterns such as remote fetches (curl, wget, Invoke-WebRequest) or inline execution (bash -c, sh -c, node -e, python -c).

Fix: Review the script before use, prefer versions without lifecycle scripts where possible, and install in CI/sandboxed environments with --ignore-scripts when appropriate.

AS-016

Suspicious NPM IOC Dependency · Severity: Critical

Flags npm dependency versions whose published registry metadata references known malicious IOC package names such as plain-crypto-js. This is narrower than full tarball signature scanning, but it still helps catch compromised publishes when the attacker introduces a recognizable IOC through dependency metadata.

AS-017

Suspicious Data Exfiltration Description · Severity: Medium

Flags descriptions that explicitly suggest forwarding user data, content, or conversation history to external URLs, remote hosts, attacker-controlled sinks, or equivalent off-box destinations.

This rule is intentionally separate from AS-001. Prompt injection focuses on instruction override language; AS-017 focuses on suspicious external data-forwarding language that may still warrant review even when it is not trying to hijack the model.

Fix: Narrow the destination scope, document the external endpoint clearly, and keep the tool behind approval if it forwards sensitive or user-derived content.

AS-018

Embedded MCP Server Detected · Severity: Info

Flags repositories where ToolTrust can see MCP SDK imports and server initialization in source code, but cannot enumerate tools from a manifest or a live server handshake.

This is not a clean bill of health. It means the repo appears to contain an embedded MCP implementation, so manual review or a sandboxed live scan is still required to evaluate auth, scope, and input validation.

Fix: Run the server in a sandbox with a live tooltrust-scanner scan --server ... command when possible, or add a static tools manifest to make the implementation reviewable without executing the server.

This rule is especially useful when combined with lockfile-derived transitive dependency evidence from package-lock.json, pnpm-lock.yaml, or yarn.lock, because many MCP servers will not depend on a compromised package directly.

Fix: Treat the affected version as a likely compromise. Remove it, rotate exposed credentials, inspect the dependency tree for the IOC package, and reinstall from a verified clean release.

AS-013

Tool Shadowing · Severity: High / Medium

Detects tool name collisions across a multi-server tool set. When two servers register tools with identical or near-identical names (edit-distance 1), the order of resolution becomes attacker-controlled. A malicious server can register read_file to intercept calls intended for the trusted filesystem server.

Exact duplicates are flagged as High (the hijack is unambiguous). Near-duplicates (edit-distance 1) are flagged as Medium (may be accidental or intentional).

Fix: Ensure each MCP server uses a unique namespace prefix for its tools (e.g. github__search_repos vs linear__search_repos). Audit multi-server configurations for name collisions before deploying to production agents.

5. Scan Scope & Limitations

Static analysis only (v1.1). Dynamic/runtime analysis is planned for a future release.
Tier 3 (grep-based) coverage is best-effort. A tool name missed by extraction will not be scored.
Scores reflect the tool version at scan_date. Scores are not retroactively updated when rules change; a rescan must be triggered.
A clean scan result does not constitute an endorsement of overall software quality or runtime security.

6. Versioning

This methodology follows Semantic Versioning. Breaking changes to the formula or grade boundaries will increment the major version and require re-scanning all affected tools.

Methodology version	Scanner version	Change
1.0	v0.1.2	Initial release
1.1	v0.1.4	Added AS-006 (Arbitrary Code Execution); 3-tier tool discovery
1.2	v0.1.12	Added AS-009 (Typosquatting), AS-013 (Tool Shadowing); false-positive fixes for AS-001 and AS-010
1.3	v0.1.13	Added AS-008 (Known Compromised Package) — offline embedded blacklist for zero-latency supply-chain attack detection; TeamPCP/litellm/trivy/langflow entries

7. Contributing

To challenge a finding or request a rescan, open a Scan Request issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ToolTrust Scoring Methodology

Overview

1. Tool Definition Discovery

Tier 1 — Explicit manifest files

Tier 2 — Recursive search

Tier 3 — Grep-based synthetic extraction (fallback)

2. Risk Score Formula

2.1 Severity Weights

2.2 Worked Example

3. Grade Boundaries

4. Check Catalog

AS-001

AS-002

AS-003

AS-004

AS-005

AS-006

AS-008

AS-010

AS-009

AS-011

AS-012

AS-014

AS-015

AS-016

AS-017

AS-018

AS-013

5. Scan Scope & Limitations

6. Versioning

7. Contributing

FilesExpand file tree

methodology.md

Latest commit

History

methodology.md

File metadata and controls

ToolTrust Scoring Methodology

Overview

1. Tool Definition Discovery

Tier 1 — Explicit manifest files

Tier 2 — Recursive search

Tier 3 — Grep-based synthetic extraction (fallback)

2. Risk Score Formula

2.1 Severity Weights

2.2 Worked Example

3. Grade Boundaries

4. Check Catalog

AS-001

AS-002

AS-003

AS-004

AS-005

AS-006

AS-008

AS-010

AS-009

AS-011

AS-012

AS-014

AS-015

AS-016

AS-017

AS-018

AS-013

5. Scan Scope & Limitations

6. Versioning

7. Contributing