graph-tool-call

Tool Lifecycle Management for LLM Agents

Ingest, Analyze, Organize, Retrieve.

The Problem

LLM agents are getting access to more and more tools. A commerce platform might expose 1,200+ API endpoints. A company's internal toolset might have 500+ functions across multiple services.

But there's a hard limit: you can't put them all in the context window.

The common solution? Vector search — embed tool descriptions, find the closest matches. It works, but it misses something important:

Tools don't exist in isolation. They have relationships.

When a user says "cancel my order and process a refund", vector search might find cancelOrder. But it won't know that you need to call listOrders first (to get the order ID), and that processRefund should follow. These aren't just similar tools — they form a workflow.

The Solution

graph-tool-call models tool relationships as a graph:

                    ┌──────────┐
          PRECEDES  │listOrders│  PRECEDES
         ┌─────────┤          ├──────────┐
         ▼         └──────────┘          ▼
   ┌──────────┐                    ┌───────────┐
   │ getOrder │                    │cancelOrder│
   └──────────┘                    └─────┬─────┘
                                         │ COMPLEMENTARY
                                         ▼
                                  ┌──────────────┐
                                  │processRefund │
                                  └──────────────┘

Instead of treating each tool as an independent vector, graph-tool-call understands:

REQUIRES — getOrder needs an ID from listOrders
PRECEDES — you must list orders before you can cancel one
COMPLEMENTARY — cancellation and refund often go together
SIMILAR_TO — getOrder and listOrders serve related purposes
CONFLICTS_WITH — updateOrder and deleteOrder shouldn't run together

This means when you search for "cancel order", you don't just get cancelOrder — you get the complete workflow: list → get → cancel → refund.

How It Works

OpenAPI/MCP/Code → [Ingest] → [Analyze] → [Organize] → [Retrieve] → Agent
                    (convert)  (relations)  (graph)     (hybrid)

1. Ingest — Point it at a Swagger spec, MCP server, or Python functions. Tools are auto-converted into a unified schema.

2. Analyze — Relationships are automatically detected: path hierarchies, CRUD patterns, shared schemas, response-parameter chains, state machines.

3. Organize — Tools are grouped into an ontology graph. Two modes:

Auto — purely algorithmic (tags, paths, CRUD patterns). No LLM needed.
LLM-Auto — enhanced with LLM reasoning (Ollama, vLLM, OpenAI). Better categories, richer relations.

4. Retrieve — Hybrid search that combines keyword matching, graph traversal, and (optionally) embeddings. Works great without any LLM. Works even better with one.

Quick Start

pip install graph-tool-call

from graph_tool_call import ToolGraph

tg = ToolGraph()

# Register tools (OpenAI / Anthropic / LangChain format auto-detected)
tg.add_tools(your_tools_list)

# Define relationships
tg.add_relation("read_file", "write_file", "complementary")

# Retrieve — graph expansion finds related tools automatically
tools = tg.retrieve("read a file and save changes", top_k=5)
# → [read_file, write_file, list_dir, ...]
#    write_file found via COMPLEMENTARY relation, not just vector similarity

From Swagger / OpenAPI

from graph_tool_call import ToolGraph

tg = ToolGraph()
tg.ingest_openapi("tests/fixtures/petstore_swagger2.json")
# Supports: Swagger 2.0, OpenAPI 3.0, OpenAPI 3.1
# Accepts: file path (JSON/YAML), URL, or raw dict

# Automatic: 5 endpoints → 5 tools → CRUD relations → categories
# Dependencies, call ordering, category groupings — all auto-detected.

tools = tg.retrieve("create a new pet", top_k=5)
# → [createPet, getPet, updatePet, listPets, deletePet]
#    Graph expansion brings the full CRUD workflow

From Swagger UI URL

from graph_tool_call import ToolGraph

# Auto-discovers all API groups from Swagger UI
tg = ToolGraph.from_url("https://api.example.com/swagger-ui/index.html")

# Also works with direct spec URLs
tg = ToolGraph.from_url("https://api.example.com/v3/api-docs")

tools = tg.retrieve("search products", top_k=5)

from_url() automatically detects Swagger UI pages, discovers all spec groups via swagger-config, and ingests them into a single unified graph. Operations without descriptions get auto-generated fallbacks from their HTTP method, path, and tags.

From MCP Server Tools

from graph_tool_call import ToolGraph

tg = ToolGraph()

# Ingest MCP tool list (annotations are preserved)
mcp_tools = [
    {
        "name": "read_file",
        "description": "Read a file",
        "inputSchema": {"type": "object", "properties": {"path": {"type": "string"}}},
        "annotations": {"readOnlyHint": True, "destructiveHint": False},
    },
    {
        "name": "delete_file",
        "description": "Delete a file permanently",
        "inputSchema": {"type": "object", "properties": {"path": {"type": "string"}}},
        "annotations": {"readOnlyHint": False, "destructiveHint": True},
    },
]
tg.ingest_mcp_tools(mcp_tools, server_name="filesystem")

# "delete files" → destructive tools ranked higher (annotation-aware)
tools = tg.retrieve("delete temporary files", top_k=5)

MCP annotations (readOnlyHint, destructiveHint, idempotentHint, openWorldHint) are used as retrieval signals. Query intent is automatically classified and aligned with tool annotations — read queries prefer read-only tools, delete queries prefer destructive tools.

From Python Functions

def read_file(path: str) -> str:
    """Read contents of a file."""

def write_file(path: str, content: str) -> None:
    """Write contents to a file."""

tg = ToolGraph()
tg.ingest_functions([read_file, write_file])
# Parameters extracted from type hints, description from docstring

Why Not Just Vector Search?

Scenario	Vector-only	graph-tool-call
"cancel my order"	Returns `cancelOrder`	Returns `listOrders → getOrder → cancelOrder → processRefund` (full workflow)
"read and save file"	Returns `read_file`	Returns `read_file` + `write_file` (via COMPLEMENTARY)
"delete old records"	Returns any tool matching "delete"	Destructive tools ranked first (annotation-aware)
Multiple Swagger specs with overlapping tools	Duplicate tools in results	Auto-deduplication across sources
1,200 API endpoints	Slow, noisy results	Organized into categories, precise graph traversal

3-Tier Search: Use What You Have

graph-tool-call is designed to work without any LLM and get better with one:

Tier	What you need	What it does	Improvement
0	Nothing	BM25 keywords + graph expansion + RRF fusion	Baseline
1	Small LLM (1.5B~3B)	+ query expansion, synonyms, translation	Recall +15~25%
2	Full LLM (7B+)	+ intent decomposition, iterative refinement	Recall +30~40%

Even a tiny model running on Ollama (qwen2.5:1.5b) can meaningfully improve search quality. No GPU required for Tier 0.

Feature Comparison

Feature	Vector-only solutions	graph-tool-call
Tool source	Manual registration	Auto-ingest from Swagger/OpenAPI/MCP
Search method	Flat vector similarity	Graph + vector hybrid (wRRF), 3-Tier
Behavioral semantics	None	MCP annotation-aware retrieval
Tool relations	None	6 relation types, auto-detected
Call ordering	None	State machine + CRUD workflow detection
Deduplication	None	Cross-source duplicate detection
Ontology	None	Auto / LLM-Auto modes
LLM dependency	Required	Optional (better with, works without)

Roadmap

Phase	What	Status
0	Core graph engine + hybrid retrieval	✅ Done (39 tests)
1	OpenAPI ingest, BM25+RRF retrieval, dependency detection	✅ Done (88 tests)
2	Deduplication, embeddings, ontology modes (Auto/LLM-Auto), search tiers, `from_url()`	✅ Done (181 tests)
2.5	MCP Annotation-Aware Retrieval: intent classifier, annotation scoring, wRRF integration	✅ Done (255 tests)
3	Pyvis visualization, Neo4j export, CLI, PyPI publish	Planned
4	Interactive dashboard (Dash Cytoscape), manual editing, community	Planned

Documentation

Doc	Description
Architecture	System overview, pipeline layers, data model
WBS	Work Breakdown Structure — Phase 0~4 progress
Design	Algorithm design — spec normalization, dependency detection, search modes, call ordering, ontology modes
Research	Competitive analysis, API scale data, commerce patterns
OpenAPI Guide	How to write API specs that produce better tool graphs

Contributing

Contributions are welcome!

# Development setup
git clone https://github.com/SonAIengine/graph-tool-call.git
cd graph-tool-call
pip install poetry
poetry install --with dev

# Run tests
poetry run pytest -v

# Lint
poetry run ruff check .

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.githooks		.githooks
.github/workflows		.github/workflows
benchmarks		benchmarks
docs		docs
examples		examples
graph_tool_call		graph_tool_call
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README-ja.md		README-ja.md
README-ko.md		README-ko.md
README-zh_CN.md		README-zh_CN.md
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

graph-tool-call

The Problem

The Solution

How It Works

Quick Start

From Swagger / OpenAPI

From Swagger UI URL

From MCP Server Tools

From Python Functions

Why Not Just Vector Search?

3-Tier Search: Use What You Have

Feature Comparison

Roadmap

Documentation

Contributing

License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

License

SonAIengine/graph-tool-call

Folders and files

Latest commit

History

Repository files navigation

graph-tool-call

The Problem

The Solution

How It Works

Quick Start

From Swagger / OpenAPI

From Swagger UI URL

From MCP Server Tools

From Python Functions

Why Not Just Vector Search?

3-Tier Search: Use What You Have

Feature Comparison

Roadmap

Documentation

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages