Claude Compression Lab - Benchmark

A benchmarking suite that measures Claude Code's token consumption under three scenarios, then analyses the results to evaluate the economic impact of Edgee's AI token compressor.

See the reports/ folder for detailed, real-world reports from our latest benchmark runs, including token usage, costs, and scenario breakdowns.

Overview

The benchmark works in two phases:

Run — Launch isolated Claude Code sessions that complete a fixed set of coding instructions. Each session runs in one of three scenarios (compression strategies).
Analyse — Read the session artefacts and produce cost reports.

Prerequisites

Node.js ≥ 18 with npm
claude CLI installed and accessible in your PATH
RTK (Rust Token Killer): required for the rtk scenario; see https://github.com/rtk-ai/rtk
An .env file at the project root (see below)

Install dependencies:

npm install

Environment setup

Create a .env file at the root of the project:

EDGEE_API_TOKEN_NORMAL=<your-token-for-normal-sessions>
EDGEE_API_TOKEN_EDGEE=<your-token-for-edgee-sessions>
EDGEE_API_TOKEN_RTK=<your-token-for-rtk-sessions>

Each token is an Edgee API key used to route Claude's requests through the Edgee AI Gateway.

Reports

If you want to generate reports, you'll need another variable:

EDGEE_API_TOKEN_REPORT=<your-token-to-generate-reports>

Running a benchmark session

./run.sh <scenario>

Scenarios:

Scenario	Description
`normal`	Baseline — Claude requests go through Edgee AI Gateway with no compression
`edgee`	Edgee token compressor is enabled; input tokens are reduced before forwarding to Anthropic
`rtk`	RTK (Rust Token Killer) is enabled as a local bash proxy; Claude's bash tool calls go through RTK before hitting the gateway

Each run:

Copies the cli/ source directory into a fresh _<scenario>-<random>/ folder
Creates an isolated Claude config directory inside it
Launches Claude Code with --dangerously-skip-permissions

Example:

./run.sh edgee

This creates _edgee-4a2f8c1d/ and starts a Claude session inside it.

What to do inside the Claude session

Once Claude starts, put it in plan mode, then paste the coding instructions one at a time from instructions.md. For each instruction:

Paste the instruction
Let Claude produce a plan
Approve the plan and let it execute
Move on to the next instruction

The session records token usage and cost in .claude/.claude.json inside the session directory.

Analysing results

Standard benchmark (`npm run analyze`)

Reads all _normal-*, _edgee-*, and _rtk-* session directories (excluding -full ones), aggregates token and cost metrics, then calls the Edgee LLM API to produce an AI-written analysis.

npm run analyze

Outputs two files in the project root:

report-<ISO-date>.json — raw aggregated metrics
report-<ISO-date>.md — human-readable markdown report with tables and LLM analysis

Endurance benchmark (`npm run analyze-full`)

Reads all _*-full/ session directories and their claude-pro-usage.json files to measure how many instructions each scenario can complete before exhausting a Claude Pro plan.

npm run analyze-full

Outputs:

report-full-<ISO-date>.json
report-full-<ISO-date>.md

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
cli @ e0ebdbe		cli @ e0ebdbe
config		config
reports		reports
src		src
.env.example		.env.example
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
instructions.md		instructions.md
package-lock.json		package-lock.json
package.json		package.json
run.sh		run.sh
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Claude Compression Lab - Benchmark

Overview

Prerequisites

Environment setup

Reports

Running a benchmark session

What to do inside the Claude session

Analysing results

Standard benchmark (`npm run analyze`)

Endurance benchmark (`npm run analyze-full`)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Claude Compression Lab - Benchmark

Overview

Prerequisites

Environment setup

Reports

Running a benchmark session

What to do inside the Claude session

Analysing results

Standard benchmark (npm run analyze)

Endurance benchmark (npm run analyze-full)

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Standard benchmark (`npm run analyze`)

Endurance benchmark (`npm run analyze-full`)

Packages