🚀 Rocketgraph v0.1.0 - Compress billions of logs into tiny snapshots to detect anomalies #12
kaushik94
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This release opens up the full Rocketgraph stack: the ML log-clustering and streaming anomaly detection engine, an AI agent that auto-instruments any Node.js service with OpenTelemetry in ~90 seconds, and a license transition to BSL 1.1 → AGPL-3.0 (2030-05-27).
It's the first version where the entire pipeline — from "I have an uninstrumented Node app" to "I'm getting ranked anomaly alerts" — runs end-to-end out of the repo, with no SaaS dependency.
🧠 Rocketgraph ML — clustering + streaming anomaly detection
A stateless FastAPI service (
ml/) that points at the observability platform you already pay for, mines structural log templates, and flags the ones that are statistically unusual. Three deterministic algorithms in sequence — no LLM, no labels, fully reproducible:[log_count, error_rate, warn_rate, unique_services, token_count]per template per service.(x, y) ∈ [5, 95]for drop-in scatter plots.Measured performance
Single container, 4 vCPU, 8 GB RAM, against a real production-shaped workload:
Anomaly signals
Every flagged row carries a
reasonsarray, so downstream alerting can route deterministically:anomaly_scoreHST_THRESHOLD(default 0.7) for that servicenew_templateerror_burstConnectors
Six platforms supported in this release, all returning the same row shape
{timestamp, message, level, service}so the downstream pipeline is identical regardless of source:OpenTelemetry isn't a connector — route OTLP into ClickHouse or Loki via a standard collector config, then point Rocketgraph at that. Minimal collector config is in
ml/README.md.Roadmap: Splunk, Elastic / OpenSearch, Azure Monitor, GCP Cloud Logging.
HTTP API
GET/clustersPOST/clusters/train/clusters, plus warms the per-service HST model on the same window.POST/anomalies/detectPOST/credentials.envat runtime, per-tenant, no restart.GET/credentialsPOST/detector/resetGET/healthTime-window flags:
1h,6h,12h,24h,1d,7d, or absolutestart=<ISO>&end=<ISO>(ClickHouse).Tunables
DRAIN_SIM_TH0.4ANOMALY_CONTAMINATION0.1HST_THRESHOLD0.7DEFAULT_LOOKBACK_HOURS6windowis omitted.MAX_ROWS100000Deployment
docker compose upis the whole install. No DB, no auth provider, no agents on hosts..envfor static creds,POST /credentialsfor dynamic/per-tenant creds. Held in memory only — never written to disk or logs.SIGNING_SECRETgates every endpoint behind anX-Signing-Secretheader.⚡
@rgraph/otel-node— AI agent for OpenTelemetry instrumentationMost teams want anomaly detection but lack the upstream pipeline that produces structured telemetry in the first place.
@rgraph/otel-node(packages/otel-node) closes that gap.What it does on that single command:
package.jsonand the lockfile to detect framework, language (TS/JS), and package manager (npm/yarn/pnpm/bun)..bak) aninstrumentation.ts/.jstailored to that exact stack.@opentelemetry/*packages using the detected package manager.--require/--importflag to wire the file in. For Next.js, picks upexperimental.instrumentationHookautomatically.Detected stacks
http,https,grpc,@grpc/grpc-jspg,mysql2,mongodb,mongoose,redis,ioredis,prismaamqplib,kafkajs,aws-sdk,@aws-sdk/*npm,yarn,pnpm,bun(auto from lockfile)Modes
@opentelemetry/instrumentation-*package selection.--legacy) — deterministic, no LLM. Use this in CI where reproducibility matters.--dry-run --legacy— print the file that would be written + the packages that would be installed. No changes.otel-node instrument— agent goes further: adds structured error handlers, span attributes, and observability code throughout the app.otel-node detect— JSON report of what the detector sees. Useful for debugging or auditing in CI.otel-node uninstall— removes the generated file and.bak. Leaves OTel packages installed.Where it fits
The agent only owns the leftmost step — getting telemetry out of your service over standards-compliant OTLP. The right half is whatever observability platform you already pay for. No proprietary protocol, no custom SDK, no lock-in.
🎨 Rebrand to Rocketgraph
Project consolidated under the Rocketgraph name (formerly
rgraph/ RocketsGraphQL). New logo, new domain at rocketgraph.app, new docs structure. Engine and APIs unchanged — only branding and packaging.The README now ships a real demo GIF (
images/logs-snapshot.gif) showing the 2M-logs → 58-templates pipeline in motion, so you can see what the engine actually does without cloning anything.📜 License change — BSL 1.1, converts to AGPL-3.0 on 2030-05-27
Rocketgraph is moving from AGPL-3.0 to Business Source License 1.1 with a Change Date of 2030-05-27, at which point the license converts to AGPL-3.0 automatically.
What you can do under BSL today (free, no cost):
What you can't do under BSL today (until the Change Date):
On 2030-05-27, the BSL terms expire and every version up to that point becomes AGPL-3.0. No action required from you. Same protection model the AGPL gave you, with a 4-year window of source-available commercial protection in front of it.
The license shift is the standard "MariaDB / Sentry / CockroachDB" pattern — designed to keep the project self-hostable and auditable while leaving room to fund continued development.
Full text:
LICENSE.txt.🔗 Community
YHVnZ5WTinvite is deprecated.🛠 Upgrade notes
discord.gg/dqwkEpSc.🙏 Thanks
To everyone who's stress-tested the ML engine against real production bursts and filed sharp issues — keep them coming. The roadmap items most useful to land next are listed in the README: more connectors (Splunk, OpenSearch, Azure Monitor, GCP Cloud Logging), more
@rgraph/otel-nodeframework coverage (Fastify, NestJS, Remix, Bun-native), and additional end-to-end reference apps underexample-setups/.This discussion was created from the release 🚀 Rocketgraph v0.1.0 - Compress billions of logs into tiny snapshots to detect anomalies.
Beta Was this translation helpful? Give feedback.
All reactions