Grafana AI observability is a product from Grafana for teams running agents in production.
Instrument once with a thin OpenTelemetry-native SDK, then use it to see what your agents are doing, what they cost, how quality is changing, and which conversations need attention.
- Simple onboarding. Sigil is a thin SDK layer on top of OpenTelemetry and the OTel GenAI semantic conventions, with helpers for common providers and frameworks. If you already have OTel, setup is small enough to do by hand or with coding assistants such as Claude Code or Cursor.
- A single pane of glass for your agents. See activity, latency, errors, token usage, cost, cache behavior, and quality in one place with filters for time range, provider, model, agent, and labels.
- Conversation drilldown when something looks off. Open any conversation to inspect the full thread, tool calls, traces, scores, ratings, annotations, token usage, and cost breakdowns.
- Agent catalog and version history. Sigil automatically groups agents, tracks versions, shows prompt and tool footprints, surfaces usage and cost per version, and helps you compare how an agent changes over time.
- Actionable suggestions, not just dashboards. Built-in insight bars flag anomalies and optimization opportunities around cost, cache, errors, and performance, and agent detail can rate a version's prompt/tool setup and suggest improvements.
- Online evaluation on live traffic. Score production generations continuously so you can monitor quality, catch regressions, and avoid manually reading every conversation.
- OpenTelemetry-native: follows the OTel GenAI semantic conventions, emits standard traces and metrics over OTLP, and works with existing OTel pipelines.
- Generation-first: normalized generation ingest lets Sigil correlate conversations, tool executions, traces, costs, and scores.
- Version-aware agents: prompt and tool changes become queryable agent versions, even when producers do not send a clean version string.
- Built for production quality loops: observability, agent understanding, ratings, annotations, and online evaluation live in the same workflow.
| Language | Package | Path |
|---|---|---|
| Go | github.com/grafana/sigil-sdk/go |
go/ |
| Python | sigil-sdk |
python/ |
| TypeScript/JavaScript | @grafana/sigil-sdk-js |
js/ |
| .NET/C# | Grafana.Sigil |
dotnet/ |
| Java | com.grafana.sigil |
java/ |
| Language | Providers | Path |
|---|---|---|
| Go | Anthropic, OpenAI, Gemini | go-providers/ |
| Python | Anthropic, OpenAI, Gemini | python-providers/ |
| Language | Frameworks | Path |
|---|---|---|
| Go | Google ADK | go-frameworks/ |
| Python | LangChain, LangGraph, OpenAI Agents, LlamaIndex, Google ADK, Strands Agents | python-frameworks/ |
import { SigilClient } from "@grafana/sigil-sdk-js";
const client = new SigilClient({
generationExport: {
protocol: "http",
endpoint: "http://localhost:8080/api/v1/generations:export",
auth: { mode: "tenant", tenantId: "dev-tenant" },
},
});
// Configure OTEL exporters (traces/metrics) in your app OTEL setup.
await client.startGeneration(
{
conversationId: "conv-1",
model: { provider: "openai", name: "gpt-5" },
},
async (recorder) => {
recorder.setResult({
output: [{ role: "assistant", content: "Hello from Sigil" }],
});
}
);
await client.shutdown();cfg := sigil.DefaultConfig()
cfg.GenerationExport.Protocol = sigil.GenerationExportProtocolHTTP
cfg.GenerationExport.Endpoint = "http://localhost:8080/api/v1/generations:export"
cfg.GenerationExport.Auth = sigil.AuthConfig{
Mode: sigil.ExportAuthModeTenant,
TenantID: "dev-tenant",
}
client := sigil.NewClient(cfg)
defer func() { _ = client.Shutdown(context.Background()) }()
ctx, rec := client.StartGeneration(context.Background(), sigil.GenerationStart{
ConversationID: "conv-1",
Model: sigil.ModelRef{Provider: "openai", Name: "gpt-5"},
})
defer rec.End()
rec.SetResult(sigil.Generation{
Output: []sigil.Message{sigil.AssistantTextMessage("Hello from Sigil")},
}, nil)from sigil_sdk import Client, ClientConfig, GenerationStart, ModelRef, assistant_text_message
client = Client(
ClientConfig(
generation_export_endpoint="http://localhost:8080/api/v1/generations:export",
)
)
with client.start_generation(
GenerationStart(
conversation_id="conv-1",
model=ModelRef(provider="openai", name="gpt-5"),
)
) as rec:
rec.set_result(output=[assistant_text_message("Hello from Sigil")])
client.shutdown()Let your AI coding assistant add instrumentation for you. We provide ready-to-use prompts for Cursor, Claude Code, and GitHub Copilot that tell the agent how to find your LLM calls and wrap them with the SDK.
From the UI — open the AI Observability plugin in your Grafana Cloud stack, go through the onboarding wizard, and pick your IDE. The prompt is generated for you with one click.
From this repo — copy the prompt file for your IDE into your project:
| IDE | Prompt file | Where to put it in your project |
|---|---|---|
| Cursor | CLAUDE.md |
.cursor/rules/sigil.mdc (or paste into Cursor chat) |
| Claude Code | CLAUDE.md |
CLAUDE.md at your project root |
| GitHub Copilot | .github/copilot-instructions.md |
.github/copilot-instructions.md in your project |
Then ask your agent: "Instrument this codebase with Grafana AI Observability".
Minimal, self-contained examples that make a real LLM call and record the generation to Grafana Cloud AI Observability.
| Language | Example |
|---|---|
| Python | examples/getting-started/python/ |
| TypeScript | examples/getting-started/typescript/ |
| Go | examples/getting-started/go/ |
You need three values to connect. All are visible in the AI Observability plugin → Connection tab in your Grafana Cloud stack.
| What | Where to find it |
|---|---|
| Instance ID — numeric stack ID, used as tenant ID and basic-auth username | Shown under Instance ID in the Connection tab. Also in the Cloud Portal under your stack details. |
API token — starts with glc_, used as the basic-auth password |
Create one via Administration → Cloud Access Policies (docs). Scope it to your stack with write permissions for AI Observability. |
| Endpoint URL — the ingest URL for your region | Shown under API URL in the Connection tab. Append /api/v1/generations:export to get the full ingest URL. |
- OpenCode — AI Observability integration for OpenCode
Vendored protobuf definitions used by SDKs live in proto/.