🔁 AI Agent Loop

Guardrailed agentic execution engine — so your agent stops when it should, not when your budget runs out.

A zero-dependency library for running autonomous AI agent loops with built-in guardrails: loop detection, token budgets, step limits, duplicate call prevention, and graceful degradation.

⚠️ Real-World Pitfall

An AI agent spent $847 calling the same tool 2,847 times before I added loop detection.

The agent was stuck in a retry loop — same tool, same arguments, same error, 2,847 times in a row. It burned through the entire monthly token budget in 4 hours. The task? A simple "find and summarize" that should have cost $0.03.

Without guardrails, autonomous agents are token-eating machines. This library adds the brakes.

✨ Features

Loop Detection — Detects repeated tool calls, argument cycles, and stuck states before they burn your budget
Token Budget — Set hard token limits per task; agent stops gracefully when budget is exhausted
Step Limits — Cap the number of reasoning + tool-call steps per execution
Duplicate Prevention — Identical tool calls (same name + same args) are blocked with cached results
Graceful Degradation — When limits hit, agent returns partial results instead of crashing
Provider Agnostic — Works with any OpenAI-compatible API: MiMo, DeepSeek, OpenRouter, OpenAI, Anthropic
MiMo Optimized — Special handling for MiMo thinking models (mimo-v2.5-pro) to prevent token runaway
Observability — Built-in event emitter for monitoring every step, tool call, and guardrail trigger
Zero Dependencies — Pure ESM, Node.js 18+, nothing extra

🚀 Quick Start

1. Install

npm install ai-agent-loop

2. Run a Guardrailed Agent

import { createAgentLoop } from 'ai-agent-loop';

const agent = createAgentLoop({
  provider: {
    baseUrl: 'https://token-plan-sgp.xiaomimimo.com/v1',
    apiKey: process.env.MIMO_API_KEY,
    model: 'mimo-v2.5',
  },
  guards: {
    maxSteps: 20,            // Max reasoning + tool steps
    maxTokens: 50_000,       // Hard token budget per task
    maxToolCalls: 50,        // Max tool invocations
    duplicateWindow: 10,     // Block duplicate calls within last N
    stuckThreshold: 3,       // Detect stuck state after N same-result calls
  },
  tools: [
    {
      name: 'search_web',
      description: 'Search the web',
      parameters: { type: 'object', properties: { query: { type: 'string' } }, required: ['query'] },
      handler: async ({ query }) => fetch(`https://api.search.example/${query}`).then(r => r.json()),
    },
  ],
});

const result = await agent.run('Find the top 3 news stories today and summarize each in one sentence.');

console.log(result.content);       // Final response
console.log(result.steps);         // Number of steps taken
console.log(result.tokensUsed);    // Total tokens consumed
console.log(result.guardsHit);     // Which guardrails triggered (if any)
console.log(result.cost);          // Estimated cost in USD

📦 Architecture

┌──────────────────────────────────────────────────┐
│                  ai-agent-loop                    │
├──────────────────────────────────────────────────┤
│                                                   │
│  ┌──────────┐   ┌──────────┐   ┌──────────┐     │
│  │  Goal     │   │  LLM     │   │  Tool    │     │
│  │  Parser   │──▶│  Call    │──▶│  Execute │     │
│  └──────────┘   └──────────┘   └──────────┘     │
│       │              │               │            │
│       ▼              ▼               ▼            │
│  ┌───────────────────────────────────────────┐   │
│  │            GUARDRAIL ENGINE               │   │
│  │                                           │   │
│  │  ┌──────────┐ ┌──────────┐ ┌──────────┐  │   │
│  │  │  Loop    │ │  Token   │ │  Step    │  │   │
│  │  │  Detect  │ │  Budget  │ │  Limit   │  │   │
│  │  └──────────┘ └──────────┘ └──────────┘  │   │
│  │  ┌──────────┐ ┌──────────┐ ┌──────────┐  │   │
│  │  │  Dup     │ │  Stuck   │ │  Cost    │  │   │
│  │  │  Block   │ │  Detect  │ │  Track   │  │   │
│  │  └──────────┘ └──────────┘ └──────────┘  │   │
│  └───────────────────────────────────────────┘   │
│                       │                           │
│                       ▼                           │
│  ┌───────────────────────────────────────────┐   │
│  │         GRACEFUL DEGRADATION              │   │
│  │   (partial results + reason for stop)     │   │
│  └───────────────────────────────────────────┘   │
│                       │                           │
│       ┌───────────────┼───────────────┐          │
│       ▼               ▼               ▼          │
│  ┌─────────┐   ┌──────────┐   ┌──────────┐     │
│  │ Xiaomi  │   │ DeepSeek │   │ OpenAI   │     │
│  │ MiMo    │   │          │   │          │     │
│  └─────────┘   └──────────┘   └──────────┘     │
└──────────────────────────────────────────────────┘

🖥️ CLI Reference

# Run an agent task with guardrails
ai-agent-loop run "Summarize today's HN top stories" --max-steps 10 --max-tokens 20000

# Dry run — show what would happen without executing
ai-agent-loop dry-run "Analyze this codebase" --max-steps 5

# Show guardrail stats from last run
ai-agent-loop stats

# Test loop detection with a deliberately stuck prompt
ai-agent-loop test-loop --iterations 100

📚 API Reference

`createAgentLoop(config)`

Creates a guardrailed agent loop instance.

Config options:

Option	Type	Default	Description
`provider`	`Object`	required	`{ baseUrl, apiKey, model }`
`guards`	`Object`	`{}`	Guardrail configuration
`guards.maxSteps`	`number`	`30`	Max reasoning + tool steps
`guards.maxTokens`	`number`	`100_000`	Hard token budget per task
`guards.maxToolCalls`	`number`	`100`	Max tool invocations
`guards.duplicateWindow`	`number`	`10`	Block duplicates within last N calls
`guards.stuckThreshold`	`number`	`3`	Same result N times = stuck
`guards.maxRetries`	`number`	`3`	Max retries per failed tool call
`guards.timeoutMs`	`number`	`30_000`	Per-step timeout
`tools`	`Array`	`[]`	Tool definitions with handlers
`onStep`	`Function`	`null`	Callback for each step
`onGuard`	`Function`	`null`	Callback when a guardrail triggers
`onToolCall`	`Function`	`null`	Callback for each tool invocation

Returns:

Method	Description
`.run(goal)`	Execute an agent task with full guardrails
`.dryRun(goal)`	Simulate execution, return plan without running
`.getStats()`	Stats from last run
`.on(event, handler)`	Subscribe to events
`.reset()`	Clear all state

Events

agent.on('step', ({ step, tokensUsed, toolCalls }) => { ... });
agent.on('tool_call', ({ name, args, cached }) => { ... });
agent.on('guard_triggered', ({ guard, detail }) => { ... });
agent.on('complete', ({ content, steps, tokensUsed, cost, degraded }) => { ... });
agent.on('error', ({ error, step }) => { ... });

Guard Helpers

import { createLoopDetector, createTokenBudget, createStepLimiter } from 'ai-agent-loop/guards';

// Use individually
const detector = createLoopDetector({ window: 10, threshold: 3 });
detector.check({ name: 'search', args: { query: 'test' } });
// → { isDuplicate: false, isStuck: false, count: 1 }

const budget = createTokenBudget({ limit: 50_000 });
budget.consume(1200);
budget.remaining(); // → 48_800
budget.isExhausted(); // → false

Budget Module

import { createBudgetTracker } from 'ai-agent-loop/budget';

const budget = createBudgetTracker({
  provider: 'xiaomi',
  model: 'mimo-v2.5',
  limit: 100_000,
  onExhausted: (stats) => console.log('Budget hit!', stats),
});

budget.log({ inputTokens: 500, outputTokens: 200 });
budget.getSummary();
// → { totalTokens: 700, limit: 100_000, used: '0.7%', estimatedCost: '$0.000175' }

⚠️ Pitfalls & Lessons Learned

1. Infinite Retry Loops Are the #1 Budget Killer

Without a retry limit, an agent will call a failing tool forever. A tool that returns an error 100% of the time will consume your entire budget in minutes.

// ❌ No guard — agent retries forever
const agent = createAgentLoop({ guards: {} });

// ✅ Guarded — stops after 3 retries per tool
const agent = createAgentLoop({
  guards: { maxRetries: 3, maxToolCalls: 50 },
});

2. Thinking Models Amplify Every Problem

MiMo v2.5-pro, DeepSeek Reasoner, and other thinking models consume 10x+ tokens on internal reasoning. An agent loop that's slightly stuck will burn through a budget 10x faster with a thinking model.

Always use non-thinking models for agent loops unless the task genuinely requires chain-of-thought reasoning.

// ❌ Thinking model — runaway token consumption
const agent = createAgentLoop({
  provider: { model: 'mimo-v2.5-pro', ... },
});

// ✅ Non-thinking — predictable token usage
const agent = createAgentLoop({
  provider: { model: 'mimo-v2.5', ... },
});

3. Duplicate Tool Calls Hide in Different Clothing

Agents often call the same tool with slightly different arguments that resolve to the same result. Example: { query: "weather NYC" } and { query: "weather New York City" }. A simple string comparison misses these.

The duplicate detector normalizes arguments before comparison, catching semantic duplicates.

4. Context Window Overflow Causes Amnesia

When an agent's context window fills up, it forgets the original goal and starts repeating earlier steps. This manifests as a loop that's invisible to simple step counting.

Solution: Set maxSteps well below the context window limit. If your model has 128K context, don't let the agent run 200 steps.

5. Graceful Degradation > Hard Crash

When a guardrail triggers, don't just kill the agent. Return whatever partial results it has collected, along with a clear reason for stopping. Users can decide whether to continue with a fresh budget.

const result = await agent.run('Complex research task');
if (result.degraded) {
  console.log(`Stopped early: ${result.stopReason}`);
  console.log(`Partial results: ${result.content}`);
}

📄 License

MIT — Hijrah Assalam

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
bin		bin
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔁 AI Agent Loop

⚠️ Real-World Pitfall

✨ Features

🚀 Quick Start

1. Install

2. Run a Guardrailed Agent

📦 Architecture

🖥️ CLI Reference

📚 API Reference

`createAgentLoop(config)`

Events

Guard Helpers

Budget Module

⚠️ Pitfalls & Lessons Learned

1. Infinite Retry Loops Are the #1 Budget Killer

2. Thinking Models Amplify Every Problem

3. Duplicate Tool Calls Hide in Different Clothing

4. Context Window Overflow Causes Amnesia

5. Graceful Degradation > Hard Crash

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🔁 AI Agent Loop

⚠️ Real-World Pitfall

✨ Features

🚀 Quick Start

1. Install

2. Run a Guardrailed Agent

📦 Architecture

🖥️ CLI Reference

📚 API Reference

createAgentLoop(config)

Events

Guard Helpers

Budget Module

⚠️ Pitfalls & Lessons Learned

1. Infinite Retry Loops Are the #1 Budget Killer

2. Thinking Models Amplify Every Problem

3. Duplicate Tool Calls Hide in Different Clothing

4. Context Window Overflow Causes Amnesia

5. Graceful Degradation > Hard Crash

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`createAgentLoop(config)`

Packages