Merged
614 changes: 387 additions & 227 deletions docs/deployment/cloud-deployment.mdx

Large diffs are not rendered by default.

1 change: 0 additions & 1 deletion docs/deployment/local-development.mdx
@@ -1,7 +1,6 @@
---
title: 'Local Development'
description: 'Running and testing agents locally'
icon: 'local'
---

## Overview
3 changes: 3 additions & 0 deletions docs/docs.json
@@ -75,6 +75,9 @@
"how-to/frameworks/crewai",
"how-to/frameworks/letta",
"how-to/frameworks/agno",
"how-to/frameworks/ag2",
"how-to/frameworks/llamaindex",
"how-to/frameworks/parlant",
"how-to/frameworks/custom"
]
},
61 changes: 40 additions & 21 deletions docs/explanation/feature-overview.mdx
@@ -6,7 +6,7 @@ icon: 'rocket'

import NeedHelp from '/snippets/need-help.mdx';

> See what RunAgent supports — from sandbox microVM deployment to streaming SDKs, security, observability, and agent workflows.
> See what RunAgent supports — from serverless deployment to streaming SDKs, security, observability, and agent workflows.

## Start Small, Scale Big

@@ -18,7 +18,7 @@ import NeedHelp from '/snippets/need-help.mdx';
Use LangGraph, CrewAI, or your own custom agent.
</Card>
<Card title="SDKs & API" href="/reference/sdk/overview" arrow>
Full SDKs in Python, JS, Go, Rust; REST/WS APIs with streaming.
Full SDKs in Python, JS, Go, Rust; REST/WebSocket APIs with streaming.
</Card>
</Cards>

@@ -28,39 +28,58 @@ import NeedHelp from '/snippets/need-help.mdx';

| Feature | What It Enables |
|---|---|
| **Sandboxed MicroVMs** | Each agent runs in isolation (Firecracker-based), for security and containment |
| **Autoscaling & Concurrency** | RunAgent scales agents automatically under load, handles cold starts |
| **Streaming Token Responses** | Clients receive tokens as they are generated — ideal for chat UIs |
| **Cross-Language Access** | Call the same agent from JS, Go, Python, Rust without extra infra |
| **State & Memory** *(coming soon)* | Agents can optionally hold memory across calls for richer behavior |
| **Security & Auth** *(in progress)* | Agent-scoped API keys, role scopes, sandbox limits |
| **Telemetry & Observability** *(future)* | Logs, metrics, traces to monitor latency, errors, usage |
| **Agent-to-Agent (A2A)** *(vision)* | Agents can call and coordinate with other agents |
| **Versioning, Rollbacks, Rate Limits** *(roadmap)* | Control agent versions, limit usage, revert if needed |
| **Secure Isolation** | Each agent runs in its own isolated environment for security and resource containment |
| **Automatic Scaling** | RunAgent scales agents automatically under load with fast cold starts |
| **Real-Time Streaming** | Clients receive tokens as they are generated — ideal for chat UIs and interactive applications |
| **Cross-Language Access** | Call the same agent from JavaScript, Go, Python, Rust without extra infrastructure |
| **State & Memory** *(coming soon)* | Agents can optionally hold memory across calls for richer, contextual behavior |
| **Security & Auth** *(in progress)* | Agent-scoped API keys, role-based access, sandbox limits |
| **Telemetry & Observability** *(future)* | Logs, metrics, traces to monitor latency, errors, and usage patterns |
| **Agent-to-Agent (A2A)** *(vision)* | Agents can call and coordinate with other agents for complex workflows |
| **Versioning, Rollbacks, Rate Limits** *(roadmap)* | Control agent versions, limit usage, and revert deployments if needed |

---

## How It Works (High Level)

1. **Write an agent** (one Python function with entrypoint configuration)
2. **Deploy to microVMs** — RunAgent handles sandboxing + autoscaling
3. **Call via SDK / REST / WS** — streaming, sync calls, multi-language
4. **Monitor & scale** — logs, metrics, retries, rate limits
5. **Iterate & evolve** — new versions, memory, agent communication
1. **Write an agent** - Create a Python function with entrypoint configuration
2. **Deploy serverlessly** - RunAgent handles isolation, sandboxing, and automatic scaling
3. **Call via SDK / REST / WebSocket** - Use streaming, synchronous calls, across multiple languages
4. **Monitor & scale** - Track logs, metrics, retries, and rate limits
5. **Iterate & evolve** - Deploy new versions, add memory, enable agent communication
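
As a rough illustration of steps 1 and 3, an agent entrypoint could be as small as the sketch below. It assumes entrypoints are ordinary Python callables, with a generator used for the streaming case. The names `answer` and `answer_stream` are hypothetical, and nothing here calls the RunAgent SDK or CLI; see the linked SDK reference for the real configuration.

```python
from typing import Iterator


def answer(question: str) -> str:
    """Hypothetical synchronous entrypoint: return the whole reply at once."""
    return f"You asked: {question}"


def answer_stream(question: str) -> Iterator[str]:
    """Hypothetical streaming entrypoint: yield the reply one token at a time."""
    # Whitespace splitting stands in for real model tokens (an assumption).
    for token in answer(question).split():
        yield token


if __name__ == "__main__":
    # Synchronous call: the caller waits for the full reply.
    print(answer("what is an agent?"))
    # Streaming call: the caller can render each token as it arrives.
    for token in answer_stream("what is an agent?"):
        print(token, end=" ", flush=True)
    print()
```

The generator form is what makes token-by-token delivery possible: a client can consume each yielded value immediately instead of waiting for the function to return.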

---

## Production-Ready Features

<CardGroup cols={2}>
<Card title="Instant Deployment" icon="bolt">
Deploy agents to production in seconds with `runagent deploy .` - no infrastructure configuration needed.
</Card>
<Card title="Multi-Framework Support" icon="layer-group">
Works seamlessly with LangGraph, CrewAI, AutoGen, AG2, Letta, and custom frameworks.
</Card>
<Card title="Native SDKs" icon="code">
Python, JavaScript/TypeScript, Go, and Rust SDKs with idiomatic APIs. C++ and C# coming soon.
</Card>
<Card title="Built-in Streaming" icon="satellite-dish">
WebSocket-based streaming works natively across all SDKs with minimal code changes.
</Card>
</CardGroup>

---

## Next Steps

- 👉 Start with **Deploy your first agent**
- 🧭 Explore **Frameworks & How-tos**
- 📖 Dive into **SDK / API Reference**
- 🔒 Or read about **Architecture & Concepts**
- Start with **[Deploy your first agent](/tutorials/deploy-your-first-agent)**
- Explore **[Frameworks & How-tos](/explanation/frameworks-overview)**
- Dive into **[SDK / API Reference](/reference/sdk/overview)**
- Or read about **[Architecture & Concepts](/explanation/core-concepts)**

---

<Note>
This page is a **landing overview**. For detailed instructions, go to the linked pages above.
</Note>

<NeedHelp context="Features" />
<NeedHelp context="Features" />