The Vercel for Agent Training - Powered by Agent Lightning
Train production-ready AI agents with reinforcement learning. 95% tool reliability. 98% time savings. 30-50% cost reduction.
pip install agentgym
agentgym train --scenario customer_support
# Training: 100%|ββββββββββ| 10000/10000 [23:45<00:00]
# Tool reliability: 94.7% βAI agents (LangChain, AutoGen, CrewAI) struggle in production:
- Tool reliability: 60-70% (untrained agents often call wrong tools or use wrong parameters)
- No systematic improvement (manual prompt engineering doesn't scale)
- Production blocked (can't deploy agents that fail 30-40% of the time)
AgentGym uses reinforcement learning to train your agents:
from agentgym import Trainer
# Train your LangChain/AutoGen/CrewAI agent
trainer = Trainer()
result = trainer.train("customer_support")
print(f"Tool reliability: {result.metrics.tool_reliability:.1%}")
# Tool reliability: 94.7% β
# Deploy to production
trained_agent = result.to_langchain() # or .to_autogen(), .to_crewai()Based on community analysis (200K+ tokens from LangChain, AutoGen, CrewAI):
| Metric | Before Training | After Training | Improvement |
|---|---|---|---|
| Tool Reliability | 60-70% | 95% | +35% |
| Development Time | 4 hours | 3 minutes | 98% faster |
| LLM Costs | Baseline | -30 to -50% | Better tool selection |
| Production Ready | β | β | One-click deployment |
# Install AgentGym
pip install agentgym
# Verify installation
agentgym --version# List available scenarios
agentgym scenarios list
# Train a customer support agent
agentgym train \
--scenario customer_support \
--framework langchain \
--episodes 10000
# Training runs on your GPU (local, RunPod, Lambda, or AgentGym Cloud)from agentgym import Trainer
# Configure training
trainer = Trainer()
# Train agent
result = trainer.train(
scenario="customer_support",
framework="langchain", # or "autogen", "crewai"
episodes=10000,
gpu="auto" # auto-detect local GPU or use BYOG
)
# Check results
print(f"Tool reliability: {result.metrics.tool_reliability:.1%}")
print(f"Cost reduction: {result.metrics.cost_reduction:.1%}")
print(f"Time savings: {result.metrics.time_savings:.1%}")
# Deploy to your framework
langchain_agent = result.to_langchain()
autogen_agent = result.to_autogen()
crewai_agent = result.to_crewai()- Installation Guide - Set up your development environment
- Quick Start Tutorial - Train your first agent in 5 minutes
- Framework Integrations - LangChain, AutoGen, CrewAI guides
- Technical Approach - How AgentGym uses Agent Lightning
- System Design - Architecture and components
- Scenarios - Pre-built training scenarios
- Executive Summary - High-level overview
- Action Plan - 12-month roadmap
- Competitive Moat - Why open core works
- Contributing Guide - How to contribute
- Development Workflow - Git workflow and best practices
- AI Assistant Instructions - Context for AI coding assistants
Works with your existing agent framework:
- β LangChain - Full support for LangChain agents
- β AutoGen - Microsoft Agent Framework support
- β CrewAI - CrewAI agent support
- π Haystack - Coming soon
- π Semantic Kernel - Coming soon
Train agents for common tasks out-of-the-box:
- Customer Support - 95% tool reliability, handle customer queries
- Code Review - Automated code review with high accuracy
- QA Testing - Comprehensive test case generation
- Data Analysis - Analyze datasets and generate insights
- Email Automation - Intelligent email handling
Or create your own scenarios with custom reward functions.
Train on your choice of infrastructure:
- Local GPU - Auto-detected CUDA GPUs
- RunPod - $0.34/hr for RTX 4090 (cheapest)
- Lambda Labs - Fast provisioning
- AgentGym Cloud - Fully managed (coming Q2 2025)
Rich terminal experience with live progress:
βββββββββββββββββββββββββββββββββββββββββββββββββββ
β AgentGym Training Dashboard β
βββββββββββββββββββββββββββββββββββββββββββββββββββ€
β Scenario: Customer Support β
β Framework: LangChain β
β GPU: RunPod RTX 4090 ($0.34/hr) β
β β
β Episode: 2,847 / 10,000 β
β Progress: βββββββββββββββββββββ 28% β
β β
β Metrics: β
β Tool Reliability: 92.3% β (target: 95%) β
β Avg Response Time: 1.8s β β
β Cost Efficiency: -38% tokens β β
β β
β Estimated completion: 23 minutes β
βββββββββββββββββββββββββββββββββββββββββββββββββββ
AgentGym is a platform built on top of Agent Lightning (Microsoft Research's RL library):
βββββββββββββββββββββββββββββββββββββββββββ
β AgentGym (Platform) β
β - Pre-built scenarios β
β - Framework integrations β
β - Beautiful CLI β
β - GPU orchestration β
β - One-click deployment β
βββββββββββββββββββββββββββββββββββββββββββ
β uses
βββββββββββββββββββββββββββββββββββββββββββ
β Agent Lightning (Library) β
β - RL algorithms (PPO, DQN, A3C) β
β - GPU acceleration β
β - Distributed training β
βββββββββββββββββββββββββββββββββββββββββββ
Analogy:
- Agent Lightning : AgentGym :: Docker : Heroku
- Agent Lightning : AgentGym :: TensorFlow : Weights & Biases
We use Agent Lightning as our RL engine, freeing us to focus on developer experience, scenarios, and production deployment.
See TECHNICAL_APPROACH.md for details.
AgentGym/
βββ src/agentgym/ # Source code
β βββ core/ # Core training logic
β βββ scenarios/ # Pre-built scenarios
β βββ integrations/ # LangChain, AutoGen, CrewAI
β βββ cli/ # Command-line interface
β βββ ui/ # Terminal dashboard
β βββ utils/ # GPU orchestration, etc.
β
βββ docs/ # Documentation
β βββ strategy/ # Strategic planning
β βββ architecture/ # Technical design
β βββ development/ # Dev guides
β βββ research/ # Community analysis
β βββ validation/ # User interviews
β
βββ tests/ # Test suite
βββ examples/ # Example code
βββ .github/workflows/ # CI/CD
β
βββ pyproject.toml # Project config
βββ README.md # This file
βββ CONTRIBUTING.md # How to contribute
βββ LICENSE # MIT License
We welcome contributions! See CONTRIBUTING.md for guidelines.
Ways to contribute:
- π Report bugs - Open an issue
- π‘ Suggest features - Start a discussion
- π Improve docs - Documentation PRs welcome
- π¨ Add scenarios - Contribute pre-built scenarios
- π§ Fix bugs - Look for
good first issue - β¨ Add features - Check out
help wanted
Quick start for contributors:
# Clone your fork
git clone https://github.com/YOUR_USERNAME/agentgym.git
cd agentgym
# Set up development environment
python -m venv venv
source venv/bin/activate
pip install -e ".[dev]"
pre-commit install
# Run tests
pytest
# Make changes, commit, push, create PR!See docs/development/WORKFLOW.md for detailed workflow.
- Market validation (LangChain, AutoGen, CrewAI communities)
- Strategic planning (Option D: Open Core)
- Architecture design
- Core training engine (wrapper around Agent Lightning)
- Pre-built scenarios (customer support, code review, QA)
- Framework integrations (LangChain, AutoGen, CrewAI)
- BYOG support (local GPU, RunPod, Lambda)
- Beautiful CLI with live progress
- Documentation and examples
- Target: OSS launch Month 2
- Launch on Twitter, Reddit, LangChain Slack
- Community building and feedback
- Validation interviews (15-20 users)
- GO/NO-GO for Cloud platform
- Target: 1K-5K GitHub stars, 500-1K users
- Managed GPU orchestration
- Team collaboration features
- One-click deployment
- Advanced observability
- Billing and subscriptions
- Target: 50-100 paying customers, $5K-10K MRR
- Enterprise features (SOC 2, SSO, RBAC)
- Multi-region deployment
- Training marketplace
- White-label options
- Target: $50K-100K MRR, Series A ready
See OPTION-D-ACTION-PLAN.md for detailed timeline.
- GitHub Discussions: Ask questions, share ideas
- Discord: Join our community (coming soon)
- Twitter: @agentgym - Updates and announcements
- Email: hello@agentgym.com
Current Phase: Pre-Development β OSS MVP Version: 0.1.0 (alpha) Status: Setting up project structure Next Milestone: OSS launch (Month 2)
Track progress in PROJECT-STATUS.md.
MIT License - see LICENSE file for details.
- Agent Lightning - Microsoft Research's RL library (our foundation)
- LangChain Community - Inspiration and validation
- AutoGen Community - Cross-framework insights
- CrewAI Community - Tool reliability validation
Ready to train better agents?
pip install agentgym
agentgym train --scenario customer_supportHave questions? Read the docs or join discussions.
Happy training! π―