AI Portfolio Experiment (V2): Automated Agentic LLM Study

A high-fidelity project for benchmarking the financial decision-making of 18 autonomous AI agents (6 models × 3 personas) in a deterministic, rules-based trading simulation.

🚀 The 2026 Architectural Evolution: "Institutional Intelligence"

In contrast to the reactive "V1" (Deep Search) system, the V2 Framework pivots to a deterministic, pre-computed intelligence engine. We replaced thin 7KB data snapshots with a 150KB Market Intelligence Hub (MIH) to eliminate LLM arithmetic errors and provide high-alpha context.

1. Market Intelligence Hub (MIH)

The MIH acts as the agents' "Bloomberg Terminal," delivering a machine-readable payload that includes:

Macro Narrative (300 Words): Synthesized global context via Brave Search (Freshness: PD).
Macro Indicators (FRED): Real-time 10Y/2Y Yields, Fed Funds, VIX, and Credit Spreads.
Dynamic Sector Discovery: The MIH automatically identifies the Top 3 sectors by RS-Ratio (momentum strength) and injects their current 'Champion' holdings into the catalyst scanner. This ensures the portfolio reacts to macro shifts (e.g., Energy during geopolitical conflict) without human bias.
Catalyst Watchlist (100 Tickers): A $3-$50 scanning universe focused on high-volatility catalysts like FDA/PDUFA dates, Earnings Drift, and Short Squeezes.
Technical/Risk Normalization: Pre-computed ATR (14d), RSI (14d), and Relative Volume (RVOL) Z-scores, providing agents with "alpha-ready" signals.

2. TOON Visualization (Token-Oriented Object Notation)

To maximize context window efficiency, MIH data is serialized into Markdown Tables (TOON) rather than JSON. This reduces token overhead by 60%, allowing larger data payloads without exceeding provider limits.

3. The 18-Agent Roster

benchmarked across:

Providers: OpenAI (o1/4o), Anthropic (Claude 3.5 Sonnet), Google (Gemini 2.5 Flash), Mistral, DeepSeek, and Qwen.
Risk Profiles: Conservative, Balanced, and Aggressive (Persona-driven logic).
Automation: Fully autonomous daily execution via GitHub Actions at 21:00 UTC (Market Close), with automated state persistence via git-auto-commit.

🛠 Project Structure

v2/snapshot_engine.py: The MIH Core. Fetches FRED, RRG, and 100-ticker watchlist data.
v2/agent_runner.py: The daily orchestrator. Injects playbooks/MIH and runs the LLM tool-calling loop.
v2/portfolio_simulator.py: "Code is Law" Firewall. Validates trades, enforces stop-losses, and calculates T+1 execution.
v2/api_adapters.py: Robust retry logic with exponential backoff (handles 429/503 errors across providers).
v2/mcp_servers/: Custom tools for Technicals, Fundamentals, and Web Search.

⚖️ "Code is Law" - Simulator Constraints

LLMs operate within a strict sandbox:

Universe: Long-only US Equities $3-$50. No Penny Stocks, No Shorts, No Crypto.
Execution: T+1 opening prices ONLY (pre-computed slippage).
Sizing: Max 25% ($2,500 on a $10k base) per position.
Mandatory Stops: Every trade MUST specify a stop-loss (1.5x-2.0x ATR recommended).
Tool Cap: Max 15 tool calls per turn to ensure deep analysis before propose_trades.

📅 Usage

1. Reset & Setup (Day 0)

Generate the 6-month playbooks for all 18 agents:

python -m v2.main --day-0

2. Daily Simulation Run

Generate the MIH snapshot and execute all agents for a specific date:

python -m v2.main --run-date 2026-04-07

3. Dashboard Integration

The framework automatically exports data to the static web dashboard in v1/logs/.

📜 License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 662 Commits
.agent/workflows		.agent/workflows
.github/workflows		.github/workflows
v1		v1
v2		v2
.env.example		.env.example
.gitignore		.gitignore
EXPERIMENT_HANDOVER.md		EXPERIMENT_HANDOVER.md
README.md		README.md
index.html		index.html
og-image.png		og-image.png
test_gemini_31.py		test_gemini_31.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Portfolio Experiment (V2): Automated Agentic LLM Study

🚀 The 2026 Architectural Evolution: "Institutional Intelligence"

1. Market Intelligence Hub (MIH)

2. TOON Visualization (Token-Oriented Object Notation)

3. The 18-Agent Roster

🛠 Project Structure

⚖️ "Code is Law" - Simulator Constraints

📅 Usage

1. Reset & Setup (Day 0)

2. Daily Simulation Run

3. Dashboard Integration

📜 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Portfolio Experiment (V2): Automated Agentic LLM Study

🚀 The 2026 Architectural Evolution: "Institutional Intelligence"

1. Market Intelligence Hub (MIH)

2. TOON Visualization (Token-Oriented Object Notation)

3. The 18-Agent Roster

🛠 Project Structure

⚖️ "Code is Law" - Simulator Constraints

📅 Usage

1. Reset & Setup (Day 0)

2. Daily Simulation Run

3. Dashboard Integration

📜 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages