Skip to content

GusCayresMindsight/agent-tars-experiment

Repository files navigation

agent-tars-experiment

What this is

A BDD sandbox for Agent TARS — ByteDance's open-source multimodal agent framework with browser control, vision, MCP integration, and a Web UI. Feature descriptions are first class: Gherkin .feature files define what the agent should be able to do; step definitions implement the assertions; the Makefile and CLI are the execution layer. The CLI (@agent-tars/cli) is model-agnostic — this repo wires it to Anthropic, OpenAI, and Volcengine (Doubao). Upstream: https://github.com/bytedance/UI-TARS-desktop

🆕 Notion MCP Integration: This project now includes a fully configured Notion MCP server, allowing AI assistants to interact with your Notion workspace. See Notion Integration Guide for setup instructions.

Prerequisites

  • Node.js ≥ 22 and npm
  • git
  • At least one provider API key (Anthropic, OpenAI, or Volcengine)

Quickstart

cp .env.example .env          # 1. copy the template
$EDITOR .env                  # 2. fill in at least one API key
make install                  # 3. clone upstream ref + npm install (applies patches)
make test-smoke               # 4. fast connectivity check

Structure

features/                     # 1st — behaviour descriptions (Gherkin)
  smoke.feature               #   fast connectivity gate
  research.feature            #   multi-turn web research
  step_definitions/           # 2nd — test implementation
    agent.steps.js
  support/                    # Cucumber World + hooks
    world.js
    hooks.js
patches/                      # patch-package fix for @agent-tars/core 0.3.0
Makefile                      # 3rd — run targets

Note on the patch: @agent-tars/core@0.3.0 has a bug in its Anthropic streaming handler (convertMessages) that causes a SyntaxError after a few tool-call iterations. patches/@agent-tars+core+0.3.0.patch fixes it and is applied automatically by npm install via patch-package. A fix has been submitted upstream at bytedance/UI-TARS-desktop#1879.

Make targets

Target Description
help Print available targets (default)
install Clone upstream repo if missing; npm install
test Run all BDD feature scenarios
test-smoke Run @smoke scenarios only
test-research Run @research scenarios only
run-anthropic Interactive Web UI with Anthropic (claude-sonnet-4-5)
run-openai Interactive Web UI with OpenAI (gpt-4o)
run-volcengine Interactive Web UI with Volcengine (doubao-1-5-thinking-vision-pro-250428)
update Pull latest changes in the reference clone
clean Remove node_modules, caches, reports
nuke clean + delete the agent-tars-cli/ checkout

Notion MCP Integration

This project includes a pre-configured Notion MCP server for AI-powered Notion interactions:

# Quick test
npm run notion:test

# Start MCP server
npm run notion:server

# View setup guide
npm run notion:setup

Documentation:

Configuration: Your Notion credentials are in .env (gitignored). The MCP config file mcp-config.json can be copied to Claude Desktop, Cursor, or other MCP clients.

Links

About

Minimal sandbox for running ByteDance Agent TARS CLI across Anthropic, OpenAI, and Volcengine providers

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors