🧑‍🚀 Testronaut

Testronaut is an autonomous testing framework powered by LLMs and Playwright.
It lets you define mission-based tests in plain English, then runs them through a real browser to validate UI workflows — all while generating human-readable reports.

🌌 Join the Mission Control Community

Got questions, ideas, or missions to share?
Join the Discord to connect with other Testronauts, get support, and help shape the framework’s future.

🚀 Features

Write tests in plain English — no brittle selectors
Runs real browser sessions via Playwright
Works with multiple LLM providers (OpenAI, Google Gemini, and more coming)
Modular tool and DOM-reasoning system
Dynamic rate-limit and token-tracking logic
Generates JSON + HTML reports automatically

📖 Documentation

Looking for deeper guides, API references, and examples?
Check the official docs:

Includes:

Quickstart and setup
Writing advanced missions
Configuring providers and models
CLI options
Mission Control integration
Troubleshooting and FAQs

📦 Installation

Global install (recommended — use testronaut directly):

npm install -g testronaut

Then initialize your project:

testronaut --init

Run the sample mission:

testronaut welcome.mission.js

One-off / no install (use npx to run without installing):

npx testronaut --init
npx testronaut welcome.mission.js

📁 Project Structure

missions/
├── login.mission.js
├── logout.mission.js
└── dashboard.mission.js

Each mission exports a string or function and calls runMissions.

✍️ Example Mission

import { runMissions } from 'testronaut';

export const loginMission = `
Visit ${process.env.URL}.
Fill the username field with ${process.env.USERNAME}.
Fill the password field with ${process.env.PASSWORD}.
Click the Login button.
Wait for the dashboard to appear.
Take a screenshot.
Report SUCCESS if the dashboard is loaded, otherwise FAILURE.
`;

export async function executeMission() {
  await runMissions({ mission: loginMission }, "Login Mission");
}

Create a .env file with your credentials and LLM API key (depending on your chosen provider):

# For OpenAI
OPENAI_API_KEY=sk-...

# Or for Gemini
GEMINI_API_KEY=AIza...

URL=https://example.com/login
USERNAME=example@example.com
PASSWORD=********

🧠 LLM Provider Support

Testronaut is provider-agnostic.
Choose your preferred LLM at init or via environment variables.

# During init
testronaut --init

# Or override anytime
TESTRONAUT_PROVIDER=gemini TESTRONAUT_MODEL=gemini-2.5-pro testronaut

Current supported providers:

Provider	Example Models
OpenAI	gpt-4o, gpt-4.1, o3, gpt-5, gpt-5.1, etc.
Google Gemini	gemini-2.5-pro, gemini-2.5-flash, gemini-2.5-flash-8b

More providers coming soon (Anthropic, Mistral, etc.).

🏃 Running Missions

Run all missions:

testronaut

Run a specific mission:

testronaut login.mission.js

Chain missions together:

await runMissions({
  preMission: [loginMission],
  mission: fillContactFormMission,
  postMission: logoutMission,
}, "Contact Form Flow");

🧰 Developer Mode (Staging API)

Use the staging API base URL:

testronaut --dev

If the staging deployment is protected by Vercel, pass the bypass secret:

testronaut --dev --vercel-bypass=YOUR_SECRET login

You can also set the bypass secret via environment variables:

export VERCEL_AUTOMATION_BYPASS_SECRET=YOUR_SECRET
# or
export TESTRONAUT_VERCEL_BYPASS=YOUR_SECRET

📋 Reports

Testronaut generates both JSON and HTML reports automatically under:

missions/mission_reports/

Each includes:

Steps executed
Token usage
Screenshots
Pass/Fail summaries

🧪 Under the Hood

Playwright for browser automation
LLMs for reasoning, DOM parsing, and tool use
Token throttling + adaptive cooldowns
Extensible architecture for custom tools and workflows
DOM trimming controls to cap list sizes (env TESTRONAUT_DOM_LIST_LIMIT or config dom.listItemLimit; use all cautiously—it can spike token use)
Resource guard to ensure full list/table downloads (config resourceGuard or env TESTRONAUT_RESOURCE_*)

🧭 Mission Control

Mission Control lets you:

View all reports in one dashboard
Track mission history and success rates
Compare results across environments
Access screenshots and step details anytime

📄 License

MIT

☕ Support the Mission

🤖 Built with ❤️ by Shane Fast

If Testronaut saves you time, consider fueling the mission:

Name		Name	Last commit message	Last commit date
Latest commit History 176 Commits
.devcontainer		.devcontainer
.github		.github
bin		bin
core		core
llm		llm
policy		policy
runner		runner
tests		tests
tools		tools
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
vitest.config.js		vitest.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧑‍🚀 Testronaut

🌌 Join the Mission Control Community

🚀 Features

📖 Documentation

📦 Installation

📁 Project Structure

✍️ Example Mission

🧠 LLM Provider Support

🏃 Running Missions

🧰 Developer Mode (Staging API)

📋 Reports

🧪 Under the Hood

🧭 Mission Control

📄 License

☕ Support the Mission

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

🧑‍🚀 Testronaut

🌌 Join the Mission Control Community

🚀 Features

📖 Documentation

📦 Installation

📁 Project Structure

✍️ Example Mission

🧠 LLM Provider Support

🏃 Running Missions

🧰 Developer Mode (Staging API)

📋 Reports

🧪 Under the Hood

🧭 Mission Control

📄 License

☕ Support the Mission

About

Topics

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages