MissionBench: Zero-Shot Mission-Level Evaluation for Aerial MLLM Agents

MissionBench is a UAV autonomous mission execution framework integrated with Cosys-AirSim and Vision–Language Models (VLMs) for tasks such as patrol, visual inspection, manipulation, and package delivery. The system supports Gemini, GPT, AWS Nova, and Anthropic models, configurable via YAML files.

Visualization

Acknowledgments

Special thanks to Suman Navaratnarajah, the main contributor to this work.

Milestone

2025/05/08: The official implementation of MissionBench is now open source.
2025/05/06: The official project page can be found HERE.

System Requirements

Hardware

CPU: Intel i7 (12th Generation or above)
GPU: NVIDIA RTX 3050 (4 GB VRAM or higher)
RAM: 16 GB or more
Storage: Minimum 250 GB free space

Operating System

Ubuntu: 24 or 22
Vulkan Instance Version: 1.3.275

Software

Python: 3.10.12

Project Structure

MissionBench/
├── configs/
│   ├── environments.yaml
│   └── models.yaml
├── dataset/
├── examples/
│   └── step_by_step_demo.py
├── airsim_api_server.py
├── .env
├── airsim_env/
└── missionbench_env/

Step 1: Create virtual environments (automatic)

From the project root directory, run:

./env_setup.sh

This script automatically:

Creates airsim_env
Creates missionbench_env
Installs all required packages in both environments
Runs basic validation checks

Step 2: Create `.env` File

Create a .env file in the project root: This stores all the api credentials required to run the missions

touch .env

Example see .env_sample

Step 3: Run Mission Execution

Update the different configurations in the config file and run the mission.

First run the unit_testset.yaml which runs one mission from each of NHEnv, ForestEnv and CityEnv before running the entire testset, configs/testset.yaml

source missionbench_env/bin/activate
python main_driver.py --config_file configs/unit_testset.yaml

this downloads the packaged airsim envs and required testset if not available so may take some time during the first run.

Outputs and Debugging

Mission outputs are saved in:

MissionBench/data/

This includes:

Step-by-step images
Bounding box annotations if available
Spatial reasoning results
Debug and execution logs

Sample Mission Configuration

Open Mission Config

Model Naming

configs/models.yaml

License

CC BY-NC-SA 4.0

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
configs		configs
examples		examples
experiment_automation		experiment_automation
graphs		graphs
predictors		predictors
prompter		prompter
resource		resource
simulator_interface		simulator_interface
tests		tests
utils		utils
.env_sample		.env_sample
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
airsim_api_server.py		airsim_api_server.py
env_setup.sh		env_setup.sh
main_driver.py		main_driver.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MissionBench: Zero-Shot Mission-Level Evaluation for Aerial MLLM Agents

Visualization

Acknowledgments

Milestone

System Requirements

Hardware

Operating System

Software

Project Structure

Step 1: Create virtual environments (automatic)

Step 2: Create `.env` File

Example see .env_sample

Step 3: Run Mission Execution

Outputs and Debugging

Sample Mission Configuration

Model Naming

License

References

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MissionBench: Zero-Shot Mission-Level Evaluation for Aerial MLLM Agents

Visualization

Acknowledgments

Milestone

System Requirements

Hardware

Operating System

Software

Project Structure

Step 1: Create virtual environments (automatic)

Step 2: Create .env File

Example see .env_sample

Step 3: Run Mission Execution

Outputs and Debugging

Sample Mission Configuration

Model Naming

License

References

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Step 2: Create `.env` File

Packages