- Features
- Requirements
- Installation
- Usage (CLI)
- Using the UI
- Architecture
- Cypress Testing
- Eval
- Telemetry with Arize Phoenix
- License
This example demonstrates a virtual QA automation engineer that analyzes web pages, generates test plans, and creates test automation code.
## Features
- Systematically explores websites and identifies test cases
- Generates comprehensive test plans for web applications
- Creates Cypress JS test automation code for identified test cases
- Generates visually appealing markdown test plans
- Can be connected to telemetry systems such as Arize Phoenix for observability and monitoring
- Has an evaluation workflow configured to assess agent performance
## Requirements
- Python 3.11+
- AIQ Toolkit installed
- OpenAI API key (set as OPENAI_API_KEY environment variable)
## Installation
- Clone the NVIDIA/AIQToolkit repository:

  ```bash
  git clone https://github.com/NVIDIA/AIQToolkit.git
  ```

- Navigate to the `AIQToolkit/examples` folder and clone this repository:

  ```bash
  cd AIQToolkit/examples
  git clone https://github.com/rodcar/virtual_qa_automation_engineer.git

  # Navigate to the project's folder
  cd virtual_qa_automation_engineer/
  ```

- Create a virtual environment:

  ```bash
  python -m venv .venv
  source .venv/bin/activate  # On Windows, use .venv\Scripts\activate
  ```

- Install dependencies:

  ```bash
  pip install -e .
  ```

From here you can choose to run the agent using the terminal or the UI.
## Usage (CLI)

This tutorial assumes that the project folder is inside the `examples` folder of NVIDIA/AIQToolkit.

To run the virtual QA automation engineer, include the URL in your request:

```bash
# Set your OpenAI API key first
export OPENAI_API_KEY=your_openai_api_key

# Then run the QA automation engineer
aiq run --config_file=configs/workflow.yaml --input "Generate a test plan for https://rodcar.github.io/girl-factor/"
```

## Using the UI

The AIQ Toolkit provides a user interface that makes it easier to interact with the virtual QA automation engineer.
Note: The UI is located in the `external/aiqtoolkit-opensource-ui` directory. Make sure to run the UI commands from this directory.
In a new terminal:

- Install the required dependencies:

  ```bash
  # In the aiqtoolkit-opensource-ui directory
  npm install
  ```

- Start the UI:

  ```bash
  # In the aiqtoolkit-opensource-ui directory
  npm run dev
  ```

- Start the AIQ server in a separate terminal:

  ```bash
  # Set your OpenAI API key first
  export OPENAI_API_KEY=your_openai_api_key

  # From the AIQToolkit folder
  aiq serve --config_file examples/virtual_qa_automation_engineer/configs/workflow.yaml
  ```

- Open the UI in your browser at http://localhost:3000 (or the port shown in your terminal).

- Configure the connection settings in the UI (if necessary):
  - Set the HTTP API endpoint (typically http://localhost:8000)

- Enter your test request with a URL to start generating test plans and automation code, for example: "Generate a test plan for https://rodcar.github.io/girl-factor/"
## Architecture
The virtual QA automation engineer uses a React Agent workflow with three main tools:
- `web_navigator`: Fetches and analyzes web page content
- `generate_test_automation_code`: Creates Cypress JS test automation code
- `generate_test_plan_markdown`: Generates visually rich markdown test plans
The agent systematically explores web pages, extracts links, navigates through the website, identifies test cases, and generates test automation code for each identified test case. It then creates a comprehensive test plan in markdown format as its final output.
Note: The current system prompt limits the number of test cases for which Cypress JS test automation code is generated to 10. This means that, regardless of how many test cases are identified during exploration, only the first 10 will have automation code generated in each run. This is by design to ensure efficiency and manageability of the generated output.
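As a rough illustration of how a React Agent workflow like this is typically wired together, a config along the following lines could declare the three tools and hand them to the agent. This is a hedged sketch only, not the contents of the actual `configs/workflow.yaml`; the `_type` values and LLM settings are assumptions:

```yaml
# Illustrative sketch only — tool and LLM names here are assumptions;
# see configs/workflow.yaml for the real definition.
functions:
  web_navigator:
    _type: web_navigator
  generate_test_automation_code:
    _type: generate_test_automation_code
  generate_test_plan_markdown:
    _type: generate_test_plan_markdown

llms:
  openai_llm:
    _type: openai
    model_name: gpt-4o   # assumed model

workflow:
  _type: react_agent
  tool_names:
    - web_navigator
    - generate_test_automation_code
    - generate_test_plan_markdown
  llm_name: openai_llm
```

The agent loop reasons about which tool to call next, so the exploration order (navigate, identify cases, generate code, then the final markdown plan) emerges from the prompt rather than from the config.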
## Cypress Testing

The agent generates Cypress test files in the `output` directory. These files are automatically configured in the Cypress test runner through the project's `cypress.config.js` file, which includes this path in its `specPattern`.
The project uses a `cypress.config.js` file at the root to configure the test runner:
```js
const { defineConfig } = require("cypress");

module.exports = defineConfig({
  e2e: {
    specPattern: [
      "cypress/e2e/**/*.cy.{js,jsx,ts,tsx}",
      "examples/virtual_qa_automation_engineer/output/**/*.cy.js"
    ],
    setupNodeEvents(on, config) {
      // implement node event listeners here
    },
  },
});
```

This configuration includes both standard Cypress test locations and the test files generated by the virtual QA engineer.
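To build intuition for how a `specPattern` entry selects files, here is a minimal sketch of glob-to-regex matching: `**` crosses directory boundaries while `*` stays within a single path segment. This is an illustration of the idea only, not Cypress's actual matcher:

```javascript
// Minimal sketch of glob matching. Illustration only — Cypress's real
// spec matching is more complete than this.
function globToRegExp(glob) {
  let re = "";
  for (let i = 0; i < glob.length; i++) {
    const c = glob[i];
    if (c === "*") {
      if (glob[i + 1] === "*") {
        re += ".*"; // "**" matches across "/" separators
        i++;
        if (glob[i + 1] === "/") i++; // swallow the slash after "**"
      } else {
        re += "[^/]*"; // "*" matches within one path segment
      }
    } else if (".+^${}()|[]\\".includes(c)) {
      re += "\\" + c; // escape regex metacharacters
    } else {
      re += c;
    }
  }
  return new RegExp("^" + re + "$");
}

const generated = globToRegExp(
  "examples/virtual_qa_automation_engineer/output/**/*.cy.js"
);

console.log(generated.test(
  "examples/virtual_qa_automation_engineer/output/login.cy.js"
)); // true — a spec directly in output/
console.log(generated.test(
  "examples/virtual_qa_automation_engineer/output/checkout/cart.cy.js"
)); // true — "**" also matches nested folders
console.log(generated.test("src/app.js")); // false — outside the pattern
```

Cypress performs this kind of matching when collecting specs, which is why files the agent writes into the output directory show up in the runner without any extra configuration.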
Once the agent has generated test files, you can run them using the Cypress test runner:
```bash
# Run tests in headless mode
npx cypress run

# Or open the Cypress Test Runner UI
npx cypress open
```

## Eval

To run the evaluation for the virtual QA automation engineer, set your OpenAI API key and execute the following command:

```bash
export OPENAI_API_KEY=your_openai_api_key
aiq eval --config_file examples/virtual_qa_automation_engineer/configs/eval_config.yml
```

The evaluation uses the `eval_dataset.json` file located in `virtual_qa_automation_engineer/data/`. This file contains a set of evaluation prompts (such as test plan requests for specific URLs) and their expected answers. The evaluation process compares the agent's generated outputs against these expected answers to assess performance and accuracy.
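For orientation, an eval dataset of this kind typically pairs each prompt with a reference answer. The snippet below is a hypothetical illustration of that shape only — the field names and values are assumptions, not the contents of the real `eval_dataset.json`:

```json
[
  {
    "id": "1",
    "question": "Generate a test plan for https://rodcar.github.io/girl-factor/",
    "answer": "A markdown test plan covering page navigation, form inputs, and link integrity"
  }
]
```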
## Telemetry with Arize Phoenix

You can enable telemetry and observability for your virtual QA automation engineer using Arize Phoenix. Phoenix provides a powerful UI for monitoring, debugging, and analyzing your LLM and agent workflows.
To enable telemetry, you must uncomment the telemetry configuration in your `configs/workflow.yaml` file. Locate the following section and remove the `#` characters to activate telemetry:
```yaml
general:
  use_uvloop: true
  telemetry:
    logging:
      console:
        _type: console
        level: WARN
    tracing:
      phoenix:
        _type: phoenix
        endpoint: http://localhost:6006/v1/traces
        project: virtual_qa_automation_engineer
```

This will allow your agent to send telemetry data to the Phoenix server.
If you haven't already, install Arize Phoenix:

```bash
pip install arize-phoenix
```

To start the Phoenix server, run:

```bash
phoenix serve
```

By default, the Phoenix UI will be available at http://localhost:6006.
Once Phoenix is running and telemetry is enabled in your config, Phoenix will automatically detect and visualize traces and telemetry from supported LLM and agent frameworks. If your workflow or the AIQ Toolkit is instrumented for OpenInference or Phoenix, telemetry data will appear in the Phoenix dashboard. For more details on integration and advanced usage, see the Phoenix documentation.
## License
This example is licensed under the MIT License. See the LICENSE file for details.