Tutorial: Building an HTTP Workload

This tutorial walks through adding a new workload type to virtwork from scratch using TDD. By the end, you'll have a working http workload that deploys nginx and runs continuous HTTP benchmarks inside a VM.

What We're Building

An "http" workload that:

Installs nginx (web server) and httpd-tools (provides ab, the Apache Bench HTTP benchmarking tool)
Configures nginx to serve a default page on localhost
Runs ab in a loop, generating continuous HTTP request/response metrics
Runs on a single VM with no extra disks and no Kubernetes Service

This is a good first workload because it exercises the core Workload interface without the added complexity of data volumes (like database and disk) or multi-VM orchestration (like network).

Before You Start

Go 1.26+ installed
Ginkgo CLI installed: go install github.com/onsi/ginkgo/v2/ginkgo@latest
Read How Virtwork Works to understand the workload interface
See docs/development.md for environment setup

Step 1: Plan the Cloud-Init

Before writing any Go code, design what will happen inside the VM on first boot.

Packages

nginx — Available in Fedora repos. Serves HTTP on localhost.
httpd-tools — Available in Fedora repos. Provides ab (Apache Bench), a standard HTTP benchmarking tool.

We're choosing ab over tools like wrk or hey because httpd-tools is in Fedora's default repos — no custom builds or third-party repos needed. This follows the existing pattern: stress-ng, fio, iperf3, and postgresql-server are all standard repo packages.

Systemd unit

The workload service depends on nginx being started first, then runs ab in an infinite loop:

[Unit]
Description=Virtwork HTTP benchmark workload
After=nginx.service
Requires=nginx.service

[Service]
Type=simple
ExecStart=/bin/bash -c 'while true; do ab -n 10000 -c 10 http://localhost/; sleep 5; done'
Restart=always
RestartSec=10

[Install]
WantedBy=multi-user.target

This runs 10,000 requests with 10 concurrent connections per batch, sleeps 5 seconds, and repeats. The Requires=nginx.service ensures nginx is running before the benchmark starts.

Cloud-init plan

#cloud-config
packages:
  - nginx
  - httpd-tools
write_files:
  - path: /etc/systemd/system/virtwork-http.service
    content: |
      [Unit]
      Description=Virtwork HTTP benchmark workload
      ...
    permissions: '0644'
runcmd:
  - [systemctl, daemon-reload]
  - [systemctl, enable, --now, nginx]
  - [systemctl, enable, --now, virtwork-http.service]

Note that we enable nginx before the benchmark service, matching the After=nginx.service dependency.

Step 2: Write the Tests First

Create the test file internal/workloads/http_test.go. We follow the same Ginkgo BDD structure used by the existing workload tests (see cpu_test.go for reference):

// Copyright 2026 Red Hat
// SPDX-License-Identifier: Apache-2.0

package workloads_test

import (
	. "github.com/onsi/ginkgo/v2"
	. "github.com/onsi/gomega"

	"github.com/opdev/virtwork/internal/config"
	"github.com/opdev/virtwork/internal/workloads"
)

var _ = Describe("HTTPWorkload", func() {
	var w *workloads.HTTPWorkload

	BeforeEach(func() {
		w = workloads.NewHTTPWorkload(config.WorkloadConfig{
			Enabled:  true,
			VMCount:  1,
			CPUCores: 2,
			Memory:   "2Gi",
		}, "virtwork", "", nil)
	})

	It("should return 'http' for Name", func() {
		Expect(w.Name()).To(Equal("http"))
	})

	It("should include nginx and httpd-tools in packages", func() {
		result, err := w.CloudInitUserdata()
		Expect(err).NotTo(HaveOccurred())

		parsed := parseYAML(result)
		pkgs, ok := parsed["packages"].([]interface{})
		Expect(ok).To(BeTrue())
		Expect(pkgs).To(ContainElement("nginx"))
		Expect(pkgs).To(ContainElement("httpd-tools"))
	})

	It("should include systemd service in cloud-init", func() {
		result, err := w.CloudInitUserdata()
		Expect(err).NotTo(HaveOccurred())

		parsed := parseYAML(result)
		Expect(parsed).To(HaveKey("write_files"))
		files := parsed["write_files"].([]interface{})
		Expect(files).To(HaveLen(1))

		file := files[0].(map[string]interface{})
		Expect(file["path"]).To(Equal("/etc/systemd/system/virtwork-http.service"))

		content := file["content"].(string)
		Expect(content).To(ContainSubstring("ab"))
		Expect(content).To(ContainSubstring("http://localhost/"))
		Expect(content).To(ContainSubstring("Requires=nginx.service"))
	})

	It("should enable nginx before the benchmark service", func() {
		result, err := w.CloudInitUserdata()
		Expect(err).NotTo(HaveOccurred())

		parsed := parseYAML(result)
		cmds := parsed["runcmd"].([]interface{})
		Expect(len(cmds)).To(BeNumerically(">=", 3))
	})

	It("should produce valid YAML", func() {
		result, err := w.CloudInitUserdata()
		Expect(err).NotTo(HaveOccurred())
		Expect(result).To(HavePrefix("#cloud-config\n"))

		parsed := parseYAML(result)
		Expect(parsed).NotTo(BeNil())
	})

	It("should have no extra disks", func() {
		Expect(w.ExtraDisks()).To(BeNil())
	})

	It("should have no extra volumes", func() {
		Expect(w.ExtraVolumes()).To(BeNil())
	})

	It("should have no data volume templates", func() {
		dvts, err := w.DataVolumeTemplates()
		Expect(err).NotTo(HaveOccurred())
		Expect(dvts).To(BeNil())
	})

	It("should not require a service", func() {
		Expect(w.RequiresService()).To(BeFalse())
		Expect(w.ServiceSpec()).To(BeNil())
	})

	It("should return 1 for VMCount", func() {
		Expect(w.VMCount()).To(Equal(1))
	})

	It("should reflect config in VMResources", func() {
		res := w.VMResources()
		Expect(res.CPUCores).To(Equal(2))
		Expect(res.Memory).To(Equal("2Gi"))
	})

	It("should include SSH user when configured", func() {
		w = workloads.NewHTTPWorkload(config.WorkloadConfig{
			Enabled:  true,
			VMCount:  1,
			CPUCores: 2,
			Memory:   "2Gi",
		}, "testuser", "", []string{"ssh-ed25519 AAAA..."})

		result, err := w.CloudInitUserdata()
		Expect(err).NotTo(HaveOccurred())

		parsed := parseYAML(result)
		Expect(parsed).To(HaveKey("users"))
	})
})

The parseYAML helper is already available in helpers_test.go — it strips the #cloud-config prefix and unmarshals the YAML into a map.

Run the tests

go test ./internal/workloads/...

This will fail with compilation errors because HTTPWorkload and NewHTTPWorkload don't exist yet. This is expected — we've confirmed the tests compile against the right interface and we know what we're building.

Step 3: Create the Workload

Create the file internal/workloads/http.go. Follow the pattern established by cpu.go:

// Copyright 2026 Red Hat
// SPDX-License-Identifier: Apache-2.0

package workloads

import (
	"github.com/opdev/virtwork/internal/config"
)

const httpBenchSystemdUnit = `[Unit]
Description=Virtwork HTTP benchmark workload
After=nginx.service
Requires=nginx.service

[Service]
Type=simple
ExecStart=/bin/bash -c 'while true; do ab -n 10000 -c 10 http://localhost/; sleep 5; done'
Restart=always
RestartSec=10

[Install]
WantedBy=multi-user.target
`

// HTTPWorkload generates cloud-init userdata for an HTTP benchmark workload
// using nginx and ab (Apache Bench).
type HTTPWorkload struct {
	BaseWorkload
}

// NewHTTPWorkload creates an HTTPWorkload with the given configuration and SSH credentials.
// HTTPParamSchema declares tunable params for the HTTP workload.
// Even simple workloads should declare an explicit (possibly empty) schema.
var HTTPParamSchema = ParamSchema{}

func NewHTTPWorkload(cfg config.WorkloadConfig, sshUser, sshPassword string, sshKeys []string) *HTTPWorkload {
	return &HTTPWorkload{
		BaseWorkload: BaseWorkload{
			Config:            cfg,
			ParamSchema:       HTTPParamSchema,
			SSHUser:           sshUser,
			SSHPassword:       sshPassword,
			SSHAuthorizedKeys: sshKeys,
		},
	}
}

// Name returns "http".
func (w *HTTPWorkload) Name() string {
	return "http"
}

// CloudInitUserdata returns cloud-init YAML that installs nginx and httpd-tools,
// then runs a continuous HTTP benchmark via systemd.
func (w *HTTPWorkload) CloudInitUserdata() (string, error) {
	return w.BuildCloudConfig(CloudConfigOpts{
		Packages: []string{"nginx", "httpd-tools"},
		WriteFiles: []WriteFile{
			{
				Path:        "/etc/systemd/system/virtwork-http.service",
				Content:     httpBenchSystemdUnit,
				Permissions: "0644",
			},
		},
		RunCmd: [][]string{
			{"systemctl", "daemon-reload"},
			{"systemctl", "enable", "--now", "nginx"},
			{"systemctl", "enable", "--now", "virtwork-http.service"},
		},
	})
}

Let's walk through the key decisions:

Embedding BaseWorkload — We inherit default implementations for VMResources(), ExtraDisks(), ExtraVolumes(), DataVolumeTemplates(), RequiresService(), ServiceSpec(), and VMCount(). Since an HTTP workload doesn't need extra disks, services, or multiple VMs, the defaults are all correct.

Constructor signature — NewHTTPWorkload(cfg, sshUser, sshPassword, sshKeys) matches the same pattern as every other workload constructor. This is required by the registry's WorkloadFactory type.

Using w.BuildCloudConfig() — This is the BaseWorkload method, not cloudinit.BuildCloudConfig(). The difference matters: BuildCloudConfig() injects the SSH credentials before calling the cloudinit package. Always use the method, not the package function.

Three runcmds — We enable nginx separately from the benchmark service, and daemon-reload comes first to pick up the new unit file.

Run the tests again

go test ./internal/workloads/...

All HTTPWorkload tests should pass, and existing workload tests remain green.

Step 4: Register the Workload

The workload exists but the CLI doesn't know about it yet. Add its factory to DefaultRegistry() in internal/workloads/registry.go:

func DefaultRegistry() Registry {
	return Registry{
		// ... existing entries (chaos-disk, chaos-network, chaos-process, cpu, database, disk, memory, network, tps) ...
		"http": {
			Factory: func(cfg config.WorkloadConfig, opts *RegistryOpts) Workload {
				return NewHTTPWorkload(cfg, opts.SSHUser, opts.SSHPassword, opts.SSHAuthorizedKeys)
			},
			ParamSchema: HTTPParamSchema,
		},
		// ... existing entries ...
	}
}

Each entry is a RegistryEntry struct pairing a WorkloadFactory with a ParamSchema. The schema enables deploy-time validation of user-supplied --params values.

That's the only change needed — AllWorkloadNames() is a function that derives its list from DefaultRegistry().List(), so adding the entry automatically includes "http" in the workload name list.

Update affected tests

Adding a workload to the registry changes two things that existing tests verify:

Registry tests — The count of registered workloads increases (currently 9 → 10), and List() returns a different slice.
Orchestration tests — If the default --workloads flag includes all workload names, the total VM count changes.

Search for these assertions and update them:

grep -rn "AllWorkloadNames\|Len(9)\|HaveLen(9)" internal/ cmd/

Update any hard-coded counts to reflect the new workload.

Run all tests

go test ./...

All tests should pass after the count updates.

Step 5: Verify with Dry Run

go run ./cmd/virtwork run --dry-run --workloads http

Expected output (abbreviated):

--- Dry Run ---
Total VMs to create: 1

# VM: virtwork-http-0 (workload: http)
apiVersion: kubevirt.io/v1
kind: VirtualMachine
metadata:
  labels:
    app.kubernetes.io/component: http
    app.kubernetes.io/managed-by: virtwork
    app.kubernetes.io/name: virtwork-http
    virtwork/run-id: <uuid>
  name: virtwork-http-0
  namespace: virtwork
spec:
  running: true
  template:
    spec:
      domain:
        devices:
          disks:
          - disk:
              bus: virtio
            name: containerdisk
          - disk:
              bus: virtio
            name: cloudinitdisk
          ...
        resources:
          requests:
            cpu: "2"
            memory: 2Gi
      volumes:
      - containerDisk:
          image: quay.io/containerdisks/fedora:42
        name: containerdisk
      - cloudInitNoCloud:
          userData: |
            #cloud-config
            packages:
              - nginx
              - httpd-tools
            ...
        name: cloudinitdisk
---

Trace each section back to the interface methods:

Spec Section	Came From
`metadata.labels`	Orchestrator + `Name()`
`metadata.name`	Orchestrator + `Name()`
`resources.requests`	`VMResources()` (via `BaseWorkload`)
`volumes[0]` (containerdisk)	Orchestrator (always present)
`volumes[1]` (cloudinitdisk)	`CloudInitUserdata()`
No `dataVolumeTemplates`	`DataVolumeTemplates()` returned `nil`
No extra disks	`ExtraDisks()` returned `nil`

Step 6: Deploy and Test

This step requires an OpenShift cluster with OpenShift Virtualization.

Deploy

go run ./cmd/virtwork run --workloads http \
  --ssh-user virtwork \
  --ssh-key-file ~/.ssh/id_ed25519.pub

SSH in and verify

virtctl ssh --ssh-key ~/.ssh/id_ed25519 virtwork@virtwork-http-0 -n virtwork

Inside the VM:

# Verify nginx is serving
curl http://localhost/

# Check the benchmark service
systemctl status virtwork-http.service

# Watch the benchmark output
journalctl -u virtwork-http.service -f

You should see ab output showing requests per second, transfer rates, and latency percentiles.

Clean up

go run ./cmd/virtwork cleanup

Going Further

Adding a Data Disk (Storage-Backed Workloads)

If your workload needs persistent storage (for example, a workload that writes benchmark results to disk), override three methods. Look at internal/workloads/disk.go, database.go, or chaos_disk.go for the complete pattern. The key things to get right:

// DataVolumeTemplates returns a CDI DataVolumeTemplateSpec. The orchestrator
// suffixes the template name with the VM name (NamespaceDataVolumes in
// internal/orchestrator/types.go) to avoid collisions when --vm-count > 1,
// so use the un-suffixed base name here.
func (w *MyWorkload) DataVolumeTemplates() ([]kubevirtv1.DataVolumeTemplateSpec, error) {
    return []kubevirtv1.DataVolumeTemplateSpec{
        vm.BuildDataVolumeTemplate("my-data", w.DataDiskSize),
    }, nil
}

// ExtraDisks adds the disk definition to the VM spec. ALWAYS set the Serial
// field — the in-VM script discovers the device via
// /dev/disk/by-id/virtio-<serial>, which is stable across reboots and
// migrations (unlike /dev/vdX, which is not).
func (w *MyWorkload) ExtraDisks() []kubevirtv1.Disk {
    return []kubevirtv1.Disk{
        {
            Name:   "datadisk",
            Serial: "virtwork-mydata",
            DiskDevice: kubevirtv1.DiskDevice{
                Disk: &kubevirtv1.DiskTarget{Bus: "virtio"},
            },
        },
    }
}

// ExtraVolumes links the disk to the DataVolume. The Name must match
// ExtraDisks; the DataVolume.Name must match the template name above.
func (w *MyWorkload) ExtraVolumes() []kubevirtv1.Volume {
    return []kubevirtv1.Volume{
        {
            Name: "datadisk",
            VolumeSource: kubevirtv1.VolumeSource{
                DataVolume: &kubevirtv1.DataVolumeSource{Name: "my-data"},
            },
        },
    }
}

In your cloud-init, write the shared diskSetupScript(serial, mountPoint) helper as the first script and run it from runcmd before the workload service starts. It waits for the /dev/disk/by-id/virtio-<serial> symlink, formats with XFS if empty, mounts at mountPoint, and writes /etc/fstab so the mount survives reboots.

return w.BuildCloudConfig(CloudConfigOpts{
    WriteFiles: []WriteFile{
        {
            Path:        "/usr/local/bin/virtwork-disk-setup.sh",
            Content:     diskSetupScript("virtwork-mydata", "/mnt/data"),
            Permissions: "0755",
        },
        // ... workload service unit and script ...
    },
    RunCmd: [][]string{
        {"/usr/local/bin/virtwork-disk-setup.sh"},
        {"systemctl", "daemon-reload"},
        {"systemctl", "enable", "--now", "virtwork-my-workload.service"},
    },
})

Making It Multi-VM

If your workload needs more than one role of VM (a server and one or more clients, for example), implement the MultiVMWorkload interface. The two canonical references are internal/workloads/network.go (simplest — one Service port, iperf3) and internal/workloads/tps.go (multi-port Service). All workloads support configurable Params via the getter-with-default pattern — see development.md.

Add a Namespace field to your struct — the client needs it to build the server's in-cluster DNS name.
Implement RoleDistribution() []RoleSpec — return a slice of RoleSpec{Role: "server", VMCount: 1} entries declaring how many VMs each role needs.
Implement UserdataForRole(role, namespace) (string, error) — return different cloud-init YAML per role. The orchestrator dispatches per role; the client constructs <service>.<namespace>.svc.cluster.local and never polls for pod IPs.
Override VMCount() to return the sum of all RoleSpec.VMCount values from RoleDistribution().
Override RequiresService() to return true.
Implement ServiceSpec() to create a ClusterIP Service. Its selector should match virtwork/role: server (and ideally app.kubernetes.io/component: <your-workload>) — the orchestrator applies the virtwork/role label to each VM automatically.

The orchestrator detects MultiVMWorkload via type assertion, iterates RoleDistribution(), and calls UserdataForRole() for each role/instance instead of CloudInitUserdata().

Workload Complexity Spectrum

flowchart LR
    A["<b>Simple</b><br/>CPU, Memory, Chaos-process<br/><i>Name + CloudInit only</i>"]
    B["<b>With Storage</b><br/>Disk, Database, Chaos-disk<br/><i>+ DataVolumeTemplates<br/>+ ExtraDisks (with Serial)<br/>+ ExtraVolumes<br/>+ diskSetupScript</i>"]
    C["<b>Multi-VM</b><br/>Network, TPS<br/><i>+ MultiVMWorkload<br/>(RoleDistribution, UserdataForRole)<br/>+ Service + VMCount</i>"]
    A --> B --> C

Start simple. Add complexity only when the workload needs it.

Checklist

Before submitting a new workload, verify:

If multi-VM:

Implements RoleDistribution() []RoleSpec
Implements UserdataForRole(role, namespace) (string, error)
VMCount() returns the sum of all RoleSpec.VMCount values from RoleDistribution()
ServiceSpec().Spec.Selector includes virtwork/role: <server-role> and app.kubernetes.io/component: <name>
Client userdata builds the server DNS as <service>.<namespace>.svc.cluster.local

If storage-backed:

DataVolumeTemplates() returns templates with stable base names (orchestrator suffixes with VM name)
ExtraDisks() sets the Serial field on each Disk
Cloud-init runs diskSetupScript(serial, mountPoint) from runcmd before any service that uses the mount

See docs/development.md for the reference version of the "Adding a New Workload" checklist.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Tutorial: Building an HTTP Workload

What We're Building

Before You Start

Step 1: Plan the Cloud-Init

Packages

Systemd unit

Cloud-init plan

Step 2: Write the Tests First

Run the tests

Step 3: Create the Workload

Run the tests again

Step 4: Register the Workload

Update affected tests

Run all tests

Step 5: Verify with Dry Run

Step 6: Deploy and Test

Deploy

SSH in and verify

Clean up

Going Further

Adding a Data Disk (Storage-Backed Workloads)

Making It Multi-VM

Workload Complexity Spectrum

Checklist

Uh oh!

FilesExpand file tree

03-adding-a-workload.md

Latest commit

History

03-adding-a-workload.md

File metadata and controls

Tutorial: Building an HTTP Workload

What We're Building

Before You Start

Step 1: Plan the Cloud-Init

Packages

Systemd unit

Cloud-init plan

Step 2: Write the Tests First

Run the tests

Step 3: Create the Workload

Run the tests again

Step 4: Register the Workload

Update affected tests

Run all tests

Step 5: Verify with Dry Run

Step 6: Deploy and Test

Deploy

SSH in and verify

Clean up

Going Further

Adding a Data Disk (Storage-Backed Workloads)

Making It Multi-VM

Workload Complexity Spectrum

Checklist