Add RPC ingestion load test driven by synthetic apply-load ledger bundles by cjonas9 · Pull Request #741 · stellar/stellar-rpc

cjonas9 · 2026-05-15T22:40:07Z

What

This PR implements a repeatable CI ingestion load test that runs against a full database holding 7 days of ledgers. The approximate design is here:

The GHA workflow for this test is currently triggered on pushes to this branch (apply-load), but will later be modified to trigger on any release or on PR comments stating "run load test".

The workflow benchmarks RPC ingestion end-to-end on an ephemeral c5.2xlarge: it launches the box, then pulls a mainnet-scale golden DB (~307GB, 1-week retention window), a BUILD_TESTS stellar-core, and three apply-load ledger bundles from S3 (sha-verified). After the box downloads and decompresses this data, its gp3 volume is throttled to 125 MiB/s. The box then ingests the bundles, and the workflow posts a per-profile results table to the run summary / PR.

Main Pieces:

integrationtest/ingest_loadtest_test.go::TestIngestSyntheticLedgers: byte-concatenates N bundles into one continuous stream (the backend rebases ledger sequence numbers per ledger, so per-bundle sequence resets are harmless), ingests the stream onto the golden DB with retention trimming live, verifies exact classic/soroban op counts via parallel getTransactions walkers, and reports per-profile wall-clock, ledgers/sec, ms/ledger, and latency quantiles.
loadtest/testdata/apply-load-v27-*-cfg: config files specifying three O3 target tx profiles of 1,000 ledgers each: sac (1,000 soroban TPL), oz (900), soroswap (250). Each profile also generates 1,000 classic payments per ledger. The ledger bundles (for local usage or S3) are created offline from these configs by stellar-core apply-load.
.github/workflows/load-test.yml: push-triggered orchestrator. OIDC-assumes into AWS, launches an ephemeral c5.2xlarge (Ubuntu 22.04, 500GB gp3) with the runner script as user-data (shipped verbatim, TARGET_SHA/RUN_ID passed via a two-line env preamble), waits for SSM registration, delegates polling to the script, writes the results table to the step summary (and PR comment when one exists), fails the job on a fail verdict or timeout, and always terminates the instance.
run-load-test.sh: both halves of the run in one self-contained script, coordinated by a /tmp marker protocol.
- instance (user-data on the box): installs the toolchain, streams the golden DB + BUILD_TESTS core + all bundles from S3, builds the repo at the target SHA, throttles the root volume's throughput (MiB/s), runs the test, and writes results.md plus an ok/fail verdict.
- orchestrate (on the GHA runner): polls the box over SSM, drives the gp3 downshift handshake (500 -> 125 MiB/s after downloads complete, so fetches are fast but the benchmark runs on throttled I/O), and relays verdict + results as step outputs.

Why

This provides CI testing and benchmarking of RPC ingestion performance. It also serves as an automated regression testing framework, though future work should extend it to report a metric that lets a run's results be compared against historical results.
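One possible shape for such a historical comparison is a relative-change check against a stored baseline. The sketch below is an assumption, not part of this PR: the function name `regression` and the 10% tolerance are hypothetical, and the sample inputs are the apply-load-v27-oz ms/ledger values from two runs posted on this PR.

```go
package main

import "fmt"

// regression returns the relative change of current versus baseline and
// whether the slowdown exceeds tolerance (a fraction, e.g. 0.10 for 10%).
// Higher values are assumed to be worse, as with ms/ledger.
// (Hypothetical helper for illustration only.)
func regression(baseline, current, tolerance float64) (change float64, regressed bool) {
	change = (current - baseline) / baseline
	return change, change > tolerance
}

func main() {
	// ms/ledger for apply-load-v27-oz: run 714b1fa as baseline, fdb1926 as current.
	change, regressed := regression(1251.30, 1235.54, 0.10)
	fmt.Printf("change=%.2f%% regressed=%v\n", change*100, regressed)
}
```

A real implementation would also need to decide where baselines are stored and how much run-to-run noise to tolerate.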

Known limitations

This is intended purely as a test of RPC's ingestion pipeline and aims to measure how it handles load in isolation (i.e. without captive core running). Future work should also automatically refresh the S3 DB + ledger bundles on some pre-determined cadence.

github-actions · 2026-05-15T22:42:08Z

⏳ Load test launching on i-063ed1e3a29f001e3 (commit 241bdf833edbfc18d3c312a7109168d171324aa1).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/25944865928
Posting results when the run finishes (~15 min).

tamirms · 2026-06-17T15:06:24Z

@@ -0,0 +1,344 @@
+#!/usr/bin/env bash


We require Go to be installed to run RPC and the ingestion load test, so I wonder if most of the logic in this bash script could live in a Go file. I think a Go program would be easier to understand and maintain than a large bash script.

cjonas9 · Jun 17, 2026

Yes, definitely. Some shell is required, but cramming it all into one shell script is excessive and messy (though I did like that it kept everything for the instance in one place as user data). I'll see about refactoring some of this out.

github-actions · 2026-06-17T23:55:37Z

⏳ Load test launching on i-000ee1a1274fa17a4 (commit 9bef1157b7073a5d472b8bdc297cdc4e9c5169d2).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/27727364842
Posting results when the run finishes.

socket-security · 2026-06-17T23:55:45Z

Review the following changes in direct dependencies.

Diff	Package	Supply Chain Security	Vulnerability	Quality	Maintenance	License
	golang/github.com/aws/aws-sdk-go-v2@v1.39.5 ⏵ v1.42.0	⁺¹
	golang/github.com/stellar/go-stellar-sdk@v0.6.0 ⏵ v0.5.1-0.20260618200753-4daf27b6f1bf
	golang/github.com/aws/aws-sdk-go-v2/service/ssm@v1.69.3

github-actions · 2026-06-18T00:18:13Z

❌ Ingest load test failed (run 27727364842 on 9bef1157b7073a5d472b8bdc297cdc4e9c5169d2)

make build-libs failed: exit status 2

github-actions · 2026-06-18T00:25:48Z

⏳ Load test launching on i-0e82b023245865ca1 (commit 8b39ed5024f66ac96a7798af5e031445c9d466cf).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/27728512552
Posting results when the run finishes.

github-actions · 2026-06-18T01:33:53Z

❌ Ingest load test failed (run 27728512552 on 8b39ed5024f66ac96a7798af5e031445c9d466cf)

volume throttle could not be confirmed
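A later commit in this PR (714b1fa) moves the benchmark throttle to cgroup `io.max`. As a rough sketch of what such a limit looks like, assuming cgroup v2, an `io.max` entry has the form `MAJ:MIN rbps=<bytes/s> wbps=<bytes/s>` and is written to the `io.max` file of the target cgroup. The helper name `ioMaxLine` and the device number below are hypothetical, and the PR's runner may set additional keys or a different limit.

```go
package main

import "fmt"

const mib = 1024 * 1024

// ioMaxLine formats one cgroup v2 io.max entry that caps both read and write
// bandwidth (in bytes per second) for the block device major:minor.
// (Hypothetical helper for illustration only.)
func ioMaxLine(major, minor uint32, bytesPerSec uint64) string {
	return fmt.Sprintf("%d:%d rbps=%d wbps=%d", major, minor, bytesPerSec, bytesPerSec)
}

func main() {
	// 259:0 is a placeholder device number; the real one can be read from lsblk.
	fmt.Println(ioMaxLine(259, 0, 125*mib)) // prints 259:0 rbps=131072000 wbps=131072000
}
```

Unlike an EBS volume modification, a cgroup limit takes effect on write and can be confirmed by reading the same file back.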

github-actions · 2026-06-18T02:24:20Z

⏳ Load test launching on i-0b40cd7da9dfe4d32 (commit 714b1fab586ddf7d68ea309062c8a28434520d09).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/27732589868
Posting results when the run finishes.

github-actions · 2026-06-18T04:46:42Z

📈 Ingest load test — `714b1fa`

Profile	Ledgers	Wall-clock	Ledgers/sec	ms/ledger	p50 / p95 / p99 ms
apply-load-v27-oz	1000	1250.049s	0.80	1251.30	1174.999 / 1749.997 / 2025
apply-load-v27-sac	1000	1156.650s	0.86	1156.65	1174.997 / 1250 / 1299.999
apply-load-v27-soroswap	1000	827.400s	1.21	827.40	825.001 / 900.001 / 974.999

Metric	Value
Ledgers replayed	3000
Initial DB ledger count	120960
Overall throughput	0.93 ledgers/sec
Overall ingest wall-clock	3234.099s
Per-ledger p50 / p95 / p99	1100 / 1449.999 / 1900 ms
Golden DB fetch+decompress	1180s
stellar-core	`v27.0.0`
Workflow run	#27732589868

github-actions · 2026-06-18T15:13:50Z

⏳ Load test launching on i-0ae39eb14de75bdbe (commit 5058ad3976a196cdcd99984762f2caa516a21508).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/27769496808
Posting results when the run finishes.

github-actions · 2026-06-18T17:37:47Z

📈 Ingest load test — `5058ad3`

Profile	Ledgers	Wall-clock	Ledgers/sec	ms/ledger	p50 / p95 / p99 ms
apply-load-v27-oz	1000	1240.774s	0.81	1242.02	1150.001 / 1725 / 2050
apply-load-v27-sac	1000	1139.350s	0.88	1139.35	1149.998 / 1225.001 / 1275.001
apply-load-v27-soroswap	1000	825.875s	1.21	825.87	825.001 / 900.001 / 950

Metric	Value
Ledgers replayed	3000
Initial DB ledger count	120960
Overall throughput	0.94 ledgers/sec
Overall ingest wall-clock	3205.999s
Per-ledger p50 / p95 / p99	1099.998 / 1449.999 / 1900 ms
Golden DB fetch+decompress	1451s
stellar-core	`v27.0.0`
Workflow run	#27769496808

github-actions · 2026-06-18T18:59:20Z

⏳ Load test launching on i-05d0c32462a3c7da3 (commit 3b33626afda7a66fce20d76ea7dc083c2492ad65).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/27782520036
Posting results when the run finishes.

github-actions · 2026-06-18T21:19:10Z

⏳ Load test launching on i-01e1ab4d654e6efdf (commit fdb1926ffedb8ae4aa533faffeec6338ef973af7).
Workflow run: https://github.com/stellar/stellar-rpc/actions/runs/27789957778
Posting results when the run finishes.

github-actions · 2026-06-18T21:39:00Z

📈 Ingest load test — `3b33626`

Profile	Ledgers	Wall-clock	Ledgers/sec	ms/ledger	p50 / p95 / p99 ms
apply-load-v27-oz	1000	1234.525s	0.81	1235.76	1150 / 1700.001 / 1925
apply-load-v27-sac	1000	1138.699s	0.88	1138.70	1149.999 / 1225 / 1275
apply-load-v27-soroswap	1000	829.349s	1.21	829.35	825.002 / 900.001 / 974.999

Metric	Value
Ledgers replayed	3000
Initial DB ledger count	120960
Overall throughput	0.94 ledgers/sec
Overall ingest wall-clock	3202.573s
Per-ledger p50 / p95 / p99	1099.998 / 1450.001 / 1824.999 ms
Golden DB fetch+decompress	2440s
stellar-core	`v27.0.0`
Workflow run	#27782520036

github-actions · 2026-06-18T23:44:41Z

📈 Ingest load test — `fdb1926`

Profile	Ledgers	Wall-clock	Ledgers/sec	ms/ledger	p50 / p95 / p99 ms
apply-load-v27-oz	1000	1234.300s	0.81	1235.54	1150 / 1674.999 / 1925
apply-load-v27-sac	1000	1137.950s	0.88	1137.95	1149.999 / 1225 / 1275
apply-load-v27-soroswap	1000	829.175s	1.21	829.17	849.998 / 900.001 / 975

Metric	Value
Ledgers replayed	3000
Initial DB ledger count	120960
Overall throughput	0.94 ledgers/sec
Overall ingest wall-clock	3201.424s
Per-ledger p50 / p95 / p99	1099.998 / 1450 / 1824.999 ms
Golden DB fetch+decompress	2446s
stellar-core	`v27.0.0`
Workflow run	#27789957778

cjonas9 added 26 commits May 8, 2026 21:32

pull initial work from branch load-testing

3392f09

add ledger generation test adapted for RPC

ffe27bc

add apply load config

65da3b5

add generated ledger output to infrastructure/testdata/

34f086d

add basic ingestion of synthetic ledgers phase

80c982d

disable debug logs for load test for timeout reasons

94c69b7

add functions for snapshotting + restoring test DB

f4a16f9

improve and restructure db restoration helpers/API

1d53b96

finish DB restoration logic flow and wiring

baf2255

skip migrations/fee-stats in load test mode

1647464

ingest test: refactor, minor semantic fixes

2f14765

test.go: add retention window to config, fix fake history archive for real DB test

d7c90a9

minor db restore/trim helper fixes

0390757

rename restore backed-up ledgers function for accuracy

e0a86e7

refactor, add env vars, change DB helpers to take sequences

f151a35

remove db restoration functionality

bffb101

add performance metrics json emission functionality

e04d51d

migrate to polling getHealth, change ingest test limits to 1000 ledgers

bd8c784

remove ledger fixtures

c7bc001

add workflow and script

786423d

fix yaml referencing wrong path for script

1606829

fix yml parsing indentation bug

7d41b1a

use head-object for metadata rather than tags

b701108

refine workflow + instance script

b1cec1d

add apply load cfg

b9ef27e

testing: on-push runs

73df1e7

Copilot AI review requested due to automatic review settings May 15, 2026 22:40

Copilot started reviewing on behalf of cjonas9 May 15, 2026 22:40

minor yml syntax fixes

241bdf8

tamirms reviewed Jun 17, 2026

cjonas9 mentioned this pull request Jun 17, 2026

Add GHA coordinator for performance evaluation task scatter/gather #791

Draft

cjonas9 linked an issue Jun 17, 2026 that may be closed by this pull request

Release eval: Add repeatable Core apply-load integration test #711

Open

cjonas9 added 4 commits June 17, 2026 17:44

update go version

65cb6ff

update default ledger bundle/config to existent ones for test

4aca1ef

decompose ec2 script into go programs

3d61ceb

reduce comment verbosity, minor clean up

9bef115

github-advanced-security AI found potential problems Jun 18, 2026

Comment thread cmd/stellar-rpc/internal/integrationtest/infrastructure/load-test/runner/orchestrate.go Fixed

install jq on load-test box for build-libs; surface build-libs errors

8b39ed5

throttle load-test benchmark via cgroup io.max instead of EBS ModifyVolume experiment

714b1fa

tamirms reviewed Jun 18, 2026

Comment thread cmd/stellar-rpc/internal/integrationtest/infrastructure/load-test/refresh/refresh-tool Outdated

cjonas9 added 2 commits June 18, 2026 11:12

drop accidentally-committed refresh tooling and orphaned apply-load.cfg

d9987f6

fix linter errors in load-test runner

5058ad3

cjonas9 added 2 commits June 18, 2026 14:58

make load-test ingest frequency and ledger count configurable via env

8cf9f6b

EXPERIMENT: run load-test benchmark un-throttled at volume-provisioned 125 MiB/s

3b33626

cjonas9 added 2 commits June 18, 2026 16:54

use SDK's maxLedgersPerFile ceiling and multiple-bundle functionality

7e60661

simplify verification walk

fdb1926

5 participants
