feat: add ChipIngress batch emitter support by thomaska · Pull Request #21327 · smartcontractkit/chainlink

thomaska · 2026-02-27T14:43:16Z

Ticket: https://smartcontract-it.atlassian.net/browse/INFOPLAT-3436

Summary

- Add ChipIngressBatchEmitterEnabled telemetry config flag (default false) to toggle batch mode per-node without a code change
- Implement PublishBatch on the chip-testsink gRPC server so CRE system tests work when batch mode is enabled
- Bump chainlink-common to include batch-emitter feature-flag support

Detail

Config flag – new ChipIngressBatchEmitterEnabled boolean in [Telemetry].
Wired through the config interface, TOML types, beholder globals, docs, and
test fixtures.

chip-testsink – the test-helper server only had single-event Publish,
inheriting UNIMPLEMENTED for PublishBatch. Now delegates each batch event to
the configured PublishFunc and forwards the full batch upstream in one RPC.

Why

9 CRE system tests depend on chip-testsink. Without this change they get gRPC
UNIMPLEMENTED errors once batch mode is the default.

Requires

smartcontractkit/chainlink-common#1862

github-actions · 2026-02-27T14:43:32Z

👋 thomaska, thanks for creating this pull request!

To help reviewers, please consider creating future PRs as drafts first. This allows you to self-review and make any final changes before notifying the team.

Once you're ready, you can mark it as "Ready for review" to request feedback. Thanks!

github-actions · 2026-02-27T14:44:21Z

✅ No conflicts with other open PRs targeting develop

Copilot

Pull request overview

Adds PublishBatch support to the chip-testsink gRPC server so CRE/system tests don’t fail with UNIMPLEMENTED when nodes emit batched ChIP ingress events.

Changes:

- Implement PublishBatch on the chip-testsink ChipIngressServer.
- Delegate batch handling to the existing Publish flow (including configured PublishFunc and optional upstream forwarding).

Copilot · 2026-02-27T14:46:29Z

system-tests/tests/test-helpers/chip-testsink/server.go

+	}
+
+	for _, event := range batch.Events {
+		if _, err := s.Publish(ctx, event); err != nil {


Calling s.Publish() inside the batch loop triggers the per-event async upstream forwarding goroutine in Publish(). For large batches this can create a burst of goroutines and N upstream RPCs. Consider handling upstream forwarding in PublishBatch with a single PublishBatch call (or at least a bounded worker/pool), and calling the configured PublishFunc directly for local handling to avoid unbounded goroutine/RPC fan-out per batch.

Suggested change

-		if _, err := s.Publish(ctx, event); err != nil {
+		// Forward upstream synchronously to avoid spawning a goroutine per event.
+		if s.cfg.UpstreamEndpoint != "" {
+			forwardCtx, cancelFn := context.WithTimeout(context.Background(), 10*time.Second)
+			_, err := s.upstream.Publish(forwardCtx, event)
+			cancelFn()
+			if err != nil {
+				log.Printf("failed to forward to upstream: %v", err)
+			}
+		}
+		if _, err := s.cfg.PublishFunc(ctx, event); err != nil {

Copilot · 2026-02-27T14:46:30Z

system-tests/tests/test-helpers/chip-testsink/server.go

+// It delegates each event in the batch to the configured PublishFunc,
+// mirroring how the real ChIP Ingress processes batches atomically.


The doc comment claims this mirrors how the real ChIP ingress processes batches "atomically", but this implementation is not atomic: it publishes events one-by-one and can return an error after earlier events have already been accepted/forwarded. Please either adjust the comment to reflect best-effort sequential processing, or change the implementation to provide the atomicity guarantees being documented.

Suggested change

-// It delegates each event in the batch to the configured PublishFunc,
-// mirroring how the real ChIP Ingress processes batches atomically.
+// It delegates each event in the batch to the configured PublishFunc
+// sequentially, returning an error on the first failure. Earlier events
+// in the batch may already have been published or forwarded when an error
+// is returned, so processing is best-effort rather than atomic.

trunk-io · 2026-02-27T16:13:25Z

View Full Report ↗︎ ⋅ Docs

github-actions · 2026-03-02T11:23:23Z

I see you updated files related to core. Please run make gocs in the root directory to add a changeset, and include at least one of the following tags in the changeset text:

#added For any new functionality added.
#breaking_change For any functionality that requires manual action for the node to boot.
#bugfix For bug fixes.
#changed For any change to the existing functionality.
#db_update For any feature that introduces updates to database schema.
#deprecation_notice For any upcoming deprecation functionality.
#internal For changesets that need to be excluded from the final changelog.
#nops For any feature that is NOP facing and needs to be in the official Release Notes for the release.
#removed For any functionality/config that is removed.
#updated For any functionality that is updated.
#wip For any change that is not ready yet and external communication about it should be held off till it is feature complete.

# Conflicts: # core/scripts/go.mod # core/scripts/go.sum # deployment/go.mod # deployment/go.sum # go.mod # go.sum # integration-tests/go.mod # integration-tests/go.sum # integration-tests/load/go.sum # system-tests/lib/go.mod # system-tests/lib/go.sum # system-tests/tests/go.sum

pkcll · 2026-03-17T16:01:10Z

Based on my thorough review of both PRs, here is a concrete implementation plan for registering ChipIngressBatchEmitter as a managed service in NewApplication.

Implementation Plan

Problem

The ChipIngressBatchEmitter is created and started inside beholder.NewGRPCClient() (called from initGlobals in shell.go), but it's not registered in the chainlink application's service list. This means:

No health check visibility (/health)
No ordered shutdown participation
No service lifecycle management

Step 1: Expose the batch emitter from `beholder.Client` (chainlink-common PR #1862)

Currently in pkg/beholder/client.go, the batchEmitterService is a local variable inside NewGRPCClient:

var batchEmitterService *ChipIngressBatchEmitter

Changes needed:

A. Add a field to the Client struct:

type Client struct {
    // ... existing fields ...
    BatchEmitter *ChipIngressBatchEmitter // nil when batch mode is disabled
}

B. Store the emitter when constructing the client (line ~261):

Change the return statement to include the batch emitter:

return &Client{cfg, logger, tracer, meter, emitter, chipIngressClient, loggerProvider, tracerProvider, meterProvider, messageLoggerProvider, signer, onClose, batchEmitterService}, nil

C. Add a public getter:

// GetChipIngressBatchEmitter returns the batch emitter service, or nil if batch mode is disabled.
func (c *Client) GetChipIngressBatchEmitter() *ChipIngressBatchEmitter {
    if c == nil {
        return nil
    }
    return c.BatchEmitter
}
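
A short sketch of why the nil-receiver guard and an explicit nil check at the call site both matter, using hypothetical stand-in types rather than the real beholder ones. In Go, a nil `*T` assigned to an interface yields a non-nil interface value, so appending the getter's result to a service slice without checking the pointer first would register an unusable entry.

```go
package main

import "fmt"

// Service is a hypothetical one-method interface standing in for the
// service interface used by the application's service list.
type Service interface{ Name() string }

// BatchEmitter and Client are simplified stand-ins for illustration only.
type BatchEmitter struct{ name string }

func (b *BatchEmitter) Name() string { return b.name }

type Client struct {
	batchEmitter *BatchEmitter // nil when batch mode is disabled
}

// GetBatchEmitter is safe to call on a nil *Client.
func (c *Client) GetBatchEmitter() *BatchEmitter {
	if c == nil {
		return nil
	}
	return c.batchEmitter
}

func main() {
	var missing *Client
	fmt.Println(missing.GetBatchEmitter() == nil) // true: no panic on nil receiver

	disabled := &Client{}
	emitter := disabled.GetBatchEmitter()
	fmt.Println(emitter == nil) // true: pointer comparison

	// Pitfall: the same nil pointer stored in an interface is not nil.
	var svc Service = emitter
	fmt.Println(svc == nil) // false

	// So check the concrete pointer before appending to a []Service.
	var srvcs []Service
	if emitter != nil {
		srvcs = append(srvcs, emitter)
	}
	fmt.Println(len(srvcs)) // 0
}
```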

Step 2: Register the batch emitter in `NewApplication` (chainlink PR #21327)

In core/services/chainlink/application.go, after telemetryManager is appended to srvcs (around line 447–448), add:

telemetryManager := telemetry.NewManager(cfg.TelemetryIngress(), csaKeystore, globalLogger)
srvcs = append(srvcs, telemetryManager)

// Register ChipIngressBatchEmitter for health checks and ordered shutdown.
// The emitter is already started by beholder.NewGRPCClient during initGlobals;
// appending it here gives us health visibility and ensures Close() runs on shutdown.
if beholderClient := beholder.GetClient(); beholderClient != nil {
    if batchEmitter := beholderClient.GetChipIngressBatchEmitter(); batchEmitter != nil {
        srvcs = append(srvcs, batchEmitter)
    }
}

Step 3: No changes to `initGlobals` in `shell.go`

The current PR #21327 changes to shell.go are already correct — they pass ChipIngressBatchEmitterEnabled and ChipIngressLogger through to beholder.Config. Since beholder creates/starts the emitter internally and we just retrieve a reference afterward, initGlobals doesn't need any additional changes.

Why this works

| Concern | How it's addressed |
| --- | --- |
| Initialization order | `initGlobals` → `beholder.NewGRPCClient` runs in `beforeNode()` before `NewApplication`, so `beholder.GetClient()` is already populated |
| Double-start | `services.Engine`'s `Start()` is idempotent; the service framework calling `Start()` again is a no-op since it's already running |
| Nil safety | When `ChipIngressBatchEmitterEnabled = false` (default), `batchEmitterService` remains `nil`, and both nil checks protect against it |
| Shutdown order | Services in `srvcs` are stopped in reverse order. The emitter is closed before beholder's own `Client.Close()` (which closes the gRPC connection), matching the drain-before-disconnect requirement already implemented in PR #1862's reordered `Client.Close()` |
| Health checks | `ChipIngressBatchEmitter` implements `services.Service` via `services.Engine`, exposing `Name()`, `Ready()`, `HealthReport()`, all needed for `/health` |

Files summary

| File | Repo | Change |
| --- | --- | --- |
| `pkg/beholder/client.go` | `chainlink-common` (PR #1862) | Add `BatchEmitter` field to `Client`, store it in constructor, add `GetChipIngressBatchEmitter()` getter |
| `core/services/chainlink/application.go` | `chainlink` (PR #21327) | After `telemetryManager`, retrieve and append batch emitter to `srvcs` |

Note: The PR #21327 file list may be incomplete (API results limited to 30 files). You can view the full file list here.

…ingress-publishBatch

# Conflicts: # core/scripts/cre/environment/examples/workflows/v1/proof-of-reserve/cron-based/go.mod # core/scripts/cre/environment/examples/workflows/v1/proof-of-reserve/cron-based/go.sum # core/scripts/cre/environment/examples/workflows/v1/proof-of-reserve/web-trigger-based/go.mod # core/scripts/cre/environment/examples/workflows/v1/proof-of-reserve/web-trigger-based/go.sum # core/scripts/cre/environment/examples/workflows/v2/proof-of-reserve/cron-based/go.mod # core/scripts/cre/environment/examples/workflows/v2/proof-of-reserve/cron-based/go.sum # core/scripts/go.mod # core/scripts/go.sum # deployment/go.mod # deployment/go.sum # devenv/go.mod # devenv/go.sum # go.mod # go.sum # integration-tests/go.mod # integration-tests/go.sum # integration-tests/load/go.mod # integration-tests/load/go.sum # system-tests/lib/go.mod # system-tests/lib/go.sum # system-tests/tests/canaries_sentinels/proof-of-reserve/cron-based/go.mod # system-tests/tests/canaries_sentinels/proof-of-reserve/cron-based/go.sum # system-tests/tests/go.mod # system-tests/tests/go.sum

…ingress-publishBatch # Conflicts: # core/scripts/go.mod # core/scripts/go.sum # deployment/go.mod # deployment/go.sum # devenv/go.mod # go.mod # go.sum # integration-tests/go.mod # integration-tests/go.sum # integration-tests/load/go.mod # integration-tests/load/go.sum # system-tests/lib/go.mod # system-tests/lib/go.sum # system-tests/tests/go.mod # system-tests/tests/go.sum

…ngress-publishBatch' into infoplat-3436-chipingress-publishBatch

.github/workflows/go-mod-validation.yml

+    name: Validate go.mod dependencies
+    runs-on: ubuntu-latest
+    if: ${{ github.event_name == 'pull_request' }}
+    steps:
+      - uses: actions/checkout@v6
+
+      - name: Validate go.mod
+        uses: smartcontractkit/.github/apps/go-mod-validator@go-mod-validator/v1
+        with:
+          repo-branch-exceptions: |
+            smartcontractkit/chainlink-ccip:develop
+            smartcontractkit/chainlink-sui:2.31.0-cherry-picked
+            smartcontractkit/chainlink-protos:capabilities-development
+            smartcontractkit/cre-sdk-go:capabilities-development
+            smartcontractkit/cre-sdk-typescript:capabilities-development
+            smartcontractkit/chainlink-common:infoplat-3436-chipingress-publishBatch


To fix this, explicitly restrict the GITHUB_TOKEN permissions used by the workflow so it doesn’t rely on potentially broad repository defaults. The most direct and non-breaking approach is to add a permissions block setting contents: read, which is sufficient for checking out code and reading go.mod files.

The single best fix here is to add a job-level permissions block under go-mod-validation: in .github/workflows/go-mod-validation.yml, right after the job name line. This ensures only this job is affected and does not change any triggers, steps, or existing behavior beyond tightening token permissions. No imports, methods, or additional definitions are needed.

Concretely, in .github/workflows/go-mod-validation.yml, modify the go-mod-validation job definition to include:

    permissions:
      contents: read

so that the job explicitly documents and enforces read-only access to repository contents for the GITHUB_TOKEN.

cl-sonarqube-production · 2026-03-26T07:01:41Z

Quality Gate passed

Issues
0 New issues
0 Fixed issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube

Amend chip-testsink

d090d7f

Copilot AI review requested due to automatic review settings February 27, 2026 14:43

thomaska requested review from a team as code owners February 27, 2026 14:43

thomaska mentioned this pull request Feb 27, 2026

Chipingress publish batch smartcontractkit/chainlink-common#1862

Draft

3 tasks

product-security-plaid-production bot requested a review from DylanTinianov February 27, 2026 14:43

product-security-plaid-production bot requested a review from george-dorin February 27, 2026 14:43

Copilot started reviewing on behalf of thomaska February 27, 2026 14:43

Copilot AI reviewed Feb 27, 2026

Fix test

4ce4051

jmank88 previously approved these changes Feb 27, 2026

pkcll added the build-publish Build and Publish image to SDLC label Feb 27, 2026

Add ChipIngressBatchEmitterEnabled feature flag

bcb14db

thomaska dismissed jmank88’s stale review via bcb14db March 2, 2026 11:22

thomaska added 4 commits March 2, 2026 14:41

Bump chainlink-common to include batch emitter feature flag

1875c1e

Merge branch 'develop' into infoplat-3436-chipingress-publishBatch

2db09f2

# Conflicts: # go.mod # go.sum

Update docs

778855d

Update all versions

3f2882b

thomaska requested a review from a team as a code owner March 2, 2026 14:02

product-security-plaid-production bot requested a review from Tofel March 2, 2026 14:02

Enable in test workflow nodes

11418f5

thomaska changed the title ~~Amend chip-testsink~~ feat: add ChipIngress batch emitter support Mar 2, 2026

thomaska added 4 commits March 4, 2026 14:01

Update batching with retry+drain

ac1d575

Merge branch 'develop' into infoplat-3436-chipingress-publishBatch

b80d145

Update chainlink-common

0b6c884

Use version with new defaults

97e9816

thomaska added 8 commits March 6, 2026 23:28

Using batchClient

fe70963

Update all modules

0334494

Bump chainlink-common to fix missing ChipIngressLogger in loop server

3efad90

Update modules again

3cdba0f

Add missing logger 2

b0ef50d

Update chainlink-common

ecb6174

Merge branch 'develop' into infoplat-3436-chipingress-publishBatch

3dbbcbe

thomaska added 5 commits March 17, 2026 20:03

Merge branch 'develop' into infoplat-3436-chipingress-publishBatch

1626b09

Merge remote-tracking branch 'origin/develop' into infoplat-3436-chip…

b2dd2de

…ingress-publishBatch

Update chainlink-common: use service

72517b8

Adding beholder to services

13a3879

pkcll marked this pull request as draft March 25, 2026 05:02

pkcll added 6 commits March 25, 2026 01:14

chore: bump chainlink-common for chipingress batch support

1b98427

chore: update chainlink-common refs to PR 1862

6042dd6

Merge remote-tracking branch 'refs/remotes/origin/infoplat-3436-chipi…

70a7baa

…ngress-publishBatch' into infoplat-3436-chipingress-publishBatch

chore: restore validator-safe chainlink-common pins

216bd48

ci: allow chainlink-common PR branch in go mod validation

c49652b

github-advanced-security bot found potential problems Mar 26, 2026

fix: refresh canary workflow go sums

a50d0bc

thomaska wants to merge 32 commits into develop from infoplat-3436-chipingress-publishBatch

5 participants
