move wake-stage to (sync) prim. by dtzSiFive · Pull Request #1854 · sifiveinc/wake

Will Dietz (dtzSiFive) · 2026-05-14T21:22:53Z

The job overhead for wake-stage's trivial amount of work amplifies costs
especially of startup latency when sourcing many files.

Additionally, the point of staging is to get the required information
ASAP so as to not race with concurrent modifications.

Putting into job queue is more overhead than just doing the reflink
directly, and as a primitive we ensure the staging is done immediately
not stuck behind a queue of hashing jobs.

This was especially noticeable with concurrent runs (multi-wake).

Issue diagnosed with help of wake --ps! :)

Assisted by Claude.

1347089 shows wake-stage as a "move" which is the bulk of the work being done here; I couldn't help but unify this code to preferring wcl::result instead of exceptions while touching which apparently crosses the line.

The job overhead for wake-stage's trivial amount of work amplifies costs especially of startup latency when sourcing many files. Additionally, the point of staging is to get the required information ASAP so as to not race with concurrent modifications. Putting into job queue is more overhead than just doing the reflink directly, and as a primitive we ensure the staging is done immediately not stuck behind a queue of hashing jobs. This was especially noticeable with concurrent runs (multi-wake). Issue diagnosed with help of `wake --ps`! :)

Will Dietz (dtzSiFive) · 2026-05-15T04:59:14Z

Compared to v49 on a "source every file in a large repo" pathological (if not representative or necessarily disproving possible regressions in other situations) test case:

Nuking cache/db between runs:

Benchmark 1: mwprim-single
  Time (mean ± σ):     74.711 s ±  2.151 s    [User: 121.918 s, System: 79.676 s]
  Range (min … max):   72.555 s … 78.342 s    10 runs

Benchmark 2: w49-single
  Time (mean ± σ):     109.129 s ±  1.928 s    [User: 221.944 s, System: 138.339 s]
  Range (min … max):   107.217 s … 112.847 s    10 runs

Nuking cache/db between batches and priming with warmup run:

Benchmark 1: mwprim-single
  Time (mean ± σ):     14.928 s ±  0.214 s    [User: 10.183 s, System: 4.539 s]
  Range (min … max):   14.722 s … 15.478 s    10 runs

Benchmark 2: w49-single
  Time (mean ± σ):     23.402 s ±  0.463 s    [User: 18.275 s, System: 4.887 s]
  Range (min … max):   22.964 s … 24.239 s    10 runs

With this change, wake (with WAKE_CAS=1, of course) does considerably less work (user/system) in less wall time.

Benchmark was this (quoted for passing to hyperfine):

'WAKE_CAS=1 wake -x "sources \".\" \`.*\`|rmap len"'

Will Dietz (dtzSiFive) · 2026-05-15T05:02:42Z

+    RETURN(claim_result(runtime.heap, false, err));
+  };
+
+  std::stringstream paths_stream(std::string(paths_arg->c_str(), paths_arg->size()));


As noted in detail elsewhere, the input path here both uses an arguably inappropriate separator character (more importantly, inconsistent with elsewhere) and makes many copies of the string in full or in part while processing.

If not in this PR, then in a follow-up:

Use null-terminator separator, including one at end if needed explicitly

Don't use stringstream (!)

Split into std::string_views using string::find (-> memchr)

One neat bonus of this approach besides being zero-copy is that we can pass C-strings (null-terminated) to syscalls directly without copying!

Will Dietz (dtzSiFive) requested review from Abrar Quazi (AbrarQuazi) and Sam May (ag-eitilt) May 14, 2026 21:22

Will Dietz (dtzSiFive) force-pushed the feature/stage-as-prim branch from a8c5d11 to 1a6231b Compare May 14, 2026 21:25

Will Dietz (dtzSiFive) added 2 commits May 14, 2026 14:42

stage prim: prefer wcl::result over exceptions.

941fa5f

Will Dietz (dtzSiFive) force-pushed the feature/stage-as-prim branch from 1a6231b to 941fa5f Compare May 14, 2026 21:42

Will Dietz (dtzSiFive) commented May 15, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

move wake-stage to (sync) prim.#1854

move wake-stage to (sync) prim.#1854
Will Dietz (dtzSiFive) wants to merge 2 commits into
feature/multi-wakefrom
feature/stage-as-prim

Will Dietz (dtzSiFive) commented May 14, 2026 •

edited

Loading

Uh oh!

Will Dietz (dtzSiFive) commented May 15, 2026

Uh oh!

Will Dietz (dtzSiFive) May 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Will Dietz (dtzSiFive) commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Will Dietz (dtzSiFive) commented May 15, 2026

Uh oh!

Will Dietz (dtzSiFive) May 15, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Will Dietz (dtzSiFive) commented May 14, 2026 •

edited

Loading