
some perf and documentation changes #355

Open
hackaugusto wants to merge 20 commits into jonhoo:main from hackaugusto:improvements

Conversation

@hackaugusto

I made these changes while reading the code and trying to implement #342. Unfortunately this PR doesn't achieve that, but it does add a 5% to 10% perf gain (criterion runs vary a lot on my machine), along with some docs and hopefully some nice small improvements.


@hackaugusto hackaugusto left a comment


Left some comments to give context on the changes; this turned out to be a bigger change than I anticipated.

pub(super) frames: Vec<TimedFrame<'a>>,
pub(super) accumulated_samples: usize,
pub(super) delta_max: usize,
}

I replaced the HashMap with a Vec stack, because that matches how the flow works (it pops all frames that were in previous but are no longer in current, and pushes the new frames from current). This meant there was no need for Frame to act as a key, so all of its fields could move into TimedFrame. This gave a small perf improvement.
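The stack-based flow can be sketched roughly like this (a minimal illustration with simplified names; `common_prefix_len` is hypothetical, not the PR's actual code): the shared prefix of the previous and current stacks stays open, everything past it in `previous` is popped and closed, and everything past it in `current` is pushed.

```rust
/// Length of the shared prefix of two stacks; frames past this point in
/// `prev` get popped (closed), frames past it in `curr` get pushed (opened).
fn common_prefix_len(prev: &[&str], curr: &[&str]) -> usize {
    prev.iter().zip(curr.iter()).take_while(|(a, b)| a == b).count()
}

fn main() {
    let prev = ["main", "parse", "read"];
    let curr = ["main", "parse", "write"];
    let shared = common_prefix_len(&prev, &curr);
    // "read" is popped from the open stack, "write" is pushed onto it
    assert_eq!(shared, 2);
}
```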

open_frames: &mut Vec<TimedFrame<'a>>,
closed_frames: &mut Vec<TimedFrame<'a>>,
previous_frames: &[&'a str],
current_frames: &[&'a str],

Changing the inputs from iterators to slices gave a few benefits:

  • the caller can split each line on ; once and cache the result for the next iteration
  • this function can slice into the frames and use extend, which seems to give a small perf improvement
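A minimal sketch of the caching idea (names assumed, not the PR's actual signatures): the caller keeps one `Vec<&str>` and refills it in place, so each line is split on `;` exactly once and the allocation is reused across iterations.

```rust
/// Refill a reusable buffer with the frames of one folded-stack line.
/// The Vec's allocation survives across calls, so repeated lines only
/// pay for the split, not for a fresh allocation.
fn split_frames<'a>(line: &'a str, buf: &mut Vec<&'a str>) {
    buf.clear();
    buf.extend(line.split(';'));
}

fn main() {
    let mut current: Vec<&str> = Vec::new();
    split_frames("main;parse;read", &mut current);
    assert_eq!(current, ["main", "parse", "read"]);
    // the same Vec is reused for the next line, saving an allocation
    split_frames("main;parse;write", &mut current);
    assert_eq!(current, ["main", "parse", "write"]);
}
```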

let mut delta = None;
let mut delta_max = 1;
let mut stripped_fractional_samples = false;
let mut prev_line = None;

The idea was to make frames into an Iterator, so I was dropping some of the stack variables to make the to-be-implemented FramesIterator smaller.

The two main changes here are:

  • open_frames and closed_frames are Vecs of the same type, as opposed to tmp and frames, which were a HashMap and a Vec. This is because the flow now uses open_frames as a stack of the shared frames, and pushes the finalized frames to closed_frames
  • the code caches the parsed line into the previous variable, and the previous and current Vecs are reused to save allocations. This also removed the need to special-case the first and last iterations with a None for an iterator
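The buffer-reuse part can be illustrated with `std::mem::swap` (a simplified sketch, not the PR's actual code): after a line is parsed into `current`, the two Vecs trade places, so `previous` holds the parsed line for the next iteration and both allocations are kept alive.

```rust
use std::mem;

fn main() {
    let mut previous: Vec<&str> = vec!["main", "parse"];
    let mut current: Vec<&str> = Vec::new();
    // parse the next line into `current`...
    current.extend("main;render".split(';'));
    // ...then swap so `previous` caches the parsed line for the next
    // iteration, reusing both allocations instead of reallocating.
    mem::swap(&mut previous, &mut current);
    current.clear();
    assert_eq!(previous, ["main", "render"]);
    assert!(current.is_empty());
}
```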

);
current.extend(iter::once("").chain(line.split(';')));

if !SUPPRESS_SORT_CHECK {

Making this a const gave a small perf improvement.
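A sketch of the pattern (the flag name mirrors the PR; the surrounding code is invented for illustration): with a `const`, the branch is resolved at compile time, so the optimizer can remove the sort check entirely when the flag is set.

```rust
// Compile-time flag: a `const` lets the compiler eliminate the branch
// (and the check's code) entirely, instead of testing a runtime value.
const SUPPRESS_SORT_CHECK: bool = false;

/// Illustrative check that folded-stack lines arrive in sorted order.
fn is_sorted(lines: &[&str]) -> bool {
    lines.windows(2).all(|w| w[0] <= w[1])
}

fn main() {
    let lines = ["a;b 1", "a;c 2"];
    if !SUPPRESS_SORT_CHECK {
        assert!(is_sorted(&lines));
    }
}
```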

reversed.push(stack);
}
let mut reversed: Vec<&str> = reversed.iter().collect();
reversed.sort_unstable();

Moved this to a utility. The main change is that the function returns a Vec<String> instead of using StrStack. The idea is that reversed.push(stack) copies the data and may reallocate, and as more data is added the cost of reallocating grows. Not sure if it is worth it, though.
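A hypothetical shape for such a utility (the PR's real code may differ): reverse the frames of each folded-stack line and collect owned `String`s, so the result can be sorted directly without going through a StrStack.

```rust
/// Reverse the frame order of one folded-stack line ("a;b;c N" -> "c;b;a N"),
/// keeping the trailing sample count in place.
fn reverse_stack(line: &str) -> String {
    let (frames, count) = line.rsplit_once(' ').unwrap_or((line, ""));
    let mut parts: Vec<&str> = frames.split(';').collect();
    parts.reverse();
    format!("{} {}", parts.join(";"), count)
}

fn main() {
    let mut reversed: Vec<String> = ["main;parse 3", "main;render 2"]
        .iter()
        .map(|l| reverse_stack(l))
        .collect();
    reversed.sort_unstable();
    assert_eq!(reversed, ["parse;main 3", "render;main 2"]);
}
```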

@@ -398,54 +400,35 @@ where
I: IntoIterator<Item = &'a str>,

This is the biggest blocker: to make this work with big files we would at least need to change the public API and add + Clone, so that we can do two passes.

I would still like to add an API to handle big files; in my case it would be enough to have another API that receives parsed entries together with the total time and delta max. What do you think?
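The `+ Clone` idea could look roughly like this (a sketch under the assumption that the first pass only needs totals; `total_samples` is a made-up name, not a proposed signature):

```rust
/// First pass over the input: sum the trailing sample counts. Requiring
/// `I: Clone` keeps the original iterator available for a second pass
/// that would emit the actual output.
fn total_samples<'a, I>(lines: I) -> usize
where
    I: IntoIterator<Item = &'a str> + Clone,
{
    lines
        .clone()
        .into_iter()
        .filter_map(|l| l.rsplit_once(' ')?.1.parse::<usize>().ok())
        .sum()
}

fn main() {
    let lines = vec!["a 2", "a;b;c 16", "a;b 2"];
    assert_eq!(total_samples(lines.iter().copied()), 20);
    // second pass would reuse `lines` here
}
```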


@hackaugusto hackaugusto Feb 26, 2026


To add to the API idea, I was thinking about something like this:

  1. for a text format:
document := (comment | line)*
comment := "#" [^\n\r]*
line := <depth> <frame> <pct_value> <pct_delta>?

The rules would be: the depth either increases by 1 or decreases to at most the current depth, and the pct_* values have to add up to 100. So something like:

a 2
a;b;c 16
a;b 2

would become:

1 a 10
2 b 0
3 c 80
2 b 10

and the programmatic API would take an IntoIterator<Item=&Entry> where:

struct Entry<'a> {
    pub depth: usize,
    pub function: &'a str,
    pub pct_value: usize,
    pub pct_delta: Option<usize>,
}
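A quick sketch of the invariants described above (a hypothetical validator, not part of the proposal itself): depth may only grow by exactly 1 or fall back to a shallower level, and the pct values must sum to 100.

```rust
#[allow(dead_code)]
struct Entry<'a> {
    pub depth: usize,
    pub function: &'a str,
    pub pct_value: usize,
    pub pct_delta: Option<usize>,
}

/// Check the proposed invariants: depth starts at 1, grows by at most 1
/// per entry, and the pct values add up to exactly 100.
fn validate(entries: &[Entry]) -> bool {
    let mut depth = 0;
    let mut total = 0;
    for e in entries {
        if e.depth == 0 || e.depth > depth + 1 {
            return false;
        }
        depth = e.depth;
        total += e.pct_value;
    }
    total == 100
}

fn main() {
    // the worked example from the comment above
    let entries = [
        Entry { depth: 1, function: "a", pct_value: 10, pct_delta: None },
        Entry { depth: 2, function: "b", pct_value: 0, pct_delta: None },
        Entry { depth: 3, function: "c", pct_value: 80, pct_delta: None },
        Entry { depth: 2, function: "b", pct_value: 10, pct_delta: None },
    ];
    assert!(validate(&entries));
}
```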
