Scale nodes by file size #351

zimonitrome · 2025-09-30T13:15:10Z

Summary

This PR adds optional file-node scaling by file size via -S / --scale-by-file-size.

When -S is not set, existing Gource behavior is unchanged.

Related issues: #91 #54 #147 #223

Screencast.from.2026-02-10.17.22.23_trimmed.webm

What changed

Added file-size-based node scaling (-S / --scale-by-file-size).
Added smooth interpolation when file size changes (so node size transitions are animated).
Added a dedicated packing solver for scaled nodes to reduce wobble and avoid persistent overlap in dense directories.
Added deterministic small-cluster shaping to prevent line/chain layouts for small groups of files.
Added optional hover text for file size (--show-file-size-on-hover).

Git size lookup changes

Reworked Git size lookup to index blob sizes up front using:
- git cat-file --batch-all-objects --batch-check='%(objectname) %(objecttype) %(objectsize)'
During parsing, file sizes are looked up from this in-memory index (no per-event subprocess calls).
Git raw log command now uses --abbrev=40 so blob hashes match the prebuilt index.
Existing log file compatibility:
- If scaling is OFF, older/partial raw lines continue to parse.
- If scaling is ON and required hash metadata is missing, parsing fails with a descriptive error.

Custom log format

Custom log supports optional file_size as a 6th field:

file_size is used for A/M actions.

CLI/options added

Primary feature flag:

-S, --scale-by-file-size

Advanced tuning options:

--file-scale
--dir-spacing
--file-gravity
--file-repulsion
--show-file-size-on-hover

Defaults are tuned so -S works reasonably without extra tuning.

Notes

Current scaling uses a logarithmic mapping for readability across large size ranges.

acaudwell · 2025-10-01T22:31:38Z

Cool looking feature. Thanks for the detailed overview of the changes.

src/gource_settings.cpp

zimonitrome · 2025-10-04T18:35:50Z

I have now had some time to test the PR a bit more and I think the physics in particular needs some more tweaking before it could be deemed "stable". Examples:

Adding many files in a single dir causes nodes to constantly "bounce" in and out of the cluster.
Big files in large clusters sometimes overlap.
Small files in large clusters sometimes never separate.

That being said, it still works fine for most common projects I have tested it on.

I have some ideas on how to stabilize it somewhat. Like adding adding an outward force or a more "spring like" gravity model.

For now I made a few changes to the PR:

Reverted the regex (I used an old compiler)
Fixed a bug in file size getting. Previously it only worked for cwd.

I will try to improve it and report back.

dagguh · 2025-10-05T10:06:02Z

src/dirnode.cpp

+    if (gGourceSettings.scale_by_file_size) {
+        for (RFile* f : files) {
+            if (!f->isHidden()) {
+                float r = f->getSize() / 2.0f;


So this sets the radius to file size. It means the actual dot size will be proportional to file size squared. The area of the dots will not be proportional to file size.

src/formats/git.cpp

acaudwell · 2025-11-17T05:28:52Z

src/formats/git.cpp

+
+                if (gGourceSettings.scale_by_file_size && status != "D") {
+                    char cmd_buff[2048];
+                    int written = snprintf(cmd_buff, 2048, "git --git-dir=%s/.git --work-tree=%s cat-file -s %s", m_repository_path.c_str(), m_repository_path.c_str(), dst_blob.c_str());


This seems to cause a performance issue doing this here as it blocks the UI while fetching the blob. Also this wont work if logfile is a file and not a directory. You can also see it block when you move the mouse cursor of the interactive timeline as it peeks at the part of the log under the cursor to get the timestamp.

I think for performance instead what would be good is we get all the file blob sizes up front (maybe just get every blob file size in the repo at once) right after we generate the git log, puts them into a data structure, and then we can look up into it here.

Implemented. I removed per-event git cat-file calls and replaced them with a one-time blob-size index build right after log generation, then O(1) lookups during parsing. Should avoid timeline-hover/UI stalls... I think. Maybe needs more testing.

src/file.cpp

zimonitrome · 2026-02-10T16:15:31Z

I updated the branch to make the "physics layout" more stable and took in some some review feedback.

Main improvements since earlier revisions:

Replaced per-file blob size subprocess calls with a one-time Git blob index build (git cat-file --batch-all-objects ...) and in-memory lookups.
Added compatibility behavior for existing/older raw log files:
- no scaling => continue parsing
- scaling enabled but missing blob metadata => descriptive error
Added --abbrev=40 to the Git raw log command so blob hashes consistently match the index.
Added file size transition animation (interpolated node size changes).
Reworked scaled-node directory packing to reduce wobble/overlap in large clusters and avoid chain-like minima in smaller clusters.

Updated my OP to reflect the changes, also added a new video. Some behavior should maybe be changed? As in distance between directories (clusters)?

Might do a bit more testing but it seems pretty nice all in all right now!

zimonitrome force-pushed the node-sizing branch from 5ba8bc7 to 193af0e Compare September 30, 2025 13:44

zimonitrome mentioned this pull request Sep 30, 2025

[feature request] scale by file size #91

Open

acaudwell reviewed Oct 1, 2025

View reviewed changes

src/gource_settings.cpp Outdated Show resolved Hide resolved

zimonitrome force-pushed the node-sizing branch from 193af0e to 813c3d8 Compare October 2, 2025 06:41

Minor cleanup.

9a28778

zimonitrome force-pushed the node-sizing branch from 813c3d8 to da2c52a Compare October 4, 2025 18:30

dagguh reviewed Oct 5, 2025

View reviewed changes

acaudwell reviewed Nov 17, 2025

View reviewed changes

src/formats/git.cpp Outdated Show resolved Hide resolved

acaudwell reviewed Nov 17, 2025

View reviewed changes

src/formats/git.cpp Show resolved Hide resolved

acaudwell reviewed Nov 17, 2025

View reviewed changes

src/file.cpp Show resolved Hide resolved

Add node scaling by file size.

2f37fb7

zimonitrome force-pushed the node-sizing branch from da2c52a to 2f37fb7 Compare February 10, 2026 16:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Scale nodes by file size #351

Scale nodes by file size #351

Uh oh!

zimonitrome commented Sep 30, 2025 •

edited

Loading

Uh oh!

acaudwell commented Oct 1, 2025

Uh oh!

Uh oh!

zimonitrome commented Oct 4, 2025 •

edited

Loading

Uh oh!

dagguh Oct 5, 2025

Uh oh!

Uh oh!

Uh oh!

acaudwell Nov 17, 2025 •

edited

Loading

Uh oh!

zimonitrome Feb 10, 2026 •

edited

Loading

Uh oh!

Uh oh!

zimonitrome commented Feb 10, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Scale nodes by file size #351

Are you sure you want to change the base?

Scale nodes by file size #351

Uh oh!

Conversation

zimonitrome commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

Git size lookup changes

Custom log format

CLI/options added

Notes

Uh oh!

acaudwell commented Oct 1, 2025

Uh oh!

Uh oh!

zimonitrome commented Oct 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dagguh Oct 5, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

acaudwell Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zimonitrome Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zimonitrome commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zimonitrome commented Sep 30, 2025 •

edited

Loading

zimonitrome commented Oct 4, 2025 •

edited

Loading

acaudwell Nov 17, 2025 •

edited

Loading

zimonitrome Feb 10, 2026 •

edited

Loading

zimonitrome commented Feb 10, 2026 •

edited

Loading