feat: dynamic stream buffer by francisdb · Pull Request #79 · mdsteele/rust-cfb

francisdb · 2026-02-02T11:04:11Z

This pull request introduces a new dynamically growing buffer implementation for streams, replacing the previous fixed-size buffer approach.

This introduces a breaking change as the options on the Stream have been moved to the CompundFile. Also you now configuring max_buffer_size instead of buffer_size with a default of 1 MiB.

I ran the original benchmark and with buffer size ignored there were only improvements, no regressions. So I removed the specific benchmarks per buffer size.

francisdb · 2026-02-02T11:07:03Z

@mdsteele that max_buffer_size would probably make more sense to be configured for the whole cfb file instead of per stream. This for memory-constrained environments. What are your thoughts?

mdsteele · 2026-02-02T19:36:33Z

Thanks for working on this!

If this is a better/more-performant design, then I think a breaking change here makes sense; the API that would get broken was only just recently published, so probably not a lot of dependencies on it yet, and anyway this is a pre-1.0 crate.

Making this a per-CompoundFile setting seems reasonable to me too. In that case, my initial gut is we should:

Just remove CreateStreamOptions and OpenStreamOptions (and their associated methods) entirely
Add a new OpenOptions for CompoundFile, with max_buffer_size and strict settings, leaving CompoundFile::open and CompoundFile::open_strict as convenience methods
Ideally, make cfb::OpenOptions be as similar as possible to std::fs::OpenOptions, which I guess would mean putting the open() method on cfb::OpenOptions rather than adding a CompoundFile::open_with_options method? Although maybe we'd want two methods, one that takes an F and a convenience one that takes an AsRef<Path>, not sure.

But I'm totally open to something else if you suggest otherwise.

francisdb · 2026-02-02T20:49:20Z

@mdsteele want me to split up to OpenOptions and CreateOptions or do you think it's ok to do open_options.create() and having the strict being ignored?

…tream options.

francisdb · 2026-02-03T14:37:10Z

Looks like this caused a regression on the read side. Adding a read benchmark.

francisdb · 2026-02-03T15:51:01Z

All done. Question on the single or split options remains.

mdsteele · 2026-02-03T21:10:16Z

@mdsteele want me to split up to OpenOptions and CreateOptions or do you think it's ok to do open_options.create() and having the strict being ignored?

Oh, hmm. I think effectively ignoring the strict setting on create() is probably fine; the crate always tries to be strict when creating files anyway. Maybe there could be other options we'd add in the future that would only make sense for one or the other? I guess if necessary we could just make it an error to call, say, open() when using an option that really only makes sense for create().

francisdb · 2026-02-03T21:23:17Z

Want me to squash everything?

mdsteele · 2026-02-04T15:46:20Z

Looks great, thanks

francisdb · 2026-02-10T14:03:07Z

@mdsteele mind releasing this?
I'm switching to other projects for now. Further optimizations will be for later. (tree balancing, tree node name caching, iteration improvements).

mdsteele · 2026-02-13T21:59:26Z

Sounds good, thanks. Published as v0.14.0.

francisdb mentioned this pull request Feb 2, 2026

Reading is very slow for larger files #57

Open

francisdb force-pushed the feat/auto_grow_buffer branch from 63617b2 to 8467e5e Compare February 2, 2026 20:58

francisdb added 3 commits February 3, 2026 09:58

feat: dynamic stream buffer

4bdfbdf

feat: improve benchmark

78c402a

api cleanup, tried to stay close to initial api before I introduced s…

d0357f5

…tream options.

francisdb force-pushed the feat/auto_grow_buffer branch from 8467e5e to d0357f5 Compare February 3, 2026 09:02

Aggressively grow buffer on the read side.

69a8c72

francisdb force-pushed the feat/auto_grow_buffer branch from 005e1f7 to 69a8c72 Compare February 3, 2026 17:03

bench: add a 10000x0B case

0ee2dae

mdsteele merged commit 317049d into mdsteele:master Feb 4, 2026
4 checks passed

francisdb deleted the feat/auto_grow_buffer branch February 4, 2026 17:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: dynamic stream buffer#79

feat: dynamic stream buffer#79
mdsteele merged 5 commits intomdsteele:masterfrom
francisdb:feat/auto_grow_buffer

francisdb commented Feb 2, 2026 •

edited

Loading

Uh oh!

francisdb commented Feb 2, 2026 •

edited

Loading

Uh oh!

mdsteele commented Feb 2, 2026

Uh oh!

francisdb commented Feb 2, 2026

Uh oh!

francisdb commented Feb 3, 2026 •

edited

Loading

Uh oh!

francisdb commented Feb 3, 2026

Uh oh!

mdsteele commented Feb 3, 2026

Uh oh!

francisdb commented Feb 3, 2026

Uh oh!

mdsteele commented Feb 4, 2026

Uh oh!

Uh oh!

francisdb commented Feb 10, 2026

Uh oh!

mdsteele commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

francisdb commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

francisdb commented Feb 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mdsteele commented Feb 2, 2026

Uh oh!

francisdb commented Feb 2, 2026

Uh oh!

francisdb commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

francisdb commented Feb 3, 2026

Uh oh!

mdsteele commented Feb 3, 2026

Uh oh!

francisdb commented Feb 3, 2026

Uh oh!

mdsteele commented Feb 4, 2026

Uh oh!

Uh oh!

francisdb commented Feb 10, 2026

Uh oh!

mdsteele commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

francisdb commented Feb 2, 2026 •

edited

Loading

francisdb commented Feb 2, 2026 •

edited

Loading

francisdb commented Feb 3, 2026 •

edited

Loading