Skip to content

chore: release 0.4.1#11

Merged
maish merged 2 commits into
mainfrom
release-0.4.1
Jun 8, 2026
Merged

chore: release 0.4.1#11
maish merged 2 commits into
mainfrom
release-0.4.1

Conversation

@maish

@maish maish commented Jun 8, 2026

Copy link
Copy Markdown
Collaborator

Patch release: fix spanning body cells duplicated across columns on multi-page merge.

Fix

Docling represents a col_span=N cell by repeating its text across every column it covers. The merge round-trip (grid → DataFrame → grid) flattened those into N separate col_span=1 cells — duplicating a full-width description into every value column and displacing the real rate/cap values. A repeated multi-row header (col_span=6 banner) flowing in as a body row behaved the same way.

_dataframe_to_docling_data now matches each merged row back to its source grid row (across all member fragments) and re-emits the original spans. Rows the merger genuinely transformed (stitched continuations, folded overflow) fall back to the previous flat 1x1 rebuild. The match keys on original span metadata, never value equality, so coincidentally-equal adjacent values (two plan columns sharing a cap) stay separate cells.

No changes to the extractor or the merger.

Tests

  • 106 passing (2 new regression tests: span preservation + the no-fuse guard for coincidental equals).
  • Verified against a real multi-page insurance-benefit document: col_span=7 descriptions and col_span=6 repeated headers now preserved; rate/cap land in the correct plan columns.

🤖 Generated with Claude Code

maish and others added 2 commits June 8, 2026 21:45
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@maish maish merged commit 62d57fb into main Jun 8, 2026
7 checks passed
@maish maish deleted the release-0.4.1 branch June 8, 2026 13:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant