Skip to content

feat: support content-hash passage IDs#347

Open
raoabinav wants to merge 2 commits into
StarTrail-org:mainfrom
raoabinav:feat/content-hash-passage-ids
Open

feat: support content-hash passage IDs#347
raoabinav wants to merge 2 commits into
StarTrail-org:mainfrom
raoabinav:feat/content-hash-passage-ids

Conversation

@raoabinav
Copy link
Copy Markdown
Contributor

  • Adds opt-in --id-scheme content-hash, records the scheme in metadata, and keeps incremental updates on the existing index scheme.
  • Adds leann migrate-ids to rewrite passage artifacts plus FTS5 BM25 IDs, with focused migration tests.

Add an opt-in content-hash passage ID scheme for stable chunk IDs across rebuilds and file moves.

Persist the scheme in metadata, preserve it during incremental updates, and add a migration command that rewrites passage artifacts plus FTS5 BM25 IDs.

Cover builder ID generation and migration artifact consistency with focused tests.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant