This repository was archived by the owner on Feb 15, 2026. It is now read-only.
🎯 Problem
Indexing currently reads only .md and .rst source files for simplicity, discarding valuable information in .pdf, .csv, and other file types.
💡 Proposed Solution
To give the indexer a unified internal interface, convert every file to .md with docling-project/docling before indexing, retaining each file's metadata. This makes it straightforward to support more file types later, including code files.
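A minimal sketch of what such a conversion step might look like, assuming docling's `DocumentConverter` is used as the backend. The names `IndexableDocument`, `load_for_indexing`, and the metadata keys are hypothetical and not taken from this repository; passing .md and .rst files through unchanged is likewise an assumption.

```python
from dataclasses import dataclass, field
from pathlib import Path
from typing import Callable

# File types the indexer already reads directly, per the problem statement.
PASSTHROUGH_SUFFIXES = {".md", ".rst"}


@dataclass
class IndexableDocument:
    """Hypothetical unified record handed to the indexer."""
    text: str
    metadata: dict = field(default_factory=dict)


def docling_to_markdown(path: Path) -> str:
    """Convert one file to Markdown with docling.

    docling is imported lazily so the rest of the module works without it.
    """
    from docling.document_converter import DocumentConverter

    result = DocumentConverter().convert(str(path))
    return result.document.export_to_markdown()


def load_for_indexing(
    path,
    convert: Callable[[Path], str] = docling_to_markdown,
) -> IndexableDocument:
    """Return text plus source metadata for any supported file."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix in PASSTHROUGH_SUFFIXES:
        # Assumption: already-supported formats are indexed unchanged.
        text = path.read_text(encoding="utf-8")
        converted = False
    else:
        text = convert(path)
        converted = True
    # Hypothetical metadata keys; retained so the original file stays traceable.
    metadata = {
        "source_path": str(path),
        "source_type": suffix.lstrip("."),
        "converted": converted,
    }
    return IndexableDocument(text=text, metadata=metadata)
```

Keeping the converter as an injectable `convert` callable means the indexer depends only on the `IndexableDocument` shape, so a different backend could be swapped in without touching the indexing code.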
🤔 Alternatives Considered
Lower-level approaches such as pdfplumber were considered but judged too complex, given that integrated solutions such as docling already exist.