ragpipe is a lightweight RAG ingestion pipeline toolkit.
It helps you go from documents → chunks → embeddings → vector store export.
- text chunking with overlap
- basic loaders (text files, URLs)
- embedding interface abstraction
- JSON export format
- optional FAISS dependency group
pip install ragpipeOptional FAISS:
pip install ragpipe[faiss]ragpipe ingest ./docs --out out.json
ragpipe ingest-url https://example.com --out out.jsonfrom ragpipe import chunk_text, ingest_text
chunks = chunk_text("hello world " * 200, chunk_size=200, overlap=50)
print(len(chunks))
data = ingest_text("hello world " * 200)
print(data.keys())MIT