Kubenew/ragpipe

ragpipe-lite

ragpipe is a lightweight toolkit for building retrieval-augmented generation (RAG) ingestion pipelines.

It helps you go from documents → chunks → embeddings → vector store export.
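The last two stages of that flow (chunks → embeddings → export) can be sketched in plain Python. This is only an illustration and not ragpipe's actual code: the `Embedder` protocol, the `HashEmbedder` toy class, the `export_records` helper, and the JSON field names (`records`, `id`, `text`, `embedding`) are all assumptions made for this example.

```python
import hashlib
import json
from typing import Protocol, Sequence


class Embedder(Protocol):
    """Hypothetical interface: any object with a matching embed() method qualifies."""

    def embed(self, texts: Sequence[str]) -> list[list[float]]: ...


class HashEmbedder:
    """Toy stand-in for a real embedding model (assumption, not part of ragpipe).

    Derives a deterministic vector from the SHA-256 digest of each text,
    so dim must be at most 32 (the digest length in bytes).
    """

    def __init__(self, dim: int = 8) -> None:
        if not 1 <= dim <= 32:
            raise ValueError("dim must be between 1 and 32")
        self.dim = dim

    def embed(self, texts: Sequence[str]) -> list[list[float]]:
        vectors = []
        for text in texts:
            digest = hashlib.sha256(text.encode("utf-8")).digest()
            # Scale each byte into the range [0, 1].
            vectors.append([byte / 255.0 for byte in digest[: self.dim]])
        return vectors


def export_records(chunks: Sequence[str], embedder: Embedder) -> str:
    """Embed each chunk and serialize the result as a JSON string."""
    vectors = embedder.embed(chunks)
    records = [
        {"id": i, "text": chunk, "embedding": vector}
        for i, (chunk, vector) in enumerate(zip(chunks, vectors))
    ]
    return json.dumps({"records": records}, indent=2)


if __name__ == "__main__":
    print(export_records(["first chunk", "second chunk"], HashEmbedder(dim=4)))
```

Because `export_records` depends only on the protocol, a real model wrapper could presumably replace `HashEmbedder` without changing the export step.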

Features (v0.1.0)

  • text chunking with overlap
  • basic loaders (text files, URLs)
  • embedding interface abstraction
  • JSON export format
  • optional FAISS dependency group
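To make "chunking with overlap" concrete, here is a minimal character-based sketch. It is not necessarily how ragpipe's `chunk_text` works internally; the function name `sliding_chunks` is hypothetical, and the parameter names simply mirror the `chunk_size` and `overlap` arguments used in the Python usage example.

```python
def sliding_chunks(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into windows of up to chunk_size characters.

    Each window starts (chunk_size - overlap) characters after the previous
    one, so consecutive windows share `overlap` characters.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + chunk_size])
        # Stop once this window reaches the end of the text, so the final
        # chunk is never a pure duplicate of the previous chunk's tail.
        if start + chunk_size >= len(text):
            break
        start += step
    return chunks


if __name__ == "__main__":
    pieces = sliding_chunks("hello world " * 200, chunk_size=200, overlap=50)
    print(len(pieces))
```

The overlap keeps some shared context on both sides of each boundary, at the cost of storing and embedding some text twice.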

Install

pip install ragpipe

Optional FAISS support (the quotes keep shells such as zsh from treating the brackets as a glob pattern):

pip install "ragpipe[faiss]"

CLI usage

ragpipe ingest ./docs --out out.json
ragpipe ingest-url https://example.com --out out.json

Python usage

from ragpipe import chunk_text, ingest_text

chunks = chunk_text("hello world " * 200, chunk_size=200, overlap=50)
print(len(chunks))

data = ingest_text("hello world " * 200)
print(data.keys())

License

MIT
