This project transforms raw Amazon Seller data into an AI-powered analytics assistant that enables fast, data-driven decision making.
- 💬 Ask questions in natural language (no SQL required)
- 📊 Retrieve insights directly from structured business data
- ⚡ Combine database queries with AI-generated explanations
- 🧠 Support operational and strategic decisions
👉 Goal: Make complex e-commerce analytics accessible through a chatbot interface
This system is designed for:
- 🛒 E-commerce Managers → analyze sales, performance, and trends
- 📊 Data Analysts → speed up data exploration
- 💰 Business Teams → access insights without technical knowledge
- “Which products generated the highest revenue last month?”
- “Show sales trends by country”
- “Which categories are declining?”
- “Top performing ASINs in the last 30 days”
👉 The system converts these into SQL + contextual answers automatically.
This project implements a Retrieval-Augmented Generation (RAG) pipeline combined with SQL-based querying.
It integrates:
- Structured database queries (SQLite)
- Vector search (FAISS)
- Natural language understanding (LLM)
- Business data sources (Amazon Seller data)
👉 Result: Hybrid AI system combining symbolic + semantic retrieval
- Load CSV data sources
- Convert to structured tables (SQLite)
- Create document chunks
- Generate embeddings (Sentence Transformers)
- Store vectors in FAISS index
- User query (natural language)
- Intent detection (SQL vs semantic search)
- Retrieve relevant context
- Execute SQL query (if needed)
- Combine results with LLM
- Generate final answer
- 💬 Natural language query interface
- 🧠 RAG-based context retrieval
- 🗄️ SQL database integration (SQLite)
- 🔎 Vector search with FAISS
- 📊 Structured + unstructured data support
- ⚡ Hybrid reasoning (SQL + LLM)
- 🧩 Modular pipeline (easy to extend)
The system uses multiple Amazon datasets:
- 📁 Orders (time series revenue data)
- 📊 Business Reports
- 📢 Advertising reports
- 🔍 Search term data
- Python
- Streamlit
- SQLite
- FAISS
- Sentence Transformers
- Pandas / NumPy
- Local LLM (via Ollama or API)
-
User asks: → “Show revenue trend for last 3 months”
-
System:
- detects analytical intent
- generates SQL query
- retrieves data
- enriches with LLM explanation
-
Output:
- structured data result
- human-readable insight
git clone https://github.com/Mst-KrgZ/amazon-analytics-chatbot.git
cd amazon-analytics-chatbot
pip install -r requirements.txt
streamlit run app.py- Python 3.11
- Streamlit
- Pandas / NumPy
- FAISS
- Sentence Transformers
- SQLite
- LLM backend (Ollama / API)
Mesut Karagöz Data Scientist
🔗 GitHub: https://github.com/Mst-KrgZ 🔗 LinkedIn: https://www.linkedin.com/in/mesut-karagöz-181733260/
This project goes beyond a chatbot — it is a data-driven decision support system powered by AI and structured analytics.