CRM Lead Intelligence System — Setup Guide

Project Structure

crm-lead-intelligence/
├── leads_db.csv          # Dataset (simulated CRM data)
├── colab_training.py     # ML training pipeline (run in Google Colab)
├── model.pkl             # Trained XGBoost model (generated by Colab)
├── label_encoder.pkl     # Industry label encoder (generated by Colab)
├── main.py               # FastAPI backend
├── requirements.txt      # Python dependencies
└── README.md

Step 1 — Train the model in Google Colab

Open Google Colab
Upload leads_db.csv and colab_training.py
Run each cell in order
In the last cell, download model.pkl and label_encoder.pkl
Place both files in the same folder as main.py

Step 2 — Run the FastAPI backend locally

# Create a virtual environment (recommended)
python -m venv venv
source venv/bin/activate        # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Start the server
uvicorn main:app --reload --port 8000

Server runs at: http://localhost:8000

Interactive API docs: http://localhost:8000/docs

Step 3 — Test the endpoints

GET /leads (auto pipeline)

curl http://localhost:8000/leads

Returns all 100 leads from leads_db.csv, each scored by the ML model.

POST /predict (manual input)

curl -X POST http://localhost:8000/predict \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Jane Doe",
    "company": "TechCorp",
    "industry": "Technology",
    "num_calls": 7,
    "email_opens": 15,
    "website_visits": 40
  }'

Returns:

{
  "name": "Jane Doe",
  "company": "TechCorp",
  "industry": "Technology",
  "score": 0.9981,
  "category": "High",
  "insight": "High-potential lead from Technology. Immediate follow-up recommended."
}

GET /industries

curl http://localhost:8000/industries

API Response Format

Every scored lead returns:

Field	Type	Description
score	float	Conversion probability (0.0–1.0)
category	string	"High" / "Medium" / "Low"
insight	string	Human-readable AI recommendation

Score thresholds:

High → score > 0.80 (immediate follow-up)
Medium → score > 0.50 (nurture campaign)
Low → score ≤ 0.50 (monitor passively)

Supported Industries

Technology
Finance
Healthcare
Education
Manufacturing
Retail

Model Details

Algorithm: XGBoost (XGBClassifier)
Features: industry, num_calls, email_opens, website_visits
Training: 80 leads | Test: 20 leads | CV: 5-fold
Test accuracy: ~95% | ROC AUC: ~0.99
Trained in: Google Colab
Inference: FastAPI backend (loads model.pkl at startup)

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
__pycache__		__pycache__
README.md		README.md
colab_training.py		colab_training.py
gitignore		gitignore
index.html		index.html
label_encoder.pkl		label_encoder.pkl
leads_db.csv		leads_db.csv
main.py		main.py
main_deploy.py		main_deploy.py
model.pkl		model.pkl
render.yaml		render.yaml
requirements.txt		requirements.txt
runtime.txt		runtime.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CRM Lead Intelligence System — Setup Guide

Project Structure

Step 1 — Train the model in Google Colab

Step 2 — Run the FastAPI backend locally

Step 3 — Test the endpoints

GET /leads (auto pipeline)

POST /predict (manual input)

GET /industries

API Response Format

Supported Industries

Model Details

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CRM Lead Intelligence System — Setup Guide

Project Structure

Step 1 — Train the model in Google Colab

Step 2 — Run the FastAPI backend locally

Step 3 — Test the endpoints

GET /leads (auto pipeline)

POST /predict (manual input)

GET /industries

API Response Format

Supported Industries

Model Details

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages