SolyZak/acme-security-data-science

Acme Security Conversion Modeling (Portfolio)

Problem Statement

Acme Security is a fictional company operating across the Gulf and the wider Middle East. This project demonstrates a generic data science workflow that predicts whether a customer will convert, based on recent engagement signals. All data and logic are synthetic and safe for public use.

Project Scope (Separate from the Pipeline)

This repo focuses only on data science modeling and analysis. It does not include ETL, warehousing, or production pipeline code.

Dataset

Synthetic CSV with aggregated customer engagement metrics:

  • Geography (country, city)
  • Channel and device type
  • Recent activity (visits, events, recency)
  • Target label: converted (0/1)
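
To make the shape of such a table concrete, here is a minimal sketch that builds a comparable synthetic frame in memory. Every column name except converted, the category values, and the rule that generates the label are assumptions made for illustration; the actual field definitions live in docs/data_dictionary.md.

```python
import numpy as np
import pandas as pd


def make_synthetic_frame(n_rows: int = 500, seed: int = 0) -> pd.DataFrame:
    """Build a small synthetic engagement table.

    All column names other than `converted` are hypothetical stand-ins.
    """
    rng = np.random.default_rng(seed)

    # Geography: pick a country, then a city that belongs to it.
    cities = {"AE": ["Dubai", "Abu Dhabi"], "SA": ["Riyadh", "Jeddah"]}
    country = rng.choice(list(cities), size=n_rows)
    city = [str(rng.choice(cities[str(c)])) for c in country]

    # Channel and device type.
    channel = rng.choice(["organic", "paid", "email"], size=n_rows)
    device_type = rng.choice(["mobile", "desktop"], size=n_rows)

    # Recent activity: visits, events, and days since last activity.
    visits = rng.poisson(lam=4, size=n_rows)
    events = visits + rng.poisson(lam=2, size=n_rows)
    recency_days = rng.integers(0, 60, size=n_rows)

    # Invented rule: more visits and fresher activity raise conversion odds.
    logit = 0.3 * visits - 0.05 * recency_days - 1.0
    converted = rng.binomial(1, 1.0 / (1.0 + np.exp(-logit)))

    return pd.DataFrame(
        {
            "country": country,
            "city": city,
            "channel": channel,
            "device_type": device_type,
            "visits": visits,
            "events": events,
            "recency_days": recency_days,
            "converted": converted,
        }
    )


df = make_synthetic_frame()
print(df.head())
```

A fixed seed keeps the generated table identical between runs, which helps when comparing metrics.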

Data Dictionary

See docs/data_dictionary.md for field definitions.

Workflow

  • Load synthetic dataset
  • Run basic exploratory data analysis (optional notebook)
  • Train a baseline classification model
  • Evaluate with accuracy and F1 score
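
The steps above can be sketched roughly as follows. This is not the contents of src/train.py. It assumes logistic regression as the baseline, hypothetical feature column names, and a stand-in frame built in memory, because the README states neither the CSV path nor the model type.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Hypothetical feature columns; the real ones are in docs/data_dictionary.md.
CATEGORICAL = ["channel", "device_type"]
NUMERIC = ["visits", "recency_days"]


def train_baseline(df: pd.DataFrame, target: str = "converted", seed: int = 42):
    """Fit a baseline classifier and return it with accuracy and F1 on a held-out split."""
    X = df[CATEGORICAL + NUMERIC]
    y = df[target]
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=seed, stratify=y
    )
    preprocess = ColumnTransformer(
        [
            ("cat", OneHotEncoder(handle_unknown="ignore"), CATEGORICAL),
            ("num", StandardScaler(), NUMERIC),
        ]
    )
    model = Pipeline(
        [("prep", preprocess), ("clf", LogisticRegression(max_iter=1000))]
    )
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    metrics = {
        "accuracy": accuracy_score(y_test, pred),
        "f1": f1_score(y_test, pred),
    }
    return model, metrics


# Stand-in for loading the synthetic CSV (in the repo this would be a read_csv call).
rng = np.random.default_rng(0)
n = 400
df = pd.DataFrame(
    {
        "channel": rng.choice(["organic", "paid", "email"], size=n),
        "device_type": rng.choice(["mobile", "desktop"], size=n),
        "visits": rng.poisson(4, size=n),
        "recency_days": rng.integers(0, 60, size=n),
    }
)
# Invented labeling rule so the example has some signal to learn.
df["converted"] = (df["visits"] + rng.normal(0, 2, size=n) > 4).astype(int)

model, metrics = train_baseline(df)
print("Model metrics:")
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Wrapping preprocessing and the classifier in one Pipeline means the encoder and scaler are fit only on the training split, which avoids leaking test data into the features.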

Technologies Used

  • Python 3
  • pandas
  • scikit-learn

How to Run Locally

  1. Create and activate a virtual environment.
  2. Install dependencies: pip install -r requirements.txt
  3. Train the model: python src/train.py

Makefile Shortcuts

make install, make train

Example Output

The training script prints metrics like:

Model metrics:
accuracy: <value>
f1: <value>

Values depend on the synthetic data and split.
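
For readers unfamiliar with the two metrics, the sketch below computes them by hand from confusion counts and prints them in the same layout. The labels are a toy example, not output from this repository, and the helper name accuracy_and_f1 is hypothetical.

```python
def accuracy_and_f1(y_true, y_pred):
    """Compute accuracy and F1 for binary labels (1 = converted)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return accuracy, f1


# Toy labels: 3 true positives, 1 false positive, 1 false negative, 3 true negatives.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

accuracy, f1 = accuracy_and_f1(y_true, y_pred)
print("Model metrics:")
print(f"accuracy: {accuracy:.3f}")  # accuracy: 0.750
print(f"f1: {f1:.3f}")              # f1: 0.750
```

Accuracy counts every correct prediction equally, while F1 ignores true negatives, so the two can diverge sharply when few customers convert.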

Safety Note

Everything in this repository is fictional and synthetic. No real company names, schemas, data, or business logic are included.

Roadmap

  • Add cross-validation and model persistence
  • Add a simple feature importance report
  • Add automated data checks
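
As a rough sketch of what the first roadmap item could look like, the example below runs 5-fold cross-validation and saves the fitted model with joblib. The data, the logistic regression model, and the file name baseline_model.joblib are all assumptions for illustration; none of this exists in the repository yet.

```python
import os
import tempfile

import joblib
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Stand-in numeric features and labels (invented for this example).
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 3))
y = (X[:, 0] + 0.5 * rng.normal(size=300) > 0).astype(int)

model = LogisticRegression(max_iter=1000)

# Cross-validation: 5 stratified folds, scored with F1.
scores = cross_val_score(model, X, y, cv=5, scoring="f1")
print(f"f1 per fold: {np.round(scores, 3)}")
print(f"f1 mean: {scores.mean():.3f}")

# Persistence: fit on all data, save to disk, and reload.
model.fit(X, y)
path = os.path.join(tempfile.mkdtemp(), "baseline_model.joblib")  # hypothetical file name
joblib.dump(model, path)
restored = joblib.load(path)

# The reloaded model should reproduce the original predictions.
print((restored.predict(X) == model.predict(X)).all())
```

Reporting the per-fold scores alongside the mean gives a sense of how much the metric moves with the split, which a single train/test split cannot show.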

License

MIT

About

Synthetic conversion modeling baseline with scikit-learn and a clean, reproducible workflow.
