Customer Segmentation Analysis using Python and Power BI
This project focuses on customer segmentation using data analysis and machine learning techniques. The goal is to identify different groups of customers based on their income and spending behavior to help businesses make better marketing decisions.
The dataset contains the following features:
- CustomerID
- Gender
- Age
- Annual Income
- Spending Score
- Spending Level
- Checked for missing values
- Verified data types
- Removed inconsistencies
- Prepared data for analysis
- Gender distribution analysis
- Age distribution visualization
- Income vs Spending analysis
- Spending level distribution
- Average spending by gender
K-Means clustering was applied using:
- Annual Income
- Spending Score
The Elbow Method was used to determine the optimal number of clusters (5).
The customers were divided into 5 segments:
- High Income – High Spending (Premium Customers)
- High Income – Low Spending (Target Customers)
- Low Income – High Spending (Potential Risk / Impulsive Buyers)
- Low Income – Low Spending (Low Value Customers)
- Medium Income – Medium Spending (Regular Customers)
These insights can help businesses:
- Target the right customers
- Improve marketing strategies
- Increase revenue
- Python
- Pandas
- Matplotlib
- Scikit-learn
- Google Colab / Jupyter Notebook
- Customer_Segmentation.ipynb
- Customer_Segmentation_Final.csv
- Cluster_Summary.csv
- dataset.csv
- README.md
Shawana Hakim
Aspiring Data Analyst
