Skip to content
View Leomgama's full-sized avatar
๐Ÿš€
๐Ÿš€

Block or report Leomgama

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Leomgama/README.md

Hi, I'm Leonardo Gama ๐Ÿ‘‹

Data Analyst | Business Intelligence | SQL โ€ข Python โ€ข Power BI โ€ข Machine Learning

SQL Power BI Python Machine Learning R Excel Google Data Analytics


๐Ÿ‘ค About Me

Data Analyst and Business Intelligence professional with a BBA in Business Management and a strong foundation in finance, business analytics, and machine learning.

I deliver end-to-end analytics solutions using SQL, Python, Power BI, and Scikit-learn, with experience across EDA, ETL, KPI reporting, dashboard development, customer segmentation, churn prediction, geospatial analysis, and time series forecasting.

My work combines technical analytics skills with business and financial thinking to translate complex data into clear, actionable insights.

I apply CRISP-DM methodology across all projects and document everything here on GitHub.


๐Ÿ› ๏ธ Tech Stack

Languages

  • SQL โ€ข Python โ€ข DAX โ€ข M (Power Query) โ€ข R

Python Libraries & ML

  • Pandas โ€ข NumPy โ€ข Matplotlib โ€ข Seaborn
  • Scikit-learn (Random Forest, KMeans, Decision Tree)
  • Folium โ€ข Altair โ€ข Streamlit
  • OpenRouteService API

BI & Tools

  • Power BI โ€ข Power Query โ€ข SQL Server (SSMS)
  • Jupyter Notebook โ€ข VS Code
  • Git & GitHub

Core Skills

  • EDA โ€ข ETL โ€ข Churn Prediction โ€ข Customer Segmentation
  • Geospatial Analysis โ€ข Pareto Analysis
  • Time Series Forecasting (ARIMA) โ€ข Business Storytelling

๐Ÿ“‚ Featured Projects

๐Ÿ”ฎ Customer Churn Prediction โ€” B2B Supply Distributor

Tools: Python | Pandas | Scikit-learn (Random Forest) | Matplotlib | Seaborn

  • Analyzed 11,233 sales records and 544 customer profiles from a fictional South Carolina distributor to predict which customers were at risk of churning
  • Trained a Random Forest Classifier achieving 100% accuracy on a 109-sample test set, scoring all active 2025 customers by churn risk level
  • Identified days since last purchase as the strongest predictor (~20% feature importance) and found month-to-month customers churn at 2x the rate of annual contracts
  • Delivered 5 targeted retention recommendations including an early-warning flag system for accounts silent 60+ days

๐Ÿ”— View Repository


๐Ÿ—บ๏ธ Customer Geographic Segmentation & Classification

Tools: Python | Scikit-learn (KMeans, Decision Tree) | Pandas | Folium | Matplotlib

  • Applied KMeans clustering to geographically segment 1,000 customers into 5 commercial divisions based on latitude & longitude, validated using the Elbow Method
  • Trained a Decision Tree Classifier on the cluster output achieving 100% accuracy on 200 test samples โ€” enabling fully automated real-time classification of new customers
  • Visualized all customer divisions on interactive maps using Folium

๐Ÿ”— View Repository


๐Ÿ“ Route Change Impact Analysis: Sorocaba vs. Campinas

Tools: Python | Pandas | OpenRouteService API | Folium | Jupyter Notebook

  • Built a route simulation engine integrating the OpenRouteService API to calculate real driving distances and compare delivery scenarios across 6 days
  • Quantified that an emergency branch change increased total weekly route distance from 5,522 km to 10,250 km โ€” an increase of +4,728 km (+85.6%)
  • Visualized both route scenarios on interactive maps using Folium and delivered business-focused recommendations on inventory planning and contingency routing

๐Ÿ”— View Repository


๐Ÿ“‰ BF LUBS Sales Volume Decline Analysis

Tools: Python | Pandas | NumPy | Matplotlib | Seaborn | Jupyter Notebook

  • Analyzed 601,836 transactions (2018โ€“2024) to diagnose a business paradox where revenue grew while sales volume and active customer count both declined
  • Applied IQR outlier treatment calibrated against operational context and Pareto analysis to identify that one region-market-segment combination drove 28.95% of total volume loss
  • Delivered targeted commercial recovery recommendations prioritized by business impact

๐Ÿ”— View Repository


๐Ÿ“ˆ S&P 500 Financial Performance Analysis (2020โ€“2021)

Tools: SQL Server | Power BI (DAX) | R (ARIMA Forecasting)

  • Analyzed ~500 S&P 500 companies post-pandemic using SQL Server, 13 custom DAX measures, and an ARIMA forecast model in R integrated directly inside Power BI
  • Identified total revenue doubling from $5.6T to $13.1T with net income margin improving from 8% to 11% across 6 interactive dashboards
  • Found no direct correlation between company size and profitability โ€” business model drives margins more than revenue scale

๐Ÿ”— View Repository


๐Ÿ“Š U.S. Sales Performance Analysis โ€” AdventureWorks

Tools: SQL Server | Power BI | Power Query | DAX

  • Built a snowflake schema data model with 20 custom DAX measures and 3 interactive dashboards with collapsible navigation menus and custom tooltip pages
  • Identified $9.39M in revenue, $3.91M in profit, and 41.5% average margin across 5 U.S. regions
  • Pareto analysis confirmed top 27 SKUs generated ~80% of total revenue; Accessories (~62.6%) and Mountain Bikes (~45.4%) were top profitability drivers

๐Ÿ”— View Repository


๐Ÿ“š Certifications

  • โœ… Google Data Analytics Professional Certificate
  • โœ… SQL for Data Analysis
  • โœ… Power BI & Data Analytics
  • โœ… Python Fundamentals

๐Ÿ”— Connect With Me

Pinned Loading

  1. bf-lubs-sales-volume-decline-analysis bf-lubs-sales-volume-decline-analysis Public

    Python sales decline analysis uncovering volume loss drivers across customers, products, markets, and regions.

    Jupyter Notebook

  2. sp500-financial-performance-2020-2021 sp500-financial-performance-2020-2021 Public

    S&P 500 financial performance analysis using SQL, Power BI, DAX, and R forecasting.

  3. RouteAnalysis RouteAnalysis Public

    Route-based logistics analysis comparing emergency delivery execution from Campinas vs. Sorocaba using Python and OpenRouteService.

    Jupyter Notebook

  4. StreamlitProject StreamlitProject Public

    Interactive dashboard application built with Streamlit, Python, and data visualization libraries.

    Python

  5. US-Sales-Performance-AdventureWorks US-Sales-Performance-AdventureWorks Public

    U.S. sales analysis in SQL and Power BI with insights on revenue, profit, products, and regions.