You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A cloud-native data pipeline and visualization project analyzing Formula 1 racing data using Azure, Databricks, Delta Lake, Tableau, and Python for insightful EDA and interactive dashboards.
End-to-end data pipeline transforming Olist e-commerce data through Azure cloud services. Implements medallion architecture (Bronze-Silver-Gold) with multi-source ingestion, Spark-based processing, and OLTP-to-OLAP optimization for analytics-ready datasets.
In this project, I've created an end-to-end ETL pipeline and subsequently developed a machine learning model to predict the price of Amazon products based on several product-related features.
This project demonstrates a complete ETL pipeline for Formula 1 racing data using Azure Databricks, Delta Lake, and Azure Data Factory. It covers data ingestion, transformation with PySpark and Spark SQL, data governance with Unity Catalog, and visualization through Power BI. Designed to showcase real-world data engineering workflows in Azure.
End-to-end Cloud ETL workflow for a diabetes dataset: ingest raw data, validate schema, clean missing/outlier values, transform features, and export analysis-ready tables for reporting and EDA.
This Project involves building an e-commerce order data warehouse on Azure Synapse Analytic, leveraging the power of Azure Data Lake Storage Gen2, Synapse Pipelines, Data Flows, and Serverless SQL Pools.
This azure databricks project implements a modern data engineering pipeline on Azure using the Medallion Architecture (Bronze → Silver → Gold) to process NYC taxi trip data.
A real-world, end-to-end Azure Data Engineering pipeline built on CRM sales data — covering ingestion, transformation, security, analytics, and visualisation using industry-standard Azure services.