Repository containing portfolio of data science projects completed for academic, self learning, and professional purposes. Presented in the form of Jupyter Notebooks.
Tools
- Python: NumPy, Pandas, Seaborn, Matplotlib
- Machine Learning: scikit-learn, TensorFlow, keras
-
- Convolution Neural Network - Digit Recognizer: Convolutional Neural Network that learns to recognize sequences of digits using data generated by concatenating images from MNIST (Recognizes a digit based on an image).
- K-Means Clustering - Cluster US College Data: Using K-NN on college information
- Principal Component Analysis - Brain Tumor Genes: Using a PCA analysis to outline and reduce dimensionality of the brain genes dataset.
- Machine Learning Regression - Advertise Ad Clicking: Using a simple logistic regression to predict whether user will click on ad.
- Decision Tree and Rain Forest - Loan Data: Evaluating the likelihood of individual paying off the loan
-
- DICOM CT format Images processing: Processing CT images of .dcm format and applying mask to identify lung cancer nodules
- Stock Market Analysis of Tech Stocks: Analysis of technology stocks including change in price over time, daily returns.
- Exploratory Data Analysis - Geographical Plotting on Election and Power Consumption data: Simple analysis of election and power consumption data on geographical regions.
- Exploratory Data Analysis - TCGA Cancer Imaging Archive: Analysis of patients with lung cancer based on 70+ variables including continous and categorical data.
- Visualization - TCGA Cancer Imaging Archive: The visualization of complete data based on insights gaines during Exploratory Data Analysis - TCGA Cancer Imaging Archive
- 911 Calls - Capstone project: Analysis and insights from features based on 911 calls dataset