SujaanBhalla/SCT_DS_2

SCT_DS_2

Task 02 – Data Cleaning and Exploratory Data Analysis (Titanic Dataset)

SCT_DS_2 – Data Cleaning & Exploratory Data Analysis

Task Objective

The objective of this task is to perform data cleaning and exploratory data analysis (EDA) on the Titanic dataset to understand patterns and relationships in the data.

Dataset

  • Dataset Name: Titanic Dataset
  • Source: Kaggle (Titanic Competition)
  • File Used: train.csv

Work Performed

  • Loaded the dataset using Pandas
  • Checked dataset structure, missing values, and data types
  • Handled missing values in Age and Embarked columns
  • Dropped the Cabin column due to excessive missing data
  • Performed exploratory data analysis using visualizations
  • Analyzed survival patterns based on gender, passenger class, and age
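The cleaning steps above can be sketched roughly as follows. This is a minimal illustration, not the exact notebook code: the `clean_titanic` helper is a hypothetical name, and the choice of median for Age and most frequent value for Embarked is an assumption, since the fill strategy is not stated here. The small `sample` frame is made-up data that stands in for `train.csv`.

```python
import pandas as pd


def clean_titanic(df: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of a Titanic-style DataFrame (hypothetical helper)."""
    cleaned = df.copy()
    # Assumption: fill missing ages with the median age.
    cleaned["Age"] = cleaned["Age"].fillna(cleaned["Age"].median())
    # Assumption: fill missing ports with the most frequent port.
    cleaned["Embarked"] = cleaned["Embarked"].fillna(cleaned["Embarked"].mode()[0])
    # Cabin is mostly empty, so drop the column entirely.
    cleaned = cleaned.drop(columns=["Cabin"])
    return cleaned


# Made-up rows for illustration only; in practice: df = pd.read_csv("train.csv")
sample = pd.DataFrame(
    {
        "Survived": [0, 1, 1, 0],
        "Age": [22.0, None, 26.0, 35.0],
        "Embarked": ["S", "C", None, "S"],
        "Cabin": [None, "C85", None, None],
    }
)

cleaned = clean_titanic(sample)
# Count of missing values per column after cleaning
print(cleaned.isnull().sum())
```

Working on a copy keeps the raw data available for comparison, which is handy when checking how many values were filled in.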

Tools & Technologies

  • Python
  • Pandas
  • Matplotlib
  • Seaborn
  • Google Colab

Key Observations

  • Females had a higher survival rate than males
  • Passengers in 1st class had better survival chances than those in 2nd and 3rd class
  • Children were more likely to survive than adults
  • Most passengers were young adults
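Observations like these can be checked with a grouped mean of the `Survived` column. The sketch below is one possible way to do it, assuming the standard Kaggle column names; `survival_rate_by` is a hypothetical helper, the age bins are an arbitrary choice, and the `sample` rows are made-up toy data rather than real results from the dataset.

```python
import pandas as pd


def survival_rate_by(df: pd.DataFrame, column: str) -> pd.Series:
    """Mean of Survived (0/1) per group, i.e. the survival rate (hypothetical helper)."""
    return df.groupby(column, observed=True)["Survived"].mean()


# Made-up rows for illustration only; not taken from train.csv
sample = pd.DataFrame(
    {
        "Survived": [1, 0, 1, 1, 0, 0],
        "Sex": ["female", "male", "female", "female", "male", "male"],
        "Pclass": [1, 3, 2, 1, 3, 1],
        "Age": [29.0, 40.0, 8.0, 35.0, 22.0, 54.0],
    }
)

# Assumption: simple age bands; the cut-offs are illustrative, not from the analysis
sample["AgeGroup"] = pd.cut(
    sample["Age"],
    bins=[0, 12, 18, 60, 120],
    labels=["Child", "Teen", "Adult", "Senior"],
)

by_sex = survival_rate_by(sample, "Sex")
by_class = survival_rate_by(sample, "Pclass")
by_age_group = survival_rate_by(sample, "AgeGroup")

print(by_sex)
print(by_class)
print(by_age_group)
```

Because `Survived` is coded as 0 or 1, its mean within each group is the share of passengers in that group who survived.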

Learning Outcome

This task helped me understand the importance of data cleaning and how exploratory data analysis helps in identifying trends and patterns within real-world datasets.
