- Saharsh Misra - 2022A7PS0074H
- Aarav Haran - 2022B3A70880H
- Vaishnu Kanna - 2022B3A71608H
- Aditya Pentyala - 2022B3A70522H
- Ensure you have Python>=3.8 on your system
- Clone this repository onto your local system
- Download the dataset from the provided google drive link as part of the assignment and place it in the root directory of your clone (SKIP IF REPO DOWNLOADED WITH THE DATA DIRECTORY INTACT)
- Create a new virtual environment with the command
python -m venv env - Activate the environment:
source env/bin/activateon Mac/Linuxenv/Scripts/activate.ps1on Windows Powershellenv/Scripts/activate.baton Windows cmd
- Once inside the environment, install the requirements using
pip install -r requirements.txt - Run the notebook eda.ipynb to populate the data/ directory (you may have to create an empty directory at first) (SKIP IF REPO DOWNLOADED WITH THE DATA DIRECTORY INTACT)
- Run the command
streamlit run dashboard.py
- Data Preprocessing and Exploratory Data Analysis (EDA) (5 Marks)
- Done in
eda.ipynband through plots indashboard.py
- Done in
- Feature Engineering and Representation (5 Marks)
- Done in
eda.ipynb
- Done in
- Classification Models (5 Marks)
- Done in
models/
- Done in
- Hyperparameter Tuning and Model Optimization, Complexity, Generalization, and Interpretation (5 Marks)
- Done in
model_tests.ipynband saved tosaved_models/ - Hyperparameter tuning done through GridSearchCV (sparse search space due to computational constraints)
- Complexity & generalization shown through outputs within notebook and dashboard
- Interpretation given in
INTERPRETATION.md
- Done in
- Overall ML Pipeline and Automation (5 Marks)
eda.ipynb(ETL Pipeline) ->model_tests.ipynb(train models) ->dashboard.py(interpret & visualize)
- Visualization and Video Reporting (5 Marks)
- Done in
dashboard.pyandTeam07_Assignment2_Video.mp4
- Done in
- Code Demo and Viva (10 Marks - Individual Assessment)
Classify persons into being high or low risk for having an emergency room admission.
- HIGH_EMERGENCY_RISK: Positive (1) if >2 emergency room encounters logged, else Negative (0)
- Age
- Medications
- Conditions
- Allergies
- Income
- Healthcare Coverage
- Procedure Cost
- Decision Tree
- Neural Network (MLP)
- Support Vector Machine
- Precision
- Accuracy
- Recall
- F1