Clinical Risk Flagging — Reducing False Negatives with Logistic Regression

Executive Summary

This project builds a clinical risk-flagging model that predicts whether a diagnostic test result should be flagged for manual review. The objective is to identify high-risk cases early, with as few false negatives as possible, using an interpretable model (logistic regression) whose signals can support clinical review.

1. Business Problem

In clinical laboratories, the worst possible failure is a false negative: a high-risk test result that is not flagged for manual review.

Missing such a case can lead to patient harm, legal exposure, and regulatory consequences.

Objective: Build a model that flags test results for manual review, prioritizing high recall to minimize false negatives, even at the cost of increased manual workload.

2. Dataset

Source: UCI Machine Learning Repository — Breast Cancer Wisconsin (Diagnostic) dataset.

Description: Each row represents a diagnostic test, with features computed from a digitized image of cell nuclei. The dataset contains 569 rows and 30 numeric features describing shape, texture, and structural properties.

Target variable: flag_for_review
1 = requires manual review
0 = does not require review

3. Approach

Problem framing: Binary classification.

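Why accuracy was rejected: Accuracy weights every error equally, so it can look high while the model still misses high-risk cases. Here a false negative is far more costly than a false positive, which makes accuracy a misleading target.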

Primary metric: Recall on the positive class (flag_for_review = 1).
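
To make the point about accuracy concrete, here is a minimal sketch on hypothetical toy data (not the project's data): a model that never flags anything still scores 95% accuracy while catching none of the high-risk cases. The helper names `accuracy` and `recall` are illustrative only.

```python
# Hypothetical toy data (not from the project): 95 low-risk and 5 high-risk results.
y_true = [0] * 95 + [1] * 5
# A useless model that never flags anything for review.
y_pred = [0] * 100


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def recall(y_true, y_pred, positive=1):
    """Fraction of truly positive cases that were predicted positive."""
    true_pos = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    actual_pos = sum(t == positive for t in y_true)
    return true_pos / actual_pos if actual_pos else 0.0


toy_accuracy = accuracy(y_true, y_pred)
toy_recall = recall(y_true, y_pred)
print(f"accuracy={toy_accuracy:.2f}, recall={toy_recall:.2f}")
# accuracy=0.95, recall=0.00
```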

4. Modeling

Baseline model: Logistic Regression.

Preprocessing:

Feature scaling (StandardScaler), because the features are on very different scales
Class imbalance handled with class_weight='balanced' on the classifier
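
The setup described above might look roughly like the following sketch. Several details are assumptions, not taken from the notebook: the data is loaded through scikit-learn's bundled copy of the dataset, malignant cases are mapped to flag_for_review = 1, and the split is a stratified 80/20 split with an arbitrary `random_state`. `max_iter=1000` is also an illustrative choice.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)

# ASSUMPTION: scikit-learn encodes malignant as 0 and benign as 1.
# This sketch maps malignant to flag_for_review = 1; the README does not
# state which mapping the notebook uses.
flag_for_review = (y == 0).astype(int)

# ASSUMPTION: stratified 80/20 split with an arbitrary seed.
X_train, X_test, y_train, y_test = train_test_split(
    X, flag_for_review, test_size=0.2, stratify=flag_for_review, random_state=42
)

# Scaling lives inside the pipeline so it is fit on training data only.
model = make_pipeline(
    StandardScaler(),
    LogisticRegression(class_weight="balanced", max_iter=1000),
)
model.fit(X_train, y_train)
```

Keeping the scaler and classifier in one pipeline means the same object can be reused for cross-validation without leaking test-fold statistics into the scaling step.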

Threshold tuning: The default probability threshold (0.5) was reduced to 0.3 to increase sensitivity to high-risk cases.
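
As a sketch of what lowering the threshold does, the snippet below applies two cut-offs to a handful of hypothetical probabilities. The function name `apply_threshold`, the example values, and the use of `>=` rather than `>` are assumptions for illustration. With scikit-learn, the probabilities would typically come from `predict_proba(X)[:, 1]`.

```python
def apply_threshold(probabilities, threshold=0.3):
    """Return 1 where the positive-class probability meets the threshold, else 0."""
    return [1 if p >= threshold else 0 for p in probabilities]


# Hypothetical predicted probabilities of "requires manual review".
example_probs = [0.05, 0.28, 0.35, 0.48, 0.62, 0.91]

default_flags = apply_threshold(example_probs, threshold=0.5)
tuned_flags = apply_threshold(example_probs, threshold=0.3)

print(default_flags)  # [0, 0, 0, 0, 1, 1]
print(tuned_flags)    # [0, 0, 1, 1, 1, 1]
```

The two borderline cases (0.35 and 0.48) are only flagged at the lower threshold, which is the trade described here: more reviews in exchange for fewer missed cases.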

5. Results

Baseline (threshold = 0.5):

Confusion matrix (rows = actual, columns = predicted): [[41 1] [4 68]]

False negatives: 4

After threshold tuning (threshold = 0.3):

Confusion matrix (rows = actual, columns = predicted): [[41 1] [0 72]]

False negatives: 0
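
The two confusion matrices above can be turned into recall, precision, accuracy, and review workload with a few lines of arithmetic. This sketch assumes the matrices follow the [[TN FP] [FN TP]] layout, which is consistent with the reported false-negative counts. The helper name `summarize` is illustrative.

```python
def summarize(matrix):
    """Derive metrics from a 2x2 confusion matrix laid out as [[TN, FP], [FN, TP]]."""
    (tn, fp), (fn, tp) = matrix
    total = tn + fp + fn + tp
    return {
        "recall": tp / (tp + fn),
        "precision": tp / (tp + fp),
        "accuracy": (tp + tn) / total,
        "flagged": tp + fp,  # results sent to manual review
    }


baseline = summarize([[41, 1], [4, 68]])  # threshold = 0.5
tuned = summarize([[41, 1], [0, 72]])     # threshold = 0.3

for name, m in [("baseline", baseline), ("tuned", tuned)]:
    print(
        f"{name}: recall={m['recall']:.3f} precision={m['precision']:.3f} "
        f"accuracy={m['accuracy']:.3f} flagged={m['flagged']}"
    )
# baseline: recall=0.944 precision=0.986 accuracy=0.956 flagged=69
# tuned: recall=1.000 precision=0.986 accuracy=0.991 flagged=73
```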

6. Model Stability — Cross Validation

5-fold cross-validation recall scores: [0.9577, 0.9859, 0.9861, 0.9722, 1.0000]

Mean recall ≈ 0.98
Scores range from 0.958 to 1.000 across folds, so performance is stable rather than dependent on one favorable split.
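
The summary figures can be checked directly from the five reported scores. The sketch below uses only the standard library and reports the population standard deviation. How the scores were produced is not shown in this README; with scikit-learn, one common route is `cross_val_score(model, X, y, cv=5, scoring="recall")`, but that is an assumption about the notebook.

```python
from statistics import mean, pstdev

# Recall scores as reported for the 5 folds.
recall_scores = [0.9577, 0.9859, 0.9861, 0.9722, 1.0000]

mean_recall = mean(recall_scores)
std_recall = pstdev(recall_scores)  # population standard deviation across folds

print(f"mean={mean_recall:.4f} std={std_recall:.4f} min={min(recall_scores):.4f}")
# mean=0.9804 std=0.0143 min=0.9577
```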

7. Business Interpretation

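The model reliably flags high-risk cases for review. On the test set, lowering the threshold to 0.3 raised the number of flagged results from 69 to 73 while eliminating all 4 false negatives, so manual workload increases slightly in exchange for safety. Clinicians retain final decision authority.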

8. Recommendation

Deploy as a decision-support tool, not an automated decision-maker. Use threshold = 0.3 and monitor recall continuously.
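
One way continuous recall monitoring could be sketched is a rolling window over cases whose true status has been confirmed. Everything here is hypothetical: the class name `RecallMonitor`, the window size, and the alert level are illustrative and not part of the project. Recall can only be measured where the true outcome eventually becomes known, including for results that were not flagged.

```python
from collections import deque


class RecallMonitor:
    """Track recall over the most recent confirmed high-risk cases (hypothetical helper)."""

    def __init__(self, window=200, min_recall=0.95):
        # Each entry is 1 if a confirmed high-risk case was flagged, 0 if it was missed.
        self.outcomes = deque(maxlen=window)
        self.min_recall = min_recall

    def record(self, was_flagged, truly_high_risk):
        # Only confirmed high-risk cases count toward recall.
        if truly_high_risk:
            self.outcomes.append(1 if was_flagged else 0)

    def recall(self):
        if not self.outcomes:
            return None  # no confirmed high-risk cases seen yet
        return sum(self.outcomes) / len(self.outcomes)

    def needs_attention(self):
        current = self.recall()
        return current is not None and current < self.min_recall


monitor = RecallMonitor(window=5, min_recall=0.9)
for flagged in [True, True, True, False, True]:
    monitor.record(was_flagged=flagged, truly_high_risk=True)
monitor.record(was_flagged=False, truly_high_risk=False)  # ignored: not high-risk

print(monitor.recall())           # 0.8
print(monitor.needs_attention())  # True
```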

9. Tech Stack

Python
pandas, numpy
scikit-learn
Jupyter Notebook
