This project combines machine learning with a Flask web interface to predict the risk of lung cancer and survival likelihood based on user input. It consists of two trained models: one for lung cancer risk prediction, and another for survival analysis using comorbidities and clinical indicators.
⚠️ Experimental Project Notice
This tool is created for educational and experimental purposes only. It is not a medical device. The results are derived from limited, small-scale datasets and do not represent clinically validated predictions.
-
Lung Cancer Risk Prediction
Uses a Random Forest model trained on basic survey features like smoking, fatigue, anxiety, etc. -
Survival Prediction
Uses a Decision Tree model trained on synthetic clinical data including comorbidities, tumor stage, and survival duration classes.
- Small and potentially biased datasets
- Simplified feature encoding
- No hyperparameter tuning or cross-validation
- Not suitable for clinical decision-making
- Python 3.x
- Flask
- pandas, numpy
- scikit-learn
- joblib
- matplotlib, seaborn
git clone https://github.com/anilcemelemir/LungCancerPrediction.git
cd LungCancerPredictionpip install -r requirements.txtpython src/train_cancer_model.py
python src/train_survival_model.pyMake sure both
.csvfiles are inside thedata/folder before training.
python main.pyGo to http://localhost:5000 in your browser.
MIT License © 2025 Semih Çetin, Velihan Özge, Anıl Cem Elemir
machine-learning
flask
lung-cancer
healthcare
survival-analysis
classification
python
experimental