A professional, machine learning-powered web application built with Python and Streamlit. This tool uses a Random Forest Classifier trained on the Pima Indians Diabetes Database to predict the probability of diabetes based on clinical health metrics.
Click here to try the Live App
- Interactive Sidebar: Adjustable sliders and input fields for real-time data entry.
- AI-Powered Predictions: Instant probability scoring with clear "High Risk" or "Low Risk" feedback.
- Data Visualization: Interactive donut charts powered by
Plotlyshowing training data distribution. - Downloadable Reports: Generate and download a personalized
.txtdiagnostic report of the results. - Model Transparency: View accuracy metrics and model evaluation details (Confusion Matrix) directly in-app.
- Language: Python 3.13
- Framework: Streamlit (Web UI)
- Machine Learning: Scikit-learn (Random Forest)
- Data Handling: Pandas & NumPy
- Visualization: Plotly Express
- Deployment: Streamlit Community Cloud
- Clone the repository:
git clone [https://github.com/ali-faraz-py/DiabetesDetector.git](https://github.com/ali-faraz-py/DiabetesDetector.git) cd DiabetesDetector - Install dependencies:
pip install -r requirements.txt
- Run the application:
streamlit run app.py
DiabetesDetector/
βββ app.py # Streamlit Web Application logic
βββ model.pkl # Pre-trained Random Forest Model
βββ explore.ipynb # Data analysis & model training notebook
βββ requirements.txt # Project dependencies
βββ .gitattributes # GitHub language customization
βββ assets/ # Images & Demo GIFs
The model achieves an 80.5% accuracy rate. Below is the Confusion Matrix showing how the model performs on unseen data:
The matrix shows our model is particularly strong at identifying healthy patients, with a focus on reducing false negatives.
Syed Ali Faraz - GitHub Profile
If you found this tool insightful, please give the repository a β!

