MindPulse: Stress Detection & Intervention System

Project Workflow & 2-Day Implementation Timeline

This document outlines the detailed, step-by-step workflow for building the MindPulse mini-project over a 2-day sprint. It breaks down exactly what to build and how to build it, ensuring strict alignment with the 7 core objectives outlined for this project.

🎯 The 7 Core Objectives (Mapped to Workflow)

Identify & Analyze Indicators: Track keyboard latency, typing speed, mouse erraticism, and context-switching.
Collect & Structure Datasets: Extract these data points into 5-minute sliding windows (46 features) and bootstrap with synthetic training data.
Develop ML/DL Model: Train an XGBoost baseline model calibrated to individual users.
Recommendation Module: Suggest dynamic interventions (breathing exercises, Lo-Fi music) based on the stress score.
Desktop Application: Build a real-time Streamlit dashboard for a user-friendly UI.
Continuous Improvement (User Feedback): Implement Online Learning (Exponential Moving Average) to adapt to the user's unique baseline over time.
Evaluation Metrics: Build a test script to evaluate Accuracy, Precision, Recall, and F1-score mathematically.

🗓️ 2-Day Detailed Execution Timeline

Day 1: Data Infrastructure & Feature Engineering

Phase 1: Environment Setup & Raw Data Collection (Hours 0 - 3)

Target Objectives: Objective 1
What to build: The background listener that captures real-time behavioral interactions securely.
How to build:
1. Create requirements.txt (pynput, pandas, xgboost, scikit-learn, streamlit, psutil).
2. Write data_collector.py using the pynput library.
3. Implement listeners for on_press, on_release, on_move, and on_click.
4. Privacy rule: Never record the actual characters typed, only the timestamps and key categories (alpha, digit, special) to measure how they type, not what.
5. Use psutil (or ctypes for Windows) to track the active application window to measure context-switching frequency.

Phase 2: Feature Extraction & Data Structuring (Hours 3 - 7)

Target Objectives: Objective 2
What to build: The engine that translates raw clicks/keys into mathematical features (e.g., WPM, pauses, rage clicks).
How to build:
1. Write feature_extractor.py.
2. Segment the raw timeline into 5-minute sliding windows moving at 1-minute steps.
3. Extract 46 specific features per window using numpy arrays:
  - Keyboard: Hold time variance, flight time (interval between keys), Typing Speed (WPM), Error Rate (backspaces), Rhythm Entropy.
  - Mouse: Velocity variance, direction changes, Rage Clicks (rapid clustered clicks), Scroll variation.
  - Context: Tab/App switch frequency (proxy for cognitive scatter/overload).

Phase 3: Dataset Bootstrapping (Hours 7 - 10)

Target Objectives: Objective 2
What to build: A dataset to train our model before the app gathers weeks of real user data.
How to build:
1. Since we are not asking users to take stress tests on Day 1, we will write a generate_synthetic_data() function.
2. Use mathematical modeling to simulate 3,000+ "sessions." E.g., force the Typing Entropy and Rage Click features to correlate strongly with a y = STRESSED label.
3. This acts as our "pre-trained" base so the ML model works immediately.

Day 2: Machine Learning, UI, & Interventions

Phase 4: Model Training & Personal Calibration (Hours 10 - 14)

Target Objectives: Objective 3, Objective 6
What to build: The core XGBoost classification model and the user-calibration system.
How to build:
1. Write model.py. Train an xgb.XGBClassifier on the synthetic dataset to output probabilities for 3 classes: NEUTRAL, MILD, STRESSED.
2. Addressing Objective 6 (Continuous Improvement): Build a PersonalBaseline class hooked to a local SQLite database (user_baselines.db).
3. Use an Exponential Moving Average (EMA) so the system slowly learns what your specific typing speed is at different times of the day, adjusting the model's sensitivity to match your unique circadian rhythm.

Phase 5: Dashboard UI & Recommendation Module (Hours 14 - 18)

Target Objectives: Objective 4, Objective 5
What to build: The desktop application interface and intervention logic.
How to build:
1. Write app.py using Streamlit. Streamlit is chosen because it allows for lighting-fast UI prototyping in pure Python.
2. Wire the dashboard to call collector.get_events() every few seconds to feed the live inference model.
3. Display a Live Stress Meter (0-100 gauge map) and breakdown of active metrics (WPM, Mouse Speed).
4. Addressing Objective 4 (Interventions): Build an if/elif logic block based on the stress score.
  - If score > 70: Trigger a red alert UI banner suggesting a 5-minute Box Breathing exercise or providing a link to Lo-Fi focus beats.
  - If score > 50: Trigger a yellow warning suggesting a quick stretch.

Phase 6: Automated Evaluation Pipeline (Hours 18 - 20)

Target Objectives: Objective 7
What to build: The testing script to prove the technical capabilities of the model.
How to build:
1. Write evaluate_model.py.
2. Generate a heavily imbalanced hold-out test set (e.g., 1,000 synthetic sessions unseen by the XGBoost model).
3. Use scikit-learn metrics to output:
  - Accuracy: Overall correctness.
  - Precision (Macro): Minimizing false stress alarms so users don't lose trust.
  - Recall (Macro): Ensuring the app doesn't miss actual stressed states.
  - F1-Score: The harmonized mean of precision/recall given class imbalances.
4. (Note: The user-centric measures like "usability" and "satisfaction" will be measured qualitatively when you and your team actually use the Streamlit interface).

🛠️ Summary Technical Stack

Data Collection: pynput, psutil
Data Processing: numpy, pandas
Machine Learning: xgboost, scikit-learn
Application UI: streamlit
Database (Calibration): sqlite3

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
app.py		app.py
data_collector.py		data_collector.py
evaluate_model.py		evaluate_model.py
feature_extractor.py		feature_extractor.py
model.py		model.py
requirements.txt		requirements.txt
sample_data.csv		sample_data.csv
synthetic_data.py		synthetic_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MindPulse: Stress Detection & Intervention System

Project Workflow & 2-Day Implementation Timeline

🎯 The 7 Core Objectives (Mapped to Workflow)

🗓️ 2-Day Detailed Execution Timeline

Day 1: Data Infrastructure & Feature Engineering

Phase 1: Environment Setup & Raw Data Collection (Hours 0 - 3)

Phase 2: Feature Extraction & Data Structuring (Hours 3 - 7)

Phase 3: Dataset Bootstrapping (Hours 7 - 10)

Day 2: Machine Learning, UI, & Interventions

Phase 4: Model Training & Personal Calibration (Hours 10 - 14)

Phase 5: Dashboard UI & Recommendation Module (Hours 14 - 18)

Phase 6: Automated Evaluation Pipeline (Hours 18 - 20)

🛠️ Summary Technical Stack

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MindPulse: Stress Detection & Intervention System

Project Workflow & 2-Day Implementation Timeline

🎯 The 7 Core Objectives (Mapped to Workflow)

🗓️ 2-Day Detailed Execution Timeline

Day 1: Data Infrastructure & Feature Engineering

Phase 1: Environment Setup & Raw Data Collection (Hours 0 - 3)

Phase 2: Feature Extraction & Data Structuring (Hours 3 - 7)

Phase 3: Dataset Bootstrapping (Hours 7 - 10)

Day 2: Machine Learning, UI, & Interventions

Phase 4: Model Training & Personal Calibration (Hours 10 - 14)

Phase 5: Dashboard UI & Recommendation Module (Hours 14 - 18)

Phase 6: Automated Evaluation Pipeline (Hours 18 - 20)

🛠️ Summary Technical Stack

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages