Robustness and Reliance: How Bangla–English Code-Mixing Affects Clinical Language Models and the Clinicians Who Use Them

This repository contains the dataset generation pipelines, evaluation framework, and empirical analysis for Robustness and Reliance, a research project on Bangla-English code-mixing in clinical AI. The study quantifies how code-mixed clinical notes (often called Banglish) degrade the performance of state-of-the-art (SOTA) clinical language models, and it evaluates how much clinicians rely on the resulting, potentially flawed, AI outputs.

📌 Research Vision & Core Concept

In many non-Western clinical settings, such as those in Bangladesh, physicians rarely document patient histories in a single language. Instead, they write heavily code-mixed notes that combine English medical terminology with Bangla descriptions of symptoms. This project addresses two understudied vulnerabilities:

Model Robustness: Assessing how severely standard clinical models (e.g., ClinicalBERT, Med-PaLM, Llama-3-Med) degrade when processing non-standard, code-mixed clinical text.
Human Reliance: Conducting user-study simulations to measure whether clinicians over-rely (automation bias) or under-rely (algorithm aversion) on AI explanations when models are confused by code-mixed inputs.
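
To make the two reliance directions concrete, here is a minimal sketch of how they could be scored from user-study trials. The `Trial` record, its field names, and the `reliance_rates` function are hypothetical and are not part of this repository. The sketch assumes each trial records whether the model's suggestion was correct and whether the clinician's final decision followed it.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Trial:
    """One clinician decision in a simulated session (hypothetical schema)."""
    ai_correct: bool   # was the model's suggestion correct?
    followed_ai: bool  # did the clinician's final decision match the suggestion?


def reliance_rates(trials: List[Trial]) -> Dict[str, float]:
    """Return over- and under-reliance rates for a list of trials.

    over_reliance:  share of wrong AI suggestions the clinician still followed
    under_reliance: share of correct AI suggestions the clinician rejected
    """
    wrong = [t for t in trials if not t.ai_correct]
    right = [t for t in trials if t.ai_correct]
    # Guard against empty groups so the function never divides by zero.
    over = sum(t.followed_ai for t in wrong) / len(wrong) if wrong else float("nan")
    under = sum(not t.followed_ai for t in right) / len(right) if right else float("nan")
    return {"over_reliance": over, "under_reliance": under}


if __name__ == "__main__":
    session = [
        Trial(ai_correct=False, followed_ai=True),
        Trial(ai_correct=False, followed_ai=False),
        Trial(ai_correct=True, followed_ai=True),
        Trial(ai_correct=True, followed_ai=False),
        Trial(ai_correct=True, followed_ai=True),
        Trial(ai_correct=True, followed_ai=True),
    ]
    print(reliance_rates(session))
```

Comparing these two rates between monolingual and code-mixed input conditions is one possible way to check whether code-mixing shifts clinician behavior. The actual study design may use different measures.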

🛠️ Key Features & Methodology

Synthetic Code-Mixing Generator: Algorithms that apply linguistic switching-point rules to automatically transform standard English or Bangla clinical notes into realistic code-mixed notes.
Robustness Stress-Testing: Evaluation suite measuring token-level prediction dropouts, Named Entity Recognition (NER) failures, and clinical question-answering degradation.
Clinician Telemetry Framework: Tools to log clinician interactions, decision shifts, and error-catching rates during simulated clinical UI sessions.
Perplexity & Confusion Metrics: Automated calculation of cross-lingual perplexity to help identify which transformer layers collapse under heavy code-mixing.
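
As an illustration of the generator idea, the sketch below swaps selected English terms for romanized Bangla equivalents at a configurable mix ratio. It is a deliberately simplified stand-in: it uses random lexical substitution rather than the linguistic switching-point rules described above. The `TOY_LEXICON` entries, the `code_mix` function, and its parameters are assumptions made for illustration and do not reflect the repository's actual pipeline.

```python
import random
from typing import Dict

# Tiny illustrative lexicon (romanized Bangla); a real pipeline would need a
# clinically validated resource. These entries are assumptions for the demo.
TOY_LEXICON: Dict[str, str] = {
    "fever": "jor",
    "pain": "byatha",
    "cough": "kashi",
}


def code_mix(note: str, lexicon: Dict[str, str], mix_ratio: float, seed: int = 0) -> str:
    """Replace lexicon words in `note` with probability `mix_ratio`.

    Simplification: each eligible word is switched independently at random,
    which ignores syntactic constraints on where switches can occur.
    """
    rng = random.Random(seed)  # seeded so outputs are reproducible
    mixed = []
    for token in note.split():
        core = token.strip(".,;:")  # keep surrounding punctuation intact
        key = core.lower()
        if key in lexicon and rng.random() < mix_ratio:
            token = token.replace(core, lexicon[key])
        mixed.append(token)
    return " ".join(mixed)


if __name__ == "__main__":
    note = "Patient reports fever and cough."
    for ratio in (0.0, 0.5, 1.0):
        print(ratio, "->", code_mix(note, TOY_LEXICON, ratio))
```

Sweeping `mix_ratio` from 0 to 1 gives a simple way to produce inputs of increasing mixing intensity for stress tests, under the assumptions stated above.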

📂 Repository Structure

├── src/
│   ├── generator/          # Code-mixing synthesis pipelines (Linguistic switching rules)
│   ├── evaluation/         # Robustness metrics, NER evaluation, and Perplexity logs
│   ├── models/             # Adapters and wrappers for base ClinicalBERT/LLMs
│   └── user_study/         # Clinician interaction metrics and decision telemetry tools
├── data/                   # Anonymized synthetic datasets and evaluation benchmarks
├── configs/                # Mix-ratio parameters and tokenization setups
├── notebooks/              # Statistical graphs, error heatmaps, and reliance curves
├── Literature_Review/      # Team research matrices and BibTeX reference files
└── README.md
