Skip to content

Code-Studio-AI-Research-Lab/Clinical-Banglish-Robustness

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Robustness and Reliance: How Bangla–English Code-Mixing Affects Clinical Language Models and the Clinicians Who Use Them

License: MIT Framework: PyTorch Domain: Clinical NLP

This repository contains the dataset generation pipelines, evaluation framework, and empirical analysis for Robustness and Reliance, a research initiative investigating the intersection of multi-lingual code-mixing (Bangla-English) and critical healthcare AI. Our study quantifies how syntactically mixed clinical notes (Banglish/Code-Mixed) degrade the performance of State-of-the-Art (SOTA) Clinical Language Models and evaluates the downstream cognitive reliance of clinicians who interact with these potentially flawed AI outputs.


📌 Research Vision & Core Concept

In non-Western clinical settings like Bangladesh, physicians rarely document patient histories in a single, pure language. Instead, they produce highly code-mixed notes containing a fusion of English medical terminology and Bangla symptomatic descriptions. This project addresses two critical unmapped vulnerabilities:

  1. Model Robustness: Assessing how severely standard clinical models (e.g., ClinicalBERT, Med-PaLM, Llama-3-Med) degrade when processing non-standard, code-mixed clinical syntax.
  2. Human Reliance: Conducting user-study simulations to measure whether clinicians over-rely (Automation Bias) or under-rely (Algorithm Aversion) on AI explanations when models are confused by code-mixed inputs.

🛠️ Key Features & Methodology

  1. Synthetic Code-Mixing Generator: Algorithms using linguistic switching-point laws to automatically transform standard English/Bangla clinical notes into realistic code-mixed medical logs.
  2. Robustness Stress-Testing: Evaluation suite measuring token-level prediction dropouts, Named Entity Recognition (NER) failures, and clinical question-answering degradation.
  3. Clinician Telemetry Framework: Framework to log clinician interactions, decision shifts, and error-catching rates during specialized clinical UI simulations.
  4. Perplexity & Confusion Metrics: Automated calculation of Cross-Lingual Perplexity to benchmark which transformer layers collapse under heavy code-mixing.

📂 Repository Structure

├── src/
│   ├── generator/          # Code-mixing synthesis pipelines (Linguistic switching rules)
│   ├── evaluation/         # Robustness metrics, NER evaluation, and Perplexity logs
│   ├── models/             # Adapters and wrappers for base Clinical BERT/LLMs
│   └── user_study/         # Clinician interaction metrics and decision telemetry tools
├── data/                   # Anonymized synthetic datasets and evaluation benchmarks
├── configs/                # Mix-ratio parameters and tokenization setups
├── notebooks/              # Statistical graphs, error heatmaps, and reliance curves
├── Literature_Review/      # Team research matrices and BibTeX reference files
└── README.md

About

"Robustness and Reliance: How Bangla–English Code-Mixing Affects Clinical Language Models and the Clinicians Who Use Them". Evaluates clinical LLM vulnerabilities and human-AI reliance under code-mixed medical corpora.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors