This repository contains the code and data for reproducing the results in the paper "Exploring Cognitively Informed Sentence Simplification with Gaze-Guided Text Generation" (Gaze4NLP 2026).
Requires Python>=3.12.
pip install -r requirements.txt
pip install --no-deps lens-metric==0.2.0bash train_gaze_models.shThis trains all gaze model configurations and saves them in gaze-models/.
bash score_texts.shThis predicts gaze scores for the reference simplifications in the ASSET validation set and saves them in gaze-scores/.
bash generate_texts.shThis generates outputs for the ASSET validation set with all gaze model configurations and saves them in outputs/.
bash evaluate_texts.shThis calculates evaluation metrics for all outputs and saves them in results/.
python analysis.pyThis generates the figures in the paper and saves them in results/.
Eye-tracking data (gaze-data/): The EMTeC, OneStop, MECO, and Provo corpora are all licensed under CC-BY-4.0.
Sentence simplification data (simplification-data): The ASSET dataset is licensed under CC-BY-NC-4.0.
The code in this repository is licensed under MIT.