Probing quality-evaluative geometry in transformer hidden states. GPT-2 encodes quality better than BERT, with a negativity bias that mirrors human cognition.
nlp ml bert quality-assessment gpt-2 huggingface transformer-interpretability mechanistic-interpretability probing-classifiers
-
Updated
Apr 7, 2026 - Jupyter Notebook