Official implementation of "CSKS: Continuously Steering LLMs Sensitivity to Contextual Knowledge with Proxy Models" (EMNLP 2025)
-
Updated
Aug 30, 2025 - Python
Official implementation of "CSKS: Continuously Steering LLMs Sensitivity to Contextual Knowledge with Proxy Models" (EMNLP 2025)
pysteer is a lightweight Python library for activation steering, representation engineering, and inference-time model steering in PyTorch transformer language models. It learns steering artifacts from labeled prompt/response examples, then applies interventions to intermediate activations without fine-tuning or modifying model weights.
Lobopy is a lightweight PyTorch/HuggingFace library for analysing, steering/abliteration of causal language models.
Add a description, image, and links to the model-steering topic page so that developers can more easily learn about it.
To associate your repository with the model-steering topic, visit your repo's landing page and select "manage topics."