ATV

Code repositories for ATV (Adaptive Task Vectors).

Requirements

To run this code, create and activate a conda environment using the provided environment.yaml file:

conda env create -f environment.yaml

conda activate ATV

Run code

Prepare datasets

./scripts/prepare_dataset.sh

Train ATV model

./scripts/ATV_training.sh

Running the above script trains the model on all 20 in-domain datasets. After training, evaluation is performed on the test samples from all in-domain datasets.

Batch options (Training)

--batch_size N: Groups N samples for GPT-2 forward.
--llama_batch: Additionally batches LLaMA forward (N samples × 3 templates at once). FP16 numerical differences cause slightly different training trajectories.
--logits_to_keep: Computes only final-token logits on compatible last-token inference paths. Adaptive training loss keeps full logits to preserve the original teacher-forcing objective.
--gradient_checkpointing: Enables LLaMA activation checkpointing during training. This substantially reduces memory by recomputing LLaMA activations during backward, with extra compute cost.

# GPT-2 batch only
python ATV_training.py ... --batch_size 2

# GPT-2 + LLaMA batch
python ATV_training.py ... --batch_size 2 --llama_batch

# Memory-focused mode
python ATV_training.py ... --batch_size 2 --llama_batch --gradient_checkpointing

Evaluate all datasets

./scripts/ATV_evaluate.sh

Running the above script enables evaluation of performance on each individual dataset within the full collection.

Batch options (Evaluation)

--batch_size N: Batches N samples for GPT-2 and LLaMA forward simultaneously.
--logits_to_keep: Computes only the final-token logits for last-token evaluation. This saves memory but can introduce tiny numerical differences.

python ATV_evaluate.py ... --batch_size 4

Analyze results

python ATV_analysis.py

This script enables evaluation of performance for each category. Please make sure to modify the result_dirs variable in ATV_analysis.py to match the path to your result directory.

Analyze unseen task

python ATV_analysis.py

For unseen data, run ATV_unseen.py to perform the evaluation. As above, make sure to set the correct paths accordingly.

Acknowledge

This repository is built on top of the ELICIT project. We thank the authors for sharing the source and their work itself.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
dataset_files		dataset_files
scripts		scripts
utils		utils
.gitignore		.gitignore
ATV_analysis.py		ATV_analysis.py
ATV_evaluate.py		ATV_evaluate.py
ATV_training.py		ATV_training.py
ATV_unseen.py		ATV_unseen.py
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
process_data.py		process_data.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ATV

Requirements

Run code

Prepare datasets

Train ATV model

Batch options (Training)

Evaluate all datasets

Batch options (Evaluation)

Analyze results

Analyze unseen task

Acknowledge

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

ATV

Requirements

Run code

Prepare datasets

Train ATV model

Batch options (Training)

Evaluate all datasets

Batch options (Evaluation)

Analyze results

Analyze unseen task

Acknowledge

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages