Project Title

This project implements early exits (EEs) in LLMs, covering both Transformer and Mamba architectures, and analyzes their impact on computation.

This code should be able to train any Mistral model (Transformer or Mamba) with EEs, ready for use at inference time.

Installation

Follow these steps to install and set up the project locally. First, clone the repository:

git clone https://github.com/Xigm/EE_Clean.git

Create the environment from the environment.yml file. A Linux OS is required.

conda env create -n your_env_name -f environment.yml

Install the CUDA toolkit (version 11.6 or later; 12.3 was used here). Then activate the environment and install the Mamba dependencies:

conda activate your_env_name
pip install causal-conv1d
pip install mamba-ssm
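
After installation, a quick sanity check can confirm that the key packages are visible to Python. The sketch below is not part of the repository. It uses only the standard library and assumes the import names `torch`, `causal_conv1d`, and `mamba_ssm`, which are the usual module names for these packages; `check_modules` is a hypothetical helper name.

```python
import importlib.util

# Import names assumed for the packages installed above.
REQUIRED_MODULES = ["torch", "causal_conv1d", "mamba_ssm"]


def check_modules(names):
    """Return a dict mapping each top-level module name to True if it can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_modules(REQUIRED_MODULES).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

Run it inside the activated conda environment. A MISSING entry suggests that the corresponding install step did not complete.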

Usage

To train your own early exits, open train.py and specify where the EEs are placed. Training is performed on the FineWeb-Edu dataset.

To test inference, open inference_EE.py, select the model, and modify the input.
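
For readers new to early exits, the general idea can be sketched as follows: heads attached to selected layers each produce a prediction, and inference stops at the first head that is confident enough. This is a generic illustration in plain Python, not the code in train.py or inference_EE.py. The function names, the max-softmax confidence criterion, the layer indices, and the toy logits are all assumptions made for the example.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def early_exit(exit_logits, threshold):
    """Pick the first exit whose top probability reaches the threshold.

    exit_logits: list of (layer_index, logits) pairs ordered by depth;
                 the last entry stands in for the model's final head.
    Returns (layer_index, predicted_token).
    """
    for layer, logits in exit_logits[:-1]:
        probs = softmax(logits)
        confidence = max(probs)
        if confidence >= threshold:
            return layer, probs.index(confidence)
    # No intermediate exit was confident enough: fall back to the final head.
    layer, logits = exit_logits[-1]
    probs = softmax(logits)
    return layer, probs.index(max(probs))


# Toy logits for hypothetical exits at layers 8 and 16 of a 32-layer model.
toy = [
    (8, [1.0, 1.2, 0.9]),
    (16, [0.5, 4.0, 0.3]),
    (32, [0.1, 6.0, 0.2]),
]
print(early_exit(toy, threshold=0.9))   # exits at layer 16
print(early_exit(toy, threshold=0.99))  # runs to the final layer
```

Lowering the threshold makes earlier exits more likely, trading prediction quality for speed. This is presumably the kind of trade-off explored by the threshold sweeps in evals/sweep_th/.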

Project Structure

├── datasets/                           # Locally hosted datasets, if needed
│   ├── models/                         # ML models
│   └── utils/                          # Helper functions
├── EleutherAI_Eval_harness/            # Code from EleutherAI to evaluate LLMs
│   └── lm_eval/                        
│      ├── models/                      # Wrappers for models to be tested are here
│      │   ├── mamba_models_EE.py       # Custom wrapper for mamba 
│      │   └── mistral_models_EE.py     # Custom wrapper for mistral
│      └── tasks/                       # Available tasks
├── evals/                              # Code to perform evaluations
│   ├── individual_evals/               # Evaluate a model on a single task
│   └── sweep_th/                       # Get results for speed-up vs. performance
│   plot_results.py                     # Plot the results obtained
├── models/                             # Main code for the models
│   ├── mamba/                          # Mamba implementations
│   └── mistral/                        # Transformer implementations
├── weights/                            # To save the backbones and EE weights
│   ├── mamba/                           
│   │   ├── codestral7b/                # Main backbone weights
│   │   └── EE_given_config/            # EE weights for a given configuration
│   └── mistral/                        # Transformer implementations
│       ├── mistral7b/                  # Main backbone weights
│       └── EE_given_config/            # EE weights for a given configuration
├── environment.yml                     # Conda environment file
├── inference_EE.py                     # Code to perform inference with the models
├── train.py                            # Code to train the EEs
├── tasks.txt                           # List of all tasks available in the EleutherAI eval harness
└── utils.py                            # Auxiliary functions

Contributing

Instructions for users who want to contribute to the project...

Acknowledgements

Thanks to:

EleutherAI
Mistral
HuggingFace
