KB4CT: Knowledge Base for Compiler Tuning

Project Overview

KB4CT is a knowledge-based compiler tuning system that optimizes LLVM pass sequences through a combination of offline empirical prototype discovery and an online knowledge-guided personalized evolutionary algorithm. The system operates in two main stages: Offline Knowledge Base Construction and Online Knowledge-Guided Personalized Optimization.

Dataset Preparation

Extract Dataset from Supplementary Material

Extract the LLVM IR datasets from the supplementary material. Place the extracted datasets in the project root directory, ensuring the structure is as follows:

KB4CT/
├── dataset/
│   ├── train/          
│   │   ├── dataset1/
│   │   ├── dataset2/
│   │   └── ...
│   └── test/           
│       ├── dataset1/
│       ├── dataset2/
│       └── ...
│── DLL
├── LLVMEnv/
├── llvm_tools/
├── output/
└── KB4CT.py

Running Instructions

Basic Execution

python KB4CT.py

Configuration

You can configure the parameters in the if name == 'main': section of KB4CT.py.

Offline GA Parameters

"offline_ga_params": {
    "seq_len": 100,           # Sequence length
    "population_size": 100,   # Population size
    "generations": 30,        # Number of generations
    "elite_size": 20,         # Number of elite individuals
    "crossover_rate": 0.8,    # Crossover probability
    "mutation_rate": 0.8      # Mutation probability
}

Online GA Parameters

"online_ga_params": {
    "population_size": 50,     # Population size
    "generations": 5,          # Number of generations
    "elite_size": 10,          # Number of elite individuals
    "crossover_rate": 0.8,     # Crossover probability
    "mutation_rate": 0.99      # Mutation probability
}

Output Results

After execution, the results will be saved in the output/ directory:

pass_embeddings_visualization.png: Visualization of pass embeddings
pass_clusters_visualization.png: Visualization of pass clusters
ablation_study_*.png: Ablation study result figures
ablation_study_report.txt: Ablation study report
ablation_detailed_results.json: Detailed result data

Ablation Study Modes

The system supports the following ablation study modes:

full: Full knowledge-guided method
no_knowledge_crossover: No knowledge-guided crossover
no_knowledge_mutation: No knowledge-guided mutation
random_init: Random population initialization
no_knowledge: Standard GA without any knowledge guidance

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

KB4CT: Knowledge Base for Compiler Tuning

Project Overview

Dataset Preparation

Extract Dataset from Supplementary Material

Running Instructions

Basic Execution

Configuration

Offline GA Parameters

Online GA Parameters

Output Results

Ablation Study Modes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
DLL		DLL
LLVMEnv		LLVMEnv
llvm_tools		llvm_tools
output		output
.gitignore		.gitignore
KB4CT.py		KB4CT.py
README.md		README.md

Folders and files

Latest commit

History

Repository files navigation

KB4CT: Knowledge Base for Compiler Tuning

Project Overview

Dataset Preparation

Extract Dataset from Supplementary Material

Running Instructions

Basic Execution

Configuration

Offline GA Parameters

Online GA Parameters

Output Results

Ablation Study Modes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages