Can the Rookies Cut the Tough Cookie? Exploring the Use of LLMs for SQL Equivalence Checking

Rajat Singh (rajat.singh@cse.iitd.ac.in), Srikanta Bedathur (srikanta@cse.iitd.ac.in)

Environment Details

We use Python 3.9.18 to run each file. We use Pytorch '2.2.1' with cuda '12.1.105'.

Dataset

For the dataset, please fill out this Google Form: https://forms.gle/cJiRNeGbvcTDYTyt6, and we will share the dataset link with you afterwards.

Instruction to Run Code

The 'result' folder contains three folders, each containing code for each dataset. The code in these folders is arranged according to the prompting pipeline given in the paper. To get the result, you must run the '.ipynb' files in this folder; before running these .ipynb files, please make the following changes:

Update the API keys for Gemini-Pro and GPT in the code (.ipynb files).
Download the relevant models (specifically CodeLLama-7B and CodeLLama-13B) from Hugging Face (https://huggingface.co).
Update all the paths in the config file.

Instruction for Fine-tuning the model

The 'finetune' folder contains all the files required for fine-tuning. The dataset we used to finetune is in the 'finetune/dataset' folder, and the script used to fine-tune the model is in the 'finetune\script' folder. Before running the finetuning code ('finetune.ipynb'), please make the following changes:

Download the relevant models (specifically CodeLLama-13B) from Hugging Face (https://huggingface.co).
Update all the paths in 'finetune.ipynb' file.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
context_prompts		context_prompts
finetune		finetune
prompts		prompts
results		results
samples		samples
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Can the Rookies Cut the Tough Cookie? Exploring the Use of LLMs for SQL Equivalence Checking

Environment Details

Dataset

Instruction to Run Code

Instruction for Fine-tuning the model

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Can the Rookies Cut the Tough Cookie? Exploring the Use of LLMs for SQL Equivalence Checking

Environment Details

Dataset

Instruction to Run Code

Instruction for Fine-tuning the model

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages