Note: This repository contains the original replication package of the FORGE paper and the extended files submitted for review to ESME (Extended dataset, RQ4 and RQ5).
M. Macedo, Y. Tian, F. Cogo, and B. Adams, "Exploring the Impact of the Output Format on the Evaluation of Large Language Models for Code Translation," in FORGE, 2024.
Preliminary
- original_dataset_before_processing.csv: Base dataset with 4,000 code samples.
- dataset_after_processing.csv: Dataset with 3,820 code samples whose token length is under 3,072; includes the input code, test input, and expected test output. Please note that the input code does not include the ground truth (reference output code).
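The preprocessing step above can be sketched as a simple token-length filter. This is a minimal illustration in plain Python; the field names (`input_code`, `token_length`) are illustrative assumptions, not the actual CSV schema of the package.

```python
# Minimal sketch of the preprocessing filter described above: keep only
# samples whose token length is below 3,072. Field names here are
# illustrative assumptions, not the package's actual CSV columns.
samples = [
    {"input_code": "print('a')", "token_length": 120},
    {"input_code": "print('b')", "token_length": 3500},  # dropped: too long
    {"input_code": "print('c')", "token_length": 3071},  # kept: just under cutoff
]

kept = [s for s in samples if s["token_length"] < 3072]
print(len(kept))  # -> 2
```

In the real pipeline this filter reduced the base dataset from 4,000 to 3,820 samples.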
RQ1
- Contains the combined randomly sampled subsets for RQ1 (Vanilla and Reference Prompt subsets)
RQ2
- Contains the combined randomly sampled subsets for RQ2 (Vanilla and Control Prompt subsets)
RQ3
- inference_output_11_models_*: Contains the inference output of the 11 models across the dataset for each tested combination in RQ3 (CRE, VDE, VRE), alongside the input code, translated code, test case input, and expected test case output.
- rand_sample_350: The randomly sampled subset used to investigate the compilation problems in RQ3.
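The wildcard in `inference_output_11_models_*` expands to one file per tested combination (CRE, VDE, VRE). A hedged sketch of matching those files by pattern; the exact file names and the `.csv` extension are assumptions based on the naming pattern described above:

```python
from fnmatch import fnmatch

# Hypothetical file names following the pattern above; the actual names
# and extensions in the replication package may differ.
files = [
    "inference_output_11_models_CRE.csv",
    "inference_output_11_models_VDE.csv",
    "inference_output_11_models_VRE.csv",
    "rand_sample_350.csv",
]

# Select only the per-combination inference-output files.
matches = [f for f in files if fnmatch(f, "inference_output_11_models_*")]
print(matches)  # the three per-combination inference files
```

The same pattern can be used with `glob.glob` to enumerate the files directly from the RQ3 directory.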
Note: Content below is added as part of the extended version of this paper, submitted for review to ESME.
- RQ4
- Contains the notebooks that generate the metrics and construct the extended dataset, as well as the dataset files inside archive.zip
- RQ5
- Contains the inference outputs from the closed-source models that were manually examined