Repo accompanying my master's thesis.
(Image created by DALL·E 3)
Language models for program synthesis are usually trained and evaluated on programming competition datasets (MBPP, APPS). However, these datasets are limited in size and quality, while these language models are extremely data-hungry. Additionally, the program synthesis process of these models is misaligned with how humans write code: while humans iteratively develop code with the help of a compiler, most program synthesis models currently produce their code in one go. To address these issues, we introduce a bootstrapping algorithm for program synthesis that supports teaching models how to repair. We show that bootstrapping consistently outperforms regular fine-tuning. Compared to other work, our bootstrapped model performs on par with fine-tuned models that are 68% larger. Notably, bootstrapping with repairing also improves non-repairing performance at inference time compared to regular bootstrapping. However, for our models, repairing during inference is likely inferior to simply sampling the same number of solutions. Furthermore, we identify issues with the example test cases in the training portion of the APPS dataset; since many repairing and reinforcement learning methods rely on these test cases, these findings are valuable to the community.
If you use this work, please cite it as:
```bibtex
@mastersthesis{vdvleuten2023,
  title  = {Dr. Boot: Bootstrapping Program Synthesis Language Models to Perform Repairing},
  author = {Noah van der Vleuten},
  year   = {2023},
  month  = {July},
  note   = {Available at \url{https://scripties.uba.uva.nl/search?id=record_54126}},
  school = {University of Amsterdam},
  type   = {Master's thesis}
}
```
- `configs/`: Configuration files for the experiments.
- `data/`: Datasets used in the thesis.
- `models/`: Code for running and training the CodeT5 model.
- `results/`: Results of the experiments and analysis tools used in the thesis.
- `few_shot_examples/`: Few-shot examples used in the thesis.
- `experiment_scripts/`: Scripts used to run the experiments.
- `./`: Training scripts with helper functions, including the code for the bootstrapping algorithm (`train_sdr.py`).
To run the experiments, we first need to install the required packages. An `env.yml` file is included in the repository to create a conda environment with these packages. To create the environment, run the following command from the root directory:
```bash
conda env create -f env.yml
```
Then activate the environment with the following command:
```bash
conda activate drboot
```
All experiments included in the thesis are stored in the `experiment_scripts/` directory.
For example, to run the APPS bootstrapping experiment with full compiler feedback, see `experiment_scripts/apps_jobs/full_feedback_apps_job.sh`, which runs the following command from the root directory:
```bash
python train_sdr.py --batch-size-per-replica 6 --grad-acc-steps 4 --inference_batch_size 70 --num_workers 16 --model codet5-large-ntp-py --training_mode full_feedback --exp_name full_feedback_bootstrap_apps_1 --perform_experiments --beam_search_batch_size 35 --dataset APPS --only_perform_basic_tests --seed 18 --validate_first_step
```