@article{zhang2024survey,
title={A Systematic Literature Review on Large Language Models for Automated Program Repair},
author={Zhang, Quanjun and Fang, Chunrong and Xie, Yang and Ma, Yuxiang and Sun, Weisong and Yang, Yun and Chen, Zhenyu},
journal={arXiv preprint arXiv:2405.01466}
year={2024}
}
Title | Year | Venue |
---|---|---|
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation | 2025 | ICSE@DeepTest |
RUG: Turbo LLM for Rust Unit Test Generation | 2025 | ICSE |
What You See Is What You Get: Attention-based Self-guided Automatic Unit Test Generation | 2025 | ICSE |
Enhancing LLM's Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection | 2025 | ArXiv |
Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation | 2025 | ICST |
CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge | 2025 | ArXiv |
A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing | 2025 | ISSTA |
Dynamic Scaling of Unit Tests for Code Reward Modeling | 2025 | ArXiv |
The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation | 2025 | ArXiv |
Mutation-Guided LLM-based Test Generation at Meta | 2025 | ArXiv |
STRUT: Structured Seed Case Guided Unit Test Generation for C Programs using LLMs | 2025 | ISSTA |
LLM-based Unit Test Generation for Dynamically-Typed Programs | 2025 | ArXiv |
A3Test: Assertion-Augmented Automated Test Case Generation | 2024 | IST |
ChatUniTest: A Framework for LLM-Based Test Generation | 2024 | FSE |
On the Evaluation of Large Language Models in Unit Test Generation | 2024 | ASE |
An Empirical Evaluation of Using Large Language Models for Automated Unit Test Generation | 2024 | TSE |
Enhancing LLM-based Test Generation for Hard-to-Cover Branches via Program Analysis | 2024 | ArXiv |
LLM-Powered Test Case Generation for Detecting Tricky Bugs | 2024 | ArXiv |
Evaluating and Improving ChatGPT for Unit Test Generation | 2024 | FSE |
CasModaTest: A Cascaded and Model-agnostic Self-directed Framework for Unit Test Generation | 2024 | ArXiv |
Large-scale, Independent and Comprehensive study of the power of LLMs for test case generation | 2024 | ASE |
Effective test generation using pre-trained Large Language Models and mutation testing | 2024 | IST |
Optimizing Search-Based Unit Test Generation with Large Language Models: An Empirical Study | 2024 | Internetware |
Domain Adaptation for Code Model-based Unit Test Case Generation | 2024 | ISSTA |
LLM-based Unit Test Generation via Property Retrieval | 2024 | ArXiv |
TestBench: Evaluating Class-Level Test Case Generation Capability of Large Language Models | 2024 | ArXiv |
Retrieval-Augmented Test Generation: How Far Are We? | 2024 | ArXiv |
Rethinking the Influence of Source Code on Test Case Generation | 2024 | ArXiv |
HITS: High-coverage LLM-based Unit Test Generation via Method Slicing | 2024 | ASE |
A System for Automated Unit Test Generation Using Large Language Models and Assessment of Generated Test Suites | 2024 | ArXiv |
TestART: Improving LLM-based Unit Test via Co-evolution of Automated Generation and Repair Iteration | 2024 | ArXiv |
Harnessing the Power of LLMs: Automating Unit Test Generation for High-Performance Computing | 2024 | ArXiv |
CoverUp: Coverage-Guided LLM-Based Test Generation | 2024 | ArXiv |
Enhancing Large Language Models for Text-to-Testcase Generation | 2024 | ArXiv |
Automated Unit Test Improvement using Large Language Models at Meta | 2024 | FSE |
Code-Aware Prompting: A study of Coverage Guided Test Generation in Regression Setting using LLM | 2024 | FSE |
Automatic Generation of Test Cases based on Bug Reports: a Feasibility Study with Large Language Models | 2024 | ICSE |
Using Large Language Models to Generate JUnit Tests: An Empirical Study | 2024 | EASE |
ChatGPT vs SBST: A Comparative Assessment of Unit Test Suite Generation | 2024 | TSE |
TestSpark: IntelliJ IDEA's Ultimate Test Generation Companion | 2024 | ICSE-Companion |
Towards Understanding the Effectiveness of Large Language Models on Directed Test Input Generation | 2024 | ASE |
Unit Test Generation using Large Language Models for Unity Game Development | 2024 | FaSE4Games |
Data Augmentation by Fuzzing for Neural Test Generation | 2024 | ArXiv |
ASTER: Natural and Multi-language Unit Test Generation with LLMs | 2024 | ICSE-SEIP |
Unit Test Generation using Generative AI : A Comparative Performance Analysis of Autogeneration Tools | 2024 | LLM4Code(workshop) |
Advancing Bug Detection in Fastjson2 with Large Language Models Driven Unit Test Generation | 2024 | ArXiv |
Evaluation of Large Language Models for Unit Test Generation | 2024 | ASYU |
CPP-UT-Bench: Can LLMs Write Complex Unit Tests in C++? | 2024 | ArXiv |
Unit Test Generation for Vulnerability Exploitation in Java Third-Party Libraries | 2024 | ArXiv |
UniTSyn: A Large-Scale Dataset Capable of Enhancing the Prowess of Large Language Models for Program Testing | 2024 | ISSTA |
Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study | 2024 | ArXiv |
Design choices made by LLM-based test generators prevent them from finding bugs | 2024 | ArXiv |
TestGenEval: A Real World Unit Test Generation and Test Completion Benchmark | 2024 | ArXiv |
Automating Autograding: Large Language Models as Test Suite Generators for Introductory Programming | 2024 | JCAL |
LLM4VV: Developing LLM-driven testsuite for compiler validation | 2024 | FGCS |
VALTEST: Automated Validation of Language Model Generated Test Cases | 2024 | ArXiv |
Large Language Models as Test Case Generators: Performance Evaluation and Enhancement | 2024 | ArXiv |
LLM-Based Test-Driven Interactive Code Generation: User Study and Empirical Evaluation | 2024 | TSE |
TDD-Bench Verified: Can LLMs Generate Tests for Issues Before They Get Resolved | 2024 | ArXiv |
PyTester: Deep Reinforcement Learning for Text-to-Testcase Generation | 2024 | ArXiv |
Using GitHub Copilot for Test Generation in Python: An Empirical Study | 2024 | AST |
CAT-LM Training Language Models on Aligned Code And Tests | 2023 | ASE |
Prompting Code Interpreter to Write Better Unit Tests on Quixbugs Functions | 2023 | ArXiv |
An initial investigation of ChatGPT unit test generation capability | 2023 | SAST |
Exploring the Capability of ChatGPT in Test Generation | 2023 | QRS |
Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction | 2023 | ICSE |
How Well does LLM Generate Security Tests | 2023 | ArXiv |
CodeT: Code Generation with Generated Tests | 2023 | ICLR |
CodaMosa: Escaping Coverage Plateaus in Test Generation with Pre-trained Large Language Models | 2023 | ICSE |
Nuances are the Key: Unlocking ChatGPT to Find Failure-Inducing Tests with Differential Prompting | 2023 | ASE |
Unit Test Case Generation with Transformers and Focal Context | 2021 | ArXiv |
Title | Year | Venue |
---|---|---|
A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing | 2025 | ISSTA |
DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model | 2025 | ArXiv |
Improving Retrieval-Augmented Deep Assertion Generation via Joint Training | 2025 | TSE |
Improving Deep Assertion Generation via Fine-Tuning Retrieval-Augmented Pre-trained Language Models | 2025 | TOSEM |
Exploring Automated Assertion Generation via Large Language Models | 2024 | TOSEM |
AssertionBench: A Benchmark to Evaluate Large-Language Models for Assertion Generation | 2024 | ArXiv |
Chat-like Asserts Prediction with the Support of Large Language Model | 2024 | ArXiv |
Transducer Tuning: Efficient Model Adaptation for Software Tasks Using Code Property Graphs | 2024 | ArXiv |
Deep Multiple Assertions Generation | 2024 | FORGE |
Assertify: Utilizing Large Language Models to Generate Assertions for Production Code | 2024 | ArXiv |
An Empirical Study on Focal Methods in Deep-Learning-Based Approaches for Assertion Generation | 2024 | FSE |
Retrieval-Based Prompt Selection for Code-Related Few-Shot Learning | 2023 | ICSE |
Using Transfer Learning for Code-Related Tasks | 2023 | TSE |
Generating Accurate Assert Statements for Unit Test Cases using Pretrained Transformers | 2022 | AST |
On Learning Meaningful Assert Statements for Unit Test Cases | 2020 | ICSE |
Title | Year | Venue |
---|---|---|
TOGLL: Correct and Strong Test Oracle Generation with LLMs | 2025 | ICSE |
AugmenTest: Enhancing Tests with LLM-Driven Oracles | 2025 | ICST |
exLong: Generating Exceptional Behavior Tests with Large Language Models | 2025 | ICSE |
Doc2Oracle: Investigating the Impact of Javadoc Comments on Test Oracle Generation | 2024 | ArXiv |
Do LLMs generate test oracles that capture the actual or the expected program behaviour? | 2024 | ArXiv |
Towards More Realistic Evaluation for Neural Test Oracle Generation | 2024 | ISSTA |
Assessing Evaluation Metrics for Neural Test Oracle Generation | 2024 | TSE |
ChatAssert: LLM-based Test Oracle Generation with External Tools Assistance | 2023 | TSE |
Neural-Based Test Oracle Generation: A Large-scale Evaluation and Lessons Learned | 2023 | FSE |
TOGA: A Neural Method for Test Oracle Generation | 2022 | ICSE |
Title | Year | Venue |
---|---|---|
Automated Test Case Repair Using Language Models | 2025 | TSE |
A Large-scale Empirical Study on Fine-tuning Large Language Models for Unit Testing | 2025 | ISSTA |
REACCEPT: Automated Co-evolution of Production and Test Code Based on Dynamic Validation and Large Language Models | 2025 | ISSTA |
Augmenting LLMs to Repair Obsolete Test Cases with Static Collector and Neural Reranker | 2024 | ISSRE |
Identify and Update Test Cases When Production Code Changes: A Transformer-Based Approach | 2023 | ASE |
Title | Year | Venue |
---|---|---|
What You See Is What You Get: Attention-based Self-guided Automatic Unit Test Generation | 2025 | ICSE |
Automatic Generation of Test Cases based on Bug Reports: a Feasibility Study with Large Language Models | 2024 | ICSE |
Advancing Bug Detection in Fastjson2 with Large Language Models Driven Unit Test Generation | 2024 | ArXiv |
Unit Test Generation for Vulnerability Exploitation in Java Third-Party Libraries | 2024 | ArXiv |
Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction | 2023 | ICSE |
Title | Year | Venue |
---|---|---|
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation | 2025 | ICSE@DeepTest |
Automated Unit Test Refactoring | 2025 | FSE |
Test smells in LLM-Generated Unit Tests | 2024 | ArXiv |
Evaluating Large Language Models in Detecting Test Smells | 2024 | SBES |
Title | Year | Venue |
---|---|---|
TestGenEval: A Real World Unit Test Generation and Test Completion Benchmark | 2024 | ArXiv |
CAT-LM Training Language Models on Aligned Code And Tests | 2023 | ASE |
Learning deep semantics for test completion | 2023 | ICSE |
Title | Year | Venue |
---|---|---|
Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | 2025 | ICSE |
Improving the Readability of Automatically Generated Tests using Large Language Models | 2025 | ICST |
An LLM-based Readability Measurement for Unit Tests' Context-aware Inputs | 2024 | ArXiv |
Title | Year | Venue |
---|---|---|
Identify and Update Test Cases When Production Code Changes: A Transformer-Based Approach | 2023 | ASE |
Title | Year | Venue |
---|---|---|
Automated Unit Test Refactoring | 2025 | FSE |
Title | Year | Venue |
---|---|---|
VALTEST: Automated Validation of Language Model Generated Test Cases | 2024 | ArXiv |
Title | Year | Venue |
---|---|---|
LTM: Scalable and Black-box Similarity-based Test Suite Minimization based on Language Models | 2024 | TSE |
Title | Year | Venue |
---|---|---|
Method-Level Test-to-Code Traceability Link Construction by Semantic Correlation Learning | 2024 | TSE |