Investigating large language models capabilities for automatic code repair in Python


연구 분야: Analysis



학회: Cluster Computing


초록

Developers often encounter challenges with their introductory programming tasks as part of the development process. Unfortunately, rectifying these mistakes manually can be time-consuming and demanding. Automated program repair (APR) techniques offer a potential solution by synthesizing fixes for such errors. Previous research has investigated the utilization of both symbolic and neural techniques within the APR domain. However, these approaches typically demand significant engineering efforts or extensive datasets and training. In this paper, we explore the potential of using a large language model trained on code, specifically, we assess ChatGPT’s capability to detect and repair bugs in simple Python programs. The experimental evaluation encompasses two benchmarks: QuixBugs and Textbook. Each benchmark consists of simple Python functions that implement well-known algorithms and each function contains a single bug. To gauge repair performance in various settings, several benchmark variations were introduced including addition of plain English documentation and code obfuscation. Based on thorough experiments, we found that ChatGPT was able to correctly detect and fix about 50% of the methods, when code is documented. Repair performance drops to 25% when code is obfuscated, and 15% when documentation is removed and code is obfuscated. Furthermore, when compared to existing APR systems, ChatGPT considerably outperformed them.


Author Profile
Safwan Omari

Department of Engineering Computing and Mathematical Sciences Lewis University Romeoville IL 60446 USA

Andorra
Author Profile
Kshitiz Basnet

Department of Engineering Computing and Mathematical Sciences Lewis University Romeoville IL 60446 USA

Andorra
Author Profile
Mohammad Wardat

Department of Computer Science and Engineering Oakland University Rochester MI 48309 USA

Andorra

📄 논문 정보

발행 연도 2024년
인용수 0
출판 국가 Andorra
사이트 Springer
좋아요 수 0

연관 논문 목록 (248건)