Optimizing Error Detection in Generated Code Using Metaheuristic Optimized Natural Language Processing


Research field: Artificial Intelligence



Conference: International Conference on Soft Computing and its Engineering Applications


Abstract

This paper addresses software defect identification with classification methods, encoding source code with Term Frequency-Inverse Document Frequency (TF-IDF) so that it can be processed in the same way as natural-language text. A semi-synthetic dataset is constructed from the Mostly Basic Python Problems (MBPP) dataset: a code generator is trained to produce candidate solutions for given prompts, and incorrect answers that fail the established test criteria are retained as defective code samples. The study evaluates how well the AdaBoost classifier detects such invalid code. Because hyperparameter settings strongly influence classifier performance, several contemporary optimization techniques are used to tune them. A modified version of the particle swarm optimization (PSO) algorithm is also introduced; it performs best among the tested methods, achieving an accuracy exceeding 0.983208. The results indicate the potential of combining classification with optimized hyperparameter tuning for software defect detection.
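
To make the encoding step concrete, the following is a minimal sketch of TF-IDF applied to source code treated as text. It is not the authors' implementation: the regex tokenizer, the smoothed IDF variant, and the names `tokenize_code` and `tfidf` are assumptions chosen for illustration, and the abstract does not specify how the paper tokenizes or weights terms.

```python
import math
import re
from collections import Counter

# Hypothetical tokenizer: identifiers/keywords, integer literals, and
# single punctuation characters. The paper's actual tokenization is not
# described in the abstract.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|[^\s\w]")


def tokenize_code(source):
    """Split a code snippet into a flat list of text tokens."""
    return TOKEN_RE.findall(source)


def tfidf(docs):
    """Return (vocabulary, vectors) for a list of code snippets.

    Term frequency is the token count divided by the snippet length.
    Inverse document frequency uses one common smoothed variant:
    log((1 + N) / (1 + df)) + 1.
    """
    tokenized = [tokenize_code(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many snippets each token appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    vocab = sorted(df)
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty snippets
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors


if __name__ == "__main__":
    # Two made-up snippets: a correct and a defective "add" function.
    samples = [
        "def add(a, b):\n    return a + b",
        "def add(a, b):\n    return a - b",
    ]
    vocab, vectors = tfidf(samples)
    for term, w0, w1 in zip(vocab, vectors[0], vectors[1]):
        print(f"{term!r:10} {w0:.3f} {w1:.3f}")
```

In this toy example the two snippets differ only in the operator, so `+` and `-` are the only terms whose weights differ between the vectors. Feature vectors of this kind could then be passed to a classifier such as AdaBoost, whose hyperparameters the paper tunes with metaheuristic optimizers.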


Author Profile
Saramma John Villoth

University of Technology and Applied Sciences Nizwa Oman

Andorra
Author Profile
John Philipose Villoth

Singidunum University Danijelova 32 Belgrade Serbia

Serbia
Author Profile
Luka Jovanovic

Singidunum University Danijelova 32 Belgrade Serbia

Serbia

Paper information

Publication year: 2025
Citations: 0
Country of publication: Andorra, Serbia
Site: Springer
