A Generative AI Based Architecture for Data Seeding in Software Testing


연구 분야: Verification



학회: International Conference on Computational Science and Its Applications


초록

In software engineering, the testing phase is critical to ensuring application quality, reliability, and adherence to defined requirements. Effective testing requires realistic, consistent data that reflects real-world conditions and exists in sufficient volumes. This makes the data seeding process indispensable. However, traditional approaches to data creation—manual or automated—face challenges such as ensuring data diversity and compliance with business rules, often resulting in inadequate test coverage with consequences in undetected bugs, reduced reliability, higher costs and compromised software quality. Thus, the goal of this paper is to present a novel generative AI-based architecture for advancing data seeding practices in software testing. The proposed architecture integrates user interaction, data embedding, and retrieval-augmented generation (RAG) to form a seamless pipeline for generating contextually relevant and realistic data in real-time. For the purposes of validating the architecture, a Proof of Concept (POC) was implemented focused on generating fake data from Brazilian customers to assess the quality of the data generated and the effectiveness of the implemented solution, in order to evaluate the potential of the architecture for the development of broader applications and in different contexts.


Author Profile
Itamir de Morais Barroca Filho

Digital Metropolis Institute Federal University of Rio Grande do Norte Natal 59078970 Brazil

Brazil
Author Profile
Ramon Santos Malaquias

Digital Metropolis Institute Federal University of Rio Grande do Norte Natal 59078970 Brazil

Brazil
Author Profile
Jean Mário Moreira de Lima

Digital Metropolis Institute Federal University of Rio Grande do Norte Natal 59078970 Brazil

Brazil

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Brazil
사이트 Springer
좋아요 수 0

연관 논문 목록 (335건)