AI-Driven Fault-Tolerant ETL Pipelines for Enhanced Data Integration and Quality


연구 분야: Databases



학회: 2025 International Research Conference on Smart Computing and Systems Engineering (SCSE)


초록

The reliability and fault tolerance of ETL (Extract, Transform, Load) pipelines are essential for maintaining data integrity in corporate environments. Traditional ETL systems often depend on manual interventions to resolve data inconsistencies, leading to errors, inefficiencies, and increased operational costs. This study introduces an AI-driven framework designed to improve the fault tolerance of ETL processes by automating data cleaning, standardization, and integration tasks. Using machine learning models, the framework reduces the need for human intervention, enhances data quality, and supports scalability across various data formats. Using real-world data sets, the proposed solution demonstrates its ability to improve operational efficiency and reduce errors within corporate data pipelines. This research addresses a crucial gap in ETL automation, offering a scalable and proactive approach to robust data integration in large-scale corporate settings. The findings highlight the ability of the framework to improve fault tolerance, improve data quality, and offer organizations a competitive advantage in managing complex data ecosystems.


Author Profile
Samantha Thelijjagoda

Faculty of Computing Sri Lanka Institute of Information Technology Malabe Sri Lanka

Sri Lanka
Author Profile
Chathurindu Kaushalya Wickramaarachchi

Faculty of Graduate Studies and Research Sri Lanka Institute of Information Technology Malabe Sri Lanka

Andorra
Author Profile
Shachini Kavindi Perera

Faculty of Graduate Studies and Research Sri Lanka Institute of Information Technology Malabe Sri Lanka

Andorra

📄 논문 정보

발행 연도 2025년
인용수 28
출판 국가 Andorra, Sri Lanka
사이트 IEEE
좋아요 수 0

연관 논문 목록 (83건)