Data Lakehouse for Time Series Data: A Systematic Literature Review


연구 분야: Databases



학회: 2024 IEEE International Conference on Big Data (BigData)


초록

As data continues to grow exponentially, the fields of data management and analytics must evolve to ensure efficient data ingestion, knowledge extraction, and scalability. The Data Lakehouse architecture, which combines the best features of Data Warehouses and Data Lakes, has emerged as a potential solution. However, to fully leverage the capabilities of Data Lakehouses for time series data, it is crucial to understand the unique challenges and opportunities they present. This literature review examines proposed Data Lakehouse architectures specifically for time series data, exploring their implementation, the software technologies used, and potential real-world applications. The focus is on comparing these architectures to identify the most suitable technologies for similar implementations. Through an in-depth analysis, this study emphasizes the importance of optimizing configurations to enhance system performance and scalability, particularly for data analysis and artificial intelligence (AI) workloads.


Author Profile
Matthias Pohl

German Aerospace Center (DLR) Institute of Data Science Jena Germany

Germany
Author Profile
Nathira Dharindri Wijemanne

German Aerospace Center (DLR) Institute of Data Science Jena Germany

Germany
Author Profile
Daniel Staegemann

Faculty of Computer Science Otto von Guericke University Magdeburg Germany

Germany

📄 논문 정보

발행 연도 2024년
인용수 340
출판 국가 Germany
사이트 IEEE
좋아요 수 0

연관 논문 목록 (112건)