Partitioning-Aware Performance Modeling of Distributed Graph Processing Tasks


연구 분야: Databases



학회: International Journal of Parallel Programming


초록

Much of the data being produced in large scale by modern applications represents connected entities and their relationships, that can be modeled as large graphs. In order to extract valuable information from these large datasets, several parallel and distributed graph processing engines have been proposed. These systems are designed to run in large clusters, where resources must by allocated efficiently. Aiming to handle this problem, this paper presents a performance prediction model for GPS, a popular Pregel-based graph processing framework. By leveraging a micro-partitioning technique, our system can use various partitioning algorithms that greatly reduce the execution time, comparing with the simple hash partitioning that is commonly used in graph processing systems. Experimental results show that the prediction model has accuracy close to 90%, allowing it to be used in schedulers or to estimate the cost of running graph processing tasks.


Author Profile
Daniel Presser

Federal University of Santa Catarina Florianópolis Brazil

Brazil
Author Profile
Frank Siqueira

Federal University of Santa Catarina Florianópolis Brazil

Brazil

📄 논문 정보

발행 연도 2023년
인용수 0
출판 국가 Brazil
사이트 Springer
좋아요 수 0

연관 논문 목록 (239건)