ITRT(IT Research Trends)

DG-means: a superior greedy algorithm for clustering distributed data

연구 분야: Databases

논문 키워드: #research #algorithm #algorithms #web #clustering

학회: The Journal of Supercomputing

초록

Clustering divides a set of objects into several classes, where each class is composed of similar objects. Traditional centralized clustering algorithms target those objects located on the same site since they cannot perform on distributed objects. Distributed clustering algorithms, however, can fulfil this gap. They extract a classification model from the distributed objects even when they are in different sites and locations. With the trend of storing data in different locations and sites, and with the vast amount of data propagating throughout the web, it seems it will be one of the prevailing fields. Even though much research and work have been done on this topic, it is still considered in its infantry because of the challenges that are still popping up, such as bandwidth limitation, transferring data to a single site, and many others. In this work, we present DG-means, a greedy algorithm that performs on distributed data sets. Three datasets—the wholesale dataset, banknotes dataset, and Iris dataset, are used to compare multiple distributed clustering algorithms on different metrics: runtime execution, stability, and accuracy. DG-means exhibited superior performance when compared to the other algorithms.

Ramzi A. Haraty

Department of Computer Science and Mathematics Lebanese American University Beirut Lebanon

Andorra

Ali Assaf

Department of Computer Science and Mathematics Lebanese American University Beirut Lebanon

Andorra

📄 논문 정보

발행 연도	2023년
인용수	0
출판 국가	Andorra
사이트	Springer
좋아요 수	0

DG-means: a superior greedy algorithm for clustering distributed data

DG-means: a superior greedy algorithm for clustering distributed data

📄 논문 정보

연관 논문 목록 (374건) 내 서재 담기

연관 논문 목록 (374건)