Performance Evaluation of a Distributed Clustering Approach for Spatial Datasets

Files in This Item:
File Description SizeFormat 
insight_publication2.pdf641.46 kBAdobe PDFDownload
Title: Performance Evaluation of a Distributed Clustering Approach for Spatial Datasets
Authors: Bendechache, MalikaLe-Khac, Nhien-AnKechadi, Tahar
Permanent link: http://hdl.handle.net/10197/10848
Date: 20-Aug-2017
Online since: 2019-07-03T11:03:58Z
Abstract: The analysis of big data requires powerful, scalable, and accurate data analytics techniques that the traditional data mining and machine learning do not have as a whole. Therefore, new data analytics frameworks are needed to deal with the big data challenges such as volumes, velocity, veracity, variety of the data. Distributed data mining constitutes a promising approach for big data sets, as they are usually produced in distributed locations, and processing them on their local sites will reduce significantly the response times, communications, etc. In this paper, we propose to study the performance of a distributed clustering, called Dynamic Distributed Clustering (DDC). DDC has the ability to remotely generate clusters and then aggregate them using an efficient aggregation algorithm. The technique is developed for spatial datasets. We evaluated the DDC using two types of communications (synchronous and asynchronous), and tested using various load distributions. The experimental results show that the approach has super-linear speed-up, scales up very well, and can take advantage of the recent programming models, such as MapReduce model, as its results are not affected by the types of communications.
Funding Details: Science Foundation Ireland
Type of material: Conference Publication
Publisher: Springer
Start page: 38
End page: 56
Series/Report no.: Communications in Computer and Information Science book series (CCIS); Volume 845
Copyright (published version): 2018 Springer
Keywords: Distributed data miningDistributed computingSynchronous communicationAsynchronous communicationSpacial data miningSuper-speedup
DOI: 10.1007/978-981-13-0292-3_3
Language: en
Status of Item: Peer reviewed
Is part of: Boo, Y.L., Stirling, D., Chi, L., Liu, L., Ong, K.-L., Williams, G. (eds.). Data Mining 15th Australasian Conference, AusDM 2017, Melbourne, VIC, Australia, August 19-20, 2017, Revised Selected Papers
Conference Details: AusDM 2017: 15th Australasian Conference, Melbourne, VIC, Australia, 19-20 August 2017
Appears in Collections:Insight Research Collection

Show full item record

Google ScholarTM

Check

Altmetric


This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. For other possible restrictions on use please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.