Distributed Spatial Data Clustering as a New Approach for Big Data Analysis

Files in This Item:
File: insight_publication.pdf (739.72 kB, Adobe PDF)
Title: Distributed Spatial Data Clustering as a New Approach for Big Data Analysis
Authors: Bendechache, Malika; Le-Khac, Nhien-An; Kechadi, Tahar
Permanent link: http://hdl.handle.net/10197/9649
Date: 20-Aug-2017
Online since: 2019-03-21T15:28:48Z
Abstract: In this paper we propose a new approach for Big Data mining and analysis. The approach works well on distributed datasets and addresses the data clustering task of the analysis. It consists of two main phases: the first phase executes a clustering algorithm on local data, assuming that the datasets were already distributed among the system's processing nodes. The second phase aggregates the local clusters to generate global clusters. This approach not only generates local clusters on each processing node in parallel, but also facilitates the formation of global clusters without prior knowledge of the number of clusters, which many partitioning clustering algorithms require. In this study, the approach was applied to spatial datasets. The proposed aggregation phase is very efficient and does not involve the exchange of large amounts of data between the processing nodes. The experimental results show that the approach has super-linear speed-up, scales up very well, and can take advantage of recent programming models, such as the MapReduce model, as its results are not affected by the types of communications.
Funding Details: Science Foundation Ireland
Type of material: Conference Publication
Publisher: Springer
Journal: Communications in Computer and Information Science
Volume: 845
Start page: 38
End page: 56
Copyright (published version): 2018 Springer Nature Singapore
Keywords: Distributed data mining; Distributed computing; Synchronous communication; Asynchronous communication; Super-speedup; Spatial data mining
DOI: 10.1007/978-981-13-0292-3
Other versions: http://ausdm17.azurewebsites.net/
https://arxiv.org/
Language: en
Status of Item: Peer reviewed
Conference Details: The 15th Australasian Data Mining Conference, Melbourne, Australia, 19-20 August 2017
This item is made available under a Creative Commons License: https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Appears in Collections: Insight Research Collection
