Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
  • Colleges & Schools
  • Statistics
  • All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Institutes and Centres
  3. Insight Centre for Data Analytics
  4. Insight Research Collection
  5. Performance Evaluation of a Distributed Clustering Approach for Spatial Datasets
 
  • Details
Options

Performance Evaluation of a Distributed Clustering Approach for Spatial Datasets

File(s)
FileDescriptionSizeFormat
Download insight_publication2.pdf641.46 KB
Author(s)
Bendechache, Malika 
Le-Khac, Nhien-An 
Kechadi, Tahar 
Uri
http://hdl.handle.net/10197/10848
Date Issued
20 August 2017
Date Available
03T11:03:58Z July 2019
Abstract
The analysis of big data requires powerful, scalable, and accurate data analytics techniques that the traditional data mining and machine learning do not have as a whole. Therefore, new data analytics frameworks are needed to deal with the big data challenges such as volumes, velocity, veracity, variety of the data. Distributed data mining constitutes a promising approach for big data sets, as they are usually produced in distributed locations, and processing them on their local sites will reduce significantly the response times, communications, etc. In this paper, we propose to study the performance of a distributed clustering, called Dynamic Distributed Clustering (DDC). DDC has the ability to remotely generate clusters and then aggregate them using an efficient aggregation algorithm. The technique is developed for spatial datasets. We evaluated the DDC using two types of communications (synchronous and asynchronous), and tested using various load distributions. The experimental results show that the approach has super-linear speed-up, scales up very well, and can take advantage of the recent programming models, such as MapReduce model, as its results are not affected by the types of communications.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
Springer
Start Page
38
End Page
56
Series
Communications in Computer and Information Science book series (CCIS)
Volume 845
Copyright (Published Version)
2018 Springer
Keywords
  • Distributed data mini...

  • Distributed computing...

  • Synchronous communica...

  • Asynchronous communic...

  • Spacial data mining

  • Super-speedup

DOI
10.1007/978-981-13-0292-3_3
Language
English
Status of Item
Peer reviewed
Part of
Boo, Y.L., Stirling, D., Chi, L., Liu, L., Ong, K.-L., Williams, G. (eds.). Data Mining 15th Australasian Conference, AusDM 2017, Melbourne, VIC, Australia, August 19-20, 2017, Revised Selected Papers
Description
AusDM 2017: 15th Australasian Conference, Melbourne, VIC, Australia, 19-20 August 2017
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Insight Research Collection
Scopus© citations
2
Acquisition Date
Jan 26, 2023
View Details
Views
594
Acquisition Date
Jan 26, 2023
View Details
Downloads
290
Acquisition Date
Jan 26, 2023
View Details
google-scholar
University College Dublin Research Repository UCD
The Library, University College Dublin, Belfield, Dublin 4
Phone: +353 (0)1 716 7583
Fax: +353 (0)1 283 7667
Email: mailto:research.repository@ucd.ie
Guide: http://libguides.ucd.ie/rru

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement