Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
    Colleges & Schools
    Statistics
    All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. College of Science
  3. School of Computer Science
  4. Computer Science Research Collection
  5. BigDataNetSim: A Simulator for Data and Process Placement in Large Big Data Platforms
 
  • Details
Options

BigDataNetSim: A Simulator for Data and Process Placement in Large Big Data Platforms

Author(s)
Batista de Almeida, Leandro  
Cunha de Almeida, Eduardo  
Murphy, John  
De Grande, Robson E.  
Ventresque, Anthony  
Uri
http://hdl.handle.net/10197/10594
Date Issued
2018-10-17
Date Available
2019-05-22T07:54:46Z
Abstract
Big Data platforms are convoluted distributed systems which commonly comprise skill- and labour-intensive solution development to treat inherent Big Data application challenges. Several tools have been proposed to help developers and engineers to overcome the involved complexities in coordinating the execution of plenty processes/threads on multiple machines. However, no work so far has been able to combine both an accurate representation of Big Data jobs and realistic modeling of the behaviour of Big Data platforms at scale, including networking elements and data and job placement. In this paper, we propose BigDataNetSim, the first simulator which models accurately all the main components of the data movements in Big Data platforms (e.g., HDFS, YARN/MapReduce, network topologies, switching/routing protocols) in a large scale system. BigDataNetSim can serve as a valuable tool for engineering Big Data solutions, which includes set-up of systems, prototyping of jobs, and improvement of components/algorithms for Big Data platforms. We also demonstrate that BigDataNetSim can simulate a real Hadoop cluster with a high degree of accuracy in terms of data and job placements, being able to scale up to very large systems.
Type of Material
Conference Publication
Publisher
IEEE
Copyright (Published Version)
2018 IEEE
Subjects

Big data

Hadoop

Simulation

Task analysis

Data models

Tools

Network topology

Protocols

YARN

DOI
10.1109/DISTRA.2018.8601018
978-1-5386-5048-6
Web versions
http://ds-rt.com/2018/
Language
English
Status of Item
Not peer reviewed
Journal
Besada, E., Polo. O.R., De Grande, R., Risco, J.L. (eds.). Proceedings of the 2018 IEEE/ACM 22nd International Symposium on Distributed Simulation and Real Time Applications (DS-RT) October 15-17, 2018, Madrid, Spain
Conference Details
The 2018 IEEE/ACM 22nd International Symposium on Distributed Simulation and Real Time Applications (DS-RT)
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
File(s)
Loading...
Thumbnail Image
Name

SimulatorPaper2018.pdf

Size

610.9 KB

Format

Adobe PDF

Checksum (MD5)

c7ed2d1388ff928c702ce567f5bd1fc4

Owning collection
Computer Science Research Collection

Item descriptive metadata is released under a CC-0 (public domain) license: https://creativecommons.org/public-domain/cc0/.
All other content is subject to copyright.

For all queries please contact research.repository@ucd.ie.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement