Options
Designing and implementing data warehouse for agricultural big data
Author(s)
Date Issued
2019-06-20
Date Available
2019-10-10T08:46:23Z
Abstract
In recent years, precision agriculture that uses modern information and communication technologies is becoming very popular. Raw and semi-processed agricultural data are usually collected through various sources, such as: Internet of Thing (IoT), sensors, satellites, weather stations, robots, farm equipment, farmers and agribusinesses, etc. Besides, agricultural datasets are very large, complex, unstructured, heterogeneous, non-standardized, and inconsistent. Hence, the agricultural data mining is considered as Big Data application in terms of volume, variety, velocity and veracity. It is a key foundation to establishing a crop intelligence platform, which will enable resource efficient agronomy decision making and recommendations. In this paper, we designed and implemented a continental level agricultural data warehouse by combining Hive, MongoDB and Cassandra. Our data warehouse capabilities: (1) flexible schema; (2) data integration from real agricultural multi datasets; (3) data science and business intelligent support; (4) high performance; (5) high storage; (6) security; (7) governance and monitoring; (8) consistency, availability and partition tolerant; (9) distributed and cloud deployment. We also evaluate the performance of our data warehouse.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Origin Enterprises
Type of Material
Journal Article
Publisher
Springer
Series
Lecture Notes in Computer Science (LNCS, volume 11514)
Information Systems and Applications, incl. Internet/Web, and HCI book sub series (LNISA, volume 11514)
Copyright (Published Version)
2019 Springer
Language
English
Status of Item
Peer reviewed
Journal
Chan, K., Seshadri, S., Zhang, LJ. Big Data – BigData 2019 8th International Congress, Held as Part of the Services Conference Federation, SCF 2019, San Diego, CA, USA, June 25–30, 2019, Proceedings
ISSN
0302-9743
This item is made available under a Creative Commons License
File(s)
No Thumbnail Available
Name
1905.12411v1.pdf
Size
3.65 MB
Format
Adobe PDF
Checksum (MD5)
413fbcc60322bc28a8772916456159e4
Owning collection