Improving Data Quality of Low-cost IoT Sensors in Environmental Monitoring Networks Using Data Fusion and Machine Learning Approach

Files in This Item:
File Description SizeFormat 
Okafor2020.pdf1.06 MBAdobe PDFDownload
Title: Improving Data Quality of Low-cost IoT Sensors in Environmental Monitoring Networks Using Data Fusion and Machine Learning Approach
Authors: Okafor, Nwamaka U.Alghorani, YahiaDelaney, Declan T.
Permanent link:
Date: Sep-2020
Online since: 2021-01-19T16:46:36Z
Abstract: Environmental monitoring has become an active research area due to the current rise in the global climate change crises. Current environmental monitoring solutions, however, are characterized by high cost of acquisition and complexity of installation; often requiring extensive resources, infrastructure and expertise. It is infeasible to achieve with these solutions, high density in-situ networks such as are required to build refined scale models to facilitate robust monitoring, thus, leaving large gaps within the collected dataset. Low-Cost Sensors (LCS) can offer high-resolution spatiotemporal measurements which could be used to supplement existing dataset from current environmental monitoring solutions. LCS however, require frequent calibration in order to provide accurate and reliable data as they are often affected by environmental conditions when deployed on the field. Calibrating LCS can help to improve their data quality and ensure they are collecting accurate data. Achieving effective calibration, however, requires identifying factors that affect sensor’s data quality for a given measurement. This study evaluates the performance of three Feature Selection (FS) algorithms including Forward Feature Selection (FFS), Backward Elimination (BE) and Exhaustive Feature Selection (EFS) in identifying factors that affect data quality of low-cost IoT sensors in environmental monitoring networks. Applying the concept of data fusion, sensors data were merged with environmental factors and integrated into a single calibration equation to calibrate cairclipO3/NO2 and cairclipNO2 sensors using Linear Regression (LR) and Artificial Neural Networks (ANN). The study showed the effectiveness of calibration in improving low-cost IoT sensor data quality and also demonstrated the convenience of feature selection and the ability of data fusion to provide more consistent, accurate and reliable information for calibration models. The analysis showed that the cairclipO3/NO2 sensor provided measurements that have good correlation with reference measurements whereas the cairclipNO2 sensor showed no reasonable correlation with the reference data. Calibrating the cairclipO3/NO2 yielded good improvement in its measurement outputs when compared to reference measurements (R2=0.83). However, calibrating the cairclipNO2 sensor data yielded no significant improvement in its data quality.
Funding Details: Environmental Protection Agency
Funding Details: Schlumberger Foundation, Netherlands
Type of material: Journal Article
Publisher: Elsevier
Journal: ICT Express
Volume: 6
Issue: 3
Start page: 220
End page: 228
Copyright (published version): 2020 The Korean Institute of Communications and Information Sciences
Keywords: Internet of ThingsMachine learningData fusionFeature selectionEnvironmental monitoring
DOI: 10.1016/j.icte.2020.06.004
Language: en
Status of Item: Peer reviewed
This item is made available under a Creative Commons License:
Appears in Collections:Electrical and Electronic Engineering Research Collection

Show full item record

Page view(s)

Last Week
Last month
checked on Feb 25, 2021


checked on Feb 25, 2021

Google ScholarTM



If you are a publisher or author and have copyright concerns for any item, please email and the item will be withdrawn immediately. The author or person responsible for depositing the article will be contacted within one business day.