Improving Data Quality of Low-cost IoT Sensors in Environmental Monitoring Networks Using Data Fusion and Machine Learning Approach
|Title:||Improving Data Quality of Low-cost IoT Sensors in Environmental Monitoring Networks Using Data Fusion and Machine Learning Approach||Authors:||Okafor, Nwamaka U.; Alghorani, Yahia; Delaney, Declan T.||Permanent link:||http://hdl.handle.net/10197/11855||Date:||Sep-2020||Online since:||2021-01-19T16:46:36Z||Abstract:||Environmental monitoring has become an active research area due to the current rise in the global climate change crises. Current environmental monitoring solutions, however, are characterized by high cost of acquisition and complexity of installation; often requiring extensive resources, infrastructure and expertise. It is infeasible to achieve with these solutions, high density in-situ networks such as are required to build refined scale models to facilitate robust monitoring, thus, leaving large gaps within the collected dataset. Low-Cost Sensors (LCS) can offer high-resolution spatiotemporal measurements which could be used to supplement existing dataset from current environmental monitoring solutions. LCS however, require frequent calibration in order to provide accurate and reliable data as they are often affected by environmental conditions when deployed on the field. Calibrating LCS can help to improve their data quality and ensure they are collecting accurate data. Achieving effective calibration, however, requires identifying factors that affect sensor’s data quality for a given measurement. This study evaluates the performance of three Feature Selection (FS) algorithms including Forward Feature Selection (FFS), Backward Elimination (BE) and Exhaustive Feature Selection (EFS) in identifying factors that affect data quality of low-cost IoT sensors in environmental monitoring networks. Applying the concept of data fusion, sensors data were merged with environmental factors and integrated into a single calibration equation to calibrate cairclipO3/NO2 and cairclipNO2 sensors using Linear Regression (LR) and Artificial Neural Networks (ANN). The study showed the effectiveness of calibration in improving low-cost IoT sensor data quality and also demonstrated the convenience of feature selection and the ability of data fusion to provide more consistent, accurate and reliable information for calibration models. The analysis showed that the cairclipO3/NO2 sensor provided measurements that have good correlation with reference measurements whereas the cairclipNO2 sensor showed no reasonable correlation with the reference data. Calibrating the cairclipO3/NO2 yielded good improvement in its measurement outputs when compared to reference measurements (R2=0.83). However, calibrating the cairclipNO2 sensor data yielded no significant improvement in its data quality.||Funding Details:||Environmental Protection Agency||Funding Details:||Schlumberger Foundation, Netherlands||Type of material:||Journal Article||Publisher:||Elsevier||Journal:||ICT Express||Volume:||6||Issue:||3||Start page:||220||End page:||228||Copyright (published version):||2020 The Korean Institute of Communications and Information Sciences||Keywords:||Internet of Things; Machine learning; Data fusion; Feature selection; Environmental monitoring||DOI:||10.1016/j.icte.2020.06.004||Language:||en||Status of Item:||Peer reviewed||This item is made available under a Creative Commons License:||https://creativecommons.org/licenses/by-nc-nd/3.0/ie/|
|Appears in Collections:||Electrical and Electronic Engineering Research Collection|
Show full item record
If you are a publisher or author and have copyright concerns for any item, please email firstname.lastname@example.org and the item will be withdrawn immediately. The author or person responsible for depositing the article will be contacted within one business day.