Options
Effect of Combination of HBM and Certainty Sampling on Workload of Semi-Automated Grey Literature Screening
Author(s)
Date Issued
2021-07-24
Date Available
2024-02-12T15:52:39Z
Abstract
With the rapid increase of unstructured text data, grey literature has become an important source of information to support research and innovation activities. In this paper, we propose a novel semiautomated grey literature screening approach that combines a Hierarchical BERT Model (HBM) with active learning to reduce the human workload in grey literature screening. Evaluations over three real-world grey literature datasets demonstrate that the proposed approach can save up to 64.88% of the human screening workload, while maintaining high screening accuracy. We also demonstrate how the use of the HBM model allows salient sentences within grey literature documents to be selected and highlighted to support workers in screening tasks.
Sponsorship
Teagasc
Type of Material
Conference Publication
Copyright (Published Version)
2021 the Authors
Web versions
Language
English
Status of Item
Peer reviewed
Conference Details
The 38th International Conference on Machine Learning (ICML 2021), Virtual Event, 18-24 July 2021
This item is made available under a Creative Commons License
File(s)
Loading...
Name
4ce291_f7158ffff5e342618b27ce3128e631f0.pdf
Size
389.59 KB
Format
Adobe PDF
Checksum (MD5)
9df25c0cfc3789777ac379deaf6d0f7f
Owning collection