Options
Weak Supervision for Semi-Supervised Topic Modeling via Word Embeddings
Author(s)
Date Issued
2017-06-20
Date Available
2017-07-27T10:13:39Z
Abstract
Semi-supervised algorithms have been shown to improve the results of topic modeling when applied to unstructured text corpora. However, sufficient supervision is not always available. This paper proposes a new process, Weak+, suitable for use in semi-supervised topic modeling via matrix factorization, when limited supervision is available. This process uses word embeddings to provide additional weakly-labeled data, which can result in improved topic modeling performance.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Subjects
Web versions
Language
English
Status of Item
Peer reviewed
Conference Details
LDK 2017: Language, Data and Knowledge, Galway Ireland, 19-20 June 2017
This item is made available under a Creative Commons License
File(s)
Loading...
Name
insight_publication.pdf
Size
252.79 KB
Format
Adobe PDF
Checksum (MD5)
fb99a16fb2f88c1e63bdc81e661be6ee
Owning collection
Mapped collections