Weak Supervision for Semi-Supervised Topic Modeling via Word Embeddings

Files in This Item:
File Description SizeFormat 
insight_publication.pdf252.79 kBAdobe PDFDownload
Title: Weak Supervision for Semi-Supervised Topic Modeling via Word Embeddings
Authors: Conheady, Gerald
Greene, Derek
Permanent link: http://hdl.handle.net/10197/8691
Date: 20-Jun-2017
Abstract: Semi-supervised algorithms have been shown to improve the results of topic modeling when applied to unstructured text corpora. However, sufficient supervision is not always available. This paper proposes a new process, Weak+, suitable for use in semi-supervised topic modeling via matrix factorization, when limited supervision is available. This process uses word embeddings to provide additional weakly-labeled data, which can result in improved topic modeling performance.
Funding Details: Science Foundation Ireland
Type of material: Journal Article
Keywords: Machine learning;Statistics
Language: en
Status of Item: Peer reviewed
Conference Details: LDK 2017: Language, Data and Knowledge, Galway Ireland, 19-20 June 2017
Appears in Collections:Computer Science Research Collection
Insight Research Collection

Show full item record

Download(s) 50

38
checked on May 25, 2018

Google ScholarTM

Check


This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. For other possible restrictions on use please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.