Finding Niche Topics using Semi-Supervised Topic Modeling via Word Embeddings

File(s)
insight_publication.pdf (1.04 MB)
Author(s)
Conheady, Gerald 
Greene, Derek 
URI
http://hdl.handle.net/10197/10853
Date Issued
31 July 2017
Date Available
8 July 2019
Abstract
Topic modeling techniques generally focus on the discovery of the predominant thematic structures in text corpora. In contrast, a niche topic is made up of a small number of documents related to a common theme. Such a topic may have so few documents relative to the overall corpus size that it fails to be identified when using standard techniques. This paper proposes a new process, called Niche+, for finding these kinds of niche topics. It assumes interactions with a user who can provide a strictly limited level of supervision, which is subsequently employed in semi-supervised matrix factorization. Furthermore, word embeddings are used to provide additional weakly-labeled data. Experimental results show that documents in niche topics can be successfully identified using Niche+. These results are further supported via a use case that explores a real-world company email database.
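The abstract mentions that word embeddings supply additional weakly-labeled data on top of a small amount of user supervision. The sketch below illustrates that general idea only and is not the Niche+ procedure from the paper: it expands a user-supplied seed term to its nearest neighbours by cosine similarity and then flags documents containing any expanded term. The function names, the toy two-dimensional vectors, and the example documents are all hypothetical; real embeddings would come from a trained model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def expand_seed_terms(seed_terms, embeddings, top_k=2):
    """Return the seed terms plus each seed's top_k nearest neighbours."""
    expanded = set(seed_terms)
    for term in seed_terms:
        if term not in embeddings:
            continue  # a seed with no embedding cannot be expanded
        neighbours = sorted(
            (w for w in embeddings if w != term),
            key=lambda w: cosine(embeddings[term], embeddings[w]),
            reverse=True,
        )
        expanded.update(neighbours[:top_k])
    return expanded


def weak_label_documents(documents, terms):
    """Return indices of documents containing at least one of the terms."""
    return [
        i for i, doc in enumerate(documents)
        if terms & set(doc.lower().split())
    ]


# Hypothetical hand-made vectors, used only to make the example runnable.
toy_embeddings = {
    "merger": [1.0, 0.1],
    "acquisition": [0.9, 0.2],
    "takeover": [0.95, 0.05],
    "lunch": [0.0, 1.0],
    "meeting": [0.1, 0.9],
}
documents = [
    "Notes on the proposed takeover bid",
    "Lunch meeting moved to Friday",
    "Acquisition paperwork attached",
]

terms = expand_seed_terms(["merger"], toy_embeddings, top_k=2)
weak_labels = weak_label_documents(documents, terms)
print(sorted(terms), weak_labels)
```

In a semi-supervised setting, documents flagged this way could serve as extra, lower-confidence labels alongside the few documents a user marks directly. How such labels enter the matrix factorization is described in the paper itself and is not reproduced here.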
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
CEUR-WS.org
Start Page
36
End Page
48
Keywords
  • Modeling techniques
  • Niche+
  • Word embeddings
  • Text corpus explorati...
  • Topic modeling

Web versions
https://dblp.org/db/conf/aics/aics2017
Language
English
Status of Item
Peer reviewed
Part of
McAuley, J., McKeever, S. (eds.). Proceedings of the 25th Irish Conference on Artificial Intelligence and Cognitive Science, Dublin, Ireland, December 7 - 8, 2017. CEUR Workshop Proceedings 2086, CEUR-WS.org 2018
Description
AICS 2017: 25th Irish Conference on Artificial Intelligence and Cognitive Science, Dublin, Ireland, 7-8 December 2017
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Insight Research Collection