Synthetic Dataset Generation for Online Topic Modeling

File(s)
insight_publication.pdf (308.82 KB)
Author(s)
Belford, Mark 
MacNamee, Brian 
Greene, Derek 
URI
http://hdl.handle.net/10197/10845
Date Issued
12 April 2018
Date Available
3 July 2019 (07:45:47 UTC)
Abstract
Online topic modeling allows for the discovery of the underlying latent structure in a real-time stream of data. In the evaluation of such approaches, it is common to choose a static value for the number of topics. However, we would expect the number of topics to vary over time due to changes in the underlying structure of the data, known as concept drift and concept shift. We propose a semi-synthetic dataset generator, which can introduce concept drift and concept shift into existing annotated non-temporal datasets via user-controlled parameterization. This allows for the creation of multiple different artificial streams of data, where the “correct” number and composition of the topics are known at each point in time. We demonstrate how these generated datasets can be used as an evaluation strategy for online topic modeling approaches.
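The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' generator: the function `generate_stream`, the per-window `schedule` of topic weights, and the toy documents are all hypothetical names chosen for illustration. The sketch assumes that concept drift means a gradual change in how prevalent a topic is, and that concept shift means a topic appearing or disappearing abruptly. Because the schedule is supplied by the user, the number of active topics in each time window is known in advance.

```python
import random
from collections import defaultdict


def generate_stream(docs, schedule, window_size, seed=0):
    """Turn labeled, non-temporal documents into a list of time windows.

    docs: list of (text, topic_label) pairs.
    schedule: one dict per window mapping topic_label -> relative weight;
        a topic that is missing or has weight 0 is inactive in that window.
    Documents are sampled with replacement, so a document may repeat.
    """
    rng = random.Random(seed)
    by_topic = defaultdict(list)
    for text, label in docs:
        by_topic[label].append((text, label))

    stream = []
    for weights in schedule:
        active = [t for t, w in weights.items() if w > 0]
        for t in active:
            if not by_topic[t]:
                raise ValueError(f"no documents available for topic {t!r}")
        if not active:
            stream.append([])
            continue
        # Pick a topic for each slot in the window, proportional to its weight.
        chosen = rng.choices(active, weights=[weights[t] for t in active], k=window_size)
        stream.append([rng.choice(by_topic[t]) for t in chosen])
    return stream


# Toy annotated corpus (hypothetical data).
docs = [(f"{topic} doc {i}", topic)
        for topic in ("sport", "politics", "tech")
        for i in range(20)]

schedule = [
    {"sport": 1.0, "politics": 1.0},
    {"sport": 0.6, "politics": 1.0},   # drift: "sport" fades gradually
    {"sport": 0.2, "politics": 1.0},
    {"politics": 1.0},                 # "sport" is gone
    {"politics": 1.0, "tech": 1.0},    # shift: "tech" appears abruptly
]

stream = generate_stream(docs, schedule, window_size=50)

# Ground-truth number of topics per window, read directly from the schedule.
known_k = [sum(1 for w in weights.values() if w > 0) for weights in schedule]
print(known_k)  # [2, 2, 2, 1, 2]
```

An online topic model could then be run over `stream` window by window, and the number of topics it uses compared against `known_k`.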
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
CEUR-WS.org
Start Page
63
End Page
75
Copyright (Published Version)
2017 the Author
Keywords
  • Machine Learning & St...
  • Online topic modeling...
  • Semi-synthetic datase...
  • Parameterization

Web versions
https://dblp.org/db/conf/aics/aics2017
Language
English
Status of Item
Peer reviewed
Part of
McAuley, J., McKeever, S. (eds.). Proceedings of the 25th Irish Conference on Artificial Intelligence and Cognitive Science, Dublin, Ireland, December 7 - 8, 2017
Description
AICS 2017: 25th Irish Conference on Artificial Intelligence and Cognitive Science, Dublin, Ireland, 7 - 8 December 2017
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Insight Research Collection
