Belford, MarkMarkBelfordMacNamee, BrianBrianMacNameeGreene, DerekDerekGreene2017-02-132017-02-132016 the A2016-09-21http://hdl.handle.net/10197/833624th Irish Conference on Artificial Intelligence and Cognitive Science (AICS'16), Dublin, Ireland, 20-21 September 2016Topic models can provide us with an insight into the underlying latent structure of a large corpus of documents, facilitating knowledge discovery and information summarization. A range of methods have been proposed in the literature, including probabilistic topic models and techniques based on matrix factorization. However, these methods tend to have stochastic elements in their initialization, which can lead to their output being unstable. That is, if a topic modeling algorithm is applied to the same data multiple times, the output will not necessarily always be the same. With this idea of stability in mind we ask the question – how can we produce a definitive topic model that is both stable and accurate? To address this, we propose a new ensemble topic modeling method, based on Non-negative Matrix Factorization (NMF), which combines a collection of unstable topic models to produce a definitive output. We evaluate this method on an annotated tweet corpus, where we show that this new approach is more accurate and stable than traditional NMF.enMachine learningStatisticsEnsemble Topic Modeling via Matrix FactorizationConference Publication17512016-11-14https://creativecommons.org/licenses/by-nc-nd/3.0/ie/