Online Trans-dimensional von Mises-Fisher Mixture Models for User Profiles

Files in This Item:
File Description SizeFormat 
insight_publication.pdf6.66 MBAdobe PDFDownload
Title: Online Trans-dimensional von Mises-Fisher Mixture Models for User Profiles
Authors: Qin, Xiangju
Cunningham, Pádraig
Salter-Townshend, Michael
Permanent link: http://hdl.handle.net/10197/8155
Date: 2016
Abstract: The proliferation of online communities has attracted much attention to modelling user behaviour in terms of social interaction, language adoption and contribution activity. Nevertheless, when applied to large-scale and cross-platform behavioural data, existing approaches generally suffer from expressiveness, scalability and generality issues. This paper proposes trans-dimensional von Mises-Fisher (TvMF) mixture models for L2 normalised behavioural data, which encapsulate: (1)a Bayesian framework for vMF mixtures that enables prior knowledge and information sharing among clusters, (2) an extended version of reversible jump MCMC algorithm that allows adaptivechanges in the number of clusters for vMF mixtures when the model parameters are updated, and (3)an online TvMF mixture model that accommodates the dynamics of clusters for time-varying user behavioural data. We develop efficient collapsed Gibbs sampling techniques for posterior inference,which facilitates parallelism for parameter updates. Empirical results on simulated and real-world data show that the proposed TvMF mixture models can discover more interpretable and intuitive clusters than other widely-used models, such as k-means, non-negative matrix factorization (NMF), Dirichlet process Gaussian mixture models (DP-GMM), and dynamic topic models (DTM). Wefurther evaluate the performance of proposed models in real-world applications, such as the churn prediction task, that shows the usefulness of the features generated.
Funding Details: Science Foundation Ireland
Type of material: Journal Article
Publisher: Journal of Machine Learning Research
Journal: Journal of Machine Learning Research
Volume: 17
Issue: 200
Start page: 1
1
End page: 51
51
Keywords: Machine learningStatistics
Other versions: http://www.jmlr.org/papers/v17/15-454.html
Language: en
Status of Item: Peer reviewed
Appears in Collections:Insight Research Collection

Show full item record

Google ScholarTM

Check


This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland. No item may be reproduced for commercial purposes. For other possible restrictions on use please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.