Options
Infinite Mixtures of Infinite Factor Analysers
Date Issued
2020-09
Date Available
2024-02-12T16:17:08Z
Abstract
Factor-analytic Gaussian mixtures are often employed as a model-based approach to clustering high-dimensional data. Typically, the numbers of clusters and latent factors must be fixed in advance of model fitting. The pair which optimises some model selection criterion is then chosen. For computational reasons, having the number of factors differ across clusters is rarely considered. Here the infinite mixture of infinite factor analysers (IMIFA) model is introduced. IMIFA employs a Pitman-Yor process prior to facilitate automatic inference of the number of clusters using the stick-breaking construction and a slice sampler. Automatic inference of the cluster-specific numbers of factors is achieved using multiplicative gamma process shrinkage priors and an adaptive Gibbs sampler. IMIFA is presented as the flagship of a family of factor-analytic mixtures. Applications to benchmark data, metabolomic spectral data, and a handwritten digit example illustrate the IMIFA model’s advantageous features. These include obviating the need for model selection criteria, reducing the computational burden associated with the search of the model space, improving clustering performance by allowing cluster-specific numbers of factors, and uncertainty quantification.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Journal Article
Publisher
International Society for Bayesian Analysis
Journal
Bayesian Analysis
Volume
15
Issue
3
Start Page
937
End Page
963
Copyright (Published Version)
2020 International Society for Bayesian Analysis
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
File(s)
Loading...
Name
Infinite Mixtures of Infinite Factor Analysers.pdf
Size
1.75 MB
Format
Adobe PDF
Checksum (MD5)
c08ed2d3f891fbfab9cad7a40f32a60f
Owning collection
Mapped collections