University College Dublin

Clustering South African households based on their asset status using latent variable models

Author(s)
McParland, Damien  
Gormley, Isobel Claire  
McCormick, Tyler H.  
et al.  
URI
http://hdl.handle.net/10197/7094
Date Issued
2014-06
Date Available
2015-09-24T09:57:46Z
Abstract
The Agincourt Health and Demographic Surveillance System has since 2001 conducted a biannual household asset survey in order to quantify household socio-economic status (SES) in a rural population living in northeast South Africa. The survey contains binary, ordinal and nominal items. In the absence of income or expenditure data, the SES landscape in the study population is explored and described by clustering the households into homogeneous groups based on their asset status. A model-based approach to clustering the Agincourt households, based on latent variable models, is proposed. In the case of modeling binary or ordinal items, item response theory models are employed. For nominal survey items, a factor analysis model, similar in nature to a multinomial probit model, is used. Both model types have an underlying latent variable structure; this similarity is exploited and the models are combined to produce a hybrid model capable of handling mixed data types. Further, a mixture of the hybrid models is considered to provide clustering capabilities within the context of mixed binary, ordinal and nominal response data. The proposed model is termed a mixture of factor analyzers for mixed data (MFA-MD). The MFA-MD model is applied to the survey data to cluster the Agincourt households into homogeneous groups. The model is estimated within the Bayesian paradigm, using a Markov chain Monte Carlo algorithm. Intuitive groupings result, providing insight into the different socio-economic strata within the Agincourt region.
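The shared latent variable structure described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' implementation and does not perform the Bayesian MCMC estimation; it only generates mixed binary, ordinal and nominal responses from a generic one-factor threshold model with cluster-specific parameters. The function names, cut points, cluster weights and all numeric values are hypothetical and chosen purely for illustration.

```python
import random


def ordinal_response(z, cuts):
    """Map a latent value z to an ordinal level: the number of cut points below z.

    A binary item is the special case of a single cut point.
    """
    return sum(z > c for c in cuts)


def nominal_response(utilities):
    """Map K-1 latent utilities to one of K categories (0 is the baseline).

    Mirrors a multinomial-probit-style rule: baseline if every utility is
    negative, otherwise the category with the largest utility.
    """
    best = max(range(len(utilities)), key=lambda k: utilities[k])
    return 0 if utilities[best] < 0 else best + 1


def simulate_household(rng, weights, clusters):
    """Draw one household: pick a cluster, then generate its item responses."""
    g = rng.choices(range(len(weights)), weights=weights)[0]
    theta = rng.gauss(0.0, 1.0)  # household-level latent trait
    responses = []
    for mu, lam, cuts in clusters[g]["ordinal"]:
        z = mu + lam * theta + rng.gauss(0.0, 1.0)  # latent value behind the item
        responses.append(ordinal_response(z, cuts))
    for item in clusters[g]["nominal"]:
        utilities = [mu + lam * theta + rng.gauss(0.0, 1.0) for mu, lam in item]
        responses.append(nominal_response(utilities))
    return g, responses


# Hypothetical parameters: two clusters, each with one binary item, one
# three-level ordinal item and one three-category nominal item.
# Ordinal entries are (intercept, loading, cut points); nominal entries are
# lists of (intercept, loading), one pair per non-baseline category.
WEIGHTS = [0.6, 0.4]
CLUSTERS = [
    {"ordinal": [(-1.0, 0.8, [0.0]), (-0.5, 0.5, [0.0, 1.0])],
     "nominal": [[(-0.5, 0.3), (-1.0, 0.6)]]},
    {"ordinal": [(1.0, 0.8, [0.0]), (1.0, 0.5, [0.0, 1.0])],
     "nominal": [[(0.2, 0.3), (1.0, 0.6)]]},
]

rng = random.Random(2014)
sample = [simulate_household(rng, WEIGHTS, CLUSTERS) for _ in range(5)]
```

Each element of `sample` is a pair of a cluster label and a list of three responses. Clustering in the sense of the abstract would run in the opposite direction, inferring the cluster labels and parameters from the observed responses.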
Sponsorship
Science Foundation Ireland
Other Sponsorship
NIH grants
Google Faculty Research Award
Type of Material
Journal Article
Publisher
Institute of Mathematical Statistics (IMS)
Journal
Annals of Applied Statistics
Volume
8
Issue
2
Start Page
747
End Page
776
Subjects
Clustering
Mixed data
Item response theory
Metropolis-within-Gib...
DOI
10.1214/14-AOAS726
Web versions
http://arxiv.org/abs/1401.5343
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
File(s)
Name
McParlandEtAl.pdf
Size
5.86 MB
Format
Adobe PDF
Checksum (MD5)
95eb922b116d18ce67c34b7bdaf8f4cc

Owning collection
Mathematics and Statistics Research Collection
Mapped collections
Insight Research Collection

Item descriptive metadata is released under a CC-0 (public domain) license: https://creativecommons.org/public-domain/cc0/.
All other content is subject to copyright.
