Change search
ReferencesLink to record
Permanent link

Direct link
Implementation of the MBCA Matlab Program for Model-Based Cluster Analysis
Stockholm University, Faculty of Social Sciences, Department of Statistics.
Manuscript (Other academic)
URN: urn:nbn:se:su:diva-25087OAI: diva2:198851
Part of urn:nbn:se:su:diva-7686Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2010-01-13Bibliographically approved
In thesis
1. Bayesian Cluster Analysis: Some Extensions to Non-standard Situations
Open this publication in new window or tab >>Bayesian Cluster Analysis: Some Extensions to Non-standard Situations
2008 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The Bayesian approach to cluster analysis is presented. We assume that all data stem from a finite mixture model, where each component corresponds to one cluster and is given by a multivariate normal distribution with unknown mean and variance. The method produces posterior distributions of all cluster parameters and proportions as well as associated cluster probabilities for all objects. We extend this method in several directions to some common but non-standard situations. The first extension covers the case with a few deviant observations not belonging to one of the normal clusters. An extra component/cluster is created for them, which has a larger variance or a different distribution, e.g. is uniform over the whole range. The second extension is clustering of longitudinal data. All units are clustered at all time points separately and the movements between time points are modeled by Markov transition matrices. This means that the clustering at one time point will be affected by what happens at the neighbouring time points. The third extension handles datasets with missing data, e.g. item non-response. We impute the missing values iteratively in an extra step of the Gibbs sampler estimation algorithm. The Bayesian inference of mixture models has many advantages over the classical approach. However, it is not without computational difficulties. A software package, written in Matlab for Bayesian inference of mixture models is introduced. The programs of the package handle the basic cases of clustering data that are assumed to arise from mixture models of multivariate normal distributions, as well as the non-standard situations.

Place, publisher, year, edition, pages
Stockholm: Statistiska institutionen, 2008. 162 p.
Cluster analysis, Clustering, Classification, Mixture model, Gaussian, Bayesian inference, MCMC, Gibbs sampler, Deviant group, Longitudinal, Missing data, Multiple imputation
National Category
Probability Theory and Statistics
Research subject
urn:nbn:se:su:diva-7686 (URN)978-91-7155-645-5 (ISBN)
Public defence
2008-06-04, hörsal 3, hus B, Universitetsvägen 10, Stockholm, 10:00
Available from: 2008-05-13 Created: 2008-05-09Bibliographically approved

Open Access in DiVA

No full text

By organisation
Department of Statistics

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 198 hits
ReferencesLink to record
Permanent link

Direct link