Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Bayesian Cluster Analysis: Some Extensions to Non-standard Situations
Stockholm University, Faculty of Social Sciences, Department of Statistics.
2008 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The Bayesian approach to cluster analysis is presented. We assume that all data stem from a finite mixture model, where each component corresponds to one cluster and is given by a multivariate normal distribution with unknown mean and variance. The method produces posterior distributions of all cluster parameters and proportions as well as associated cluster probabilities for all objects. We extend this method in several directions to some common but non-standard situations. The first extension covers the case with a few deviant observations not belonging to one of the normal clusters. An extra component/cluster is created for them, which has a larger variance or a different distribution, e.g. is uniform over the whole range. The second extension is clustering of longitudinal data. All units are clustered at all time points separately and the movements between time points are modeled by Markov transition matrices. This means that the clustering at one time point will be affected by what happens at the neighbouring time points. The third extension handles datasets with missing data, e.g. item non-response. We impute the missing values iteratively in an extra step of the Gibbs sampler estimation algorithm. The Bayesian inference of mixture models has many advantages over the classical approach. However, it is not without computational difficulties. A software package, written in Matlab for Bayesian inference of mixture models is introduced. The programs of the package handle the basic cases of clustering data that are assumed to arise from mixture models of multivariate normal distributions, as well as the non-standard situations.

Place, publisher, year, edition, pages
Stockholm: Statistiska institutionen , 2008. , 162 p.
Keyword [en]
Cluster analysis, Clustering, Classification, Mixture model, Gaussian, Bayesian inference, MCMC, Gibbs sampler, Deviant group, Longitudinal, Missing data, Multiple imputation
National Category
Probability Theory and Statistics
Research subject
Statistics
Identifiers
URN: urn:nbn:se:su:diva-7686ISBN: 978-91-7155-645-5 (print)OAI: oai:DiVA.org:su-7686DiVA: diva2:198852
Public defence
2008-06-04, hörsal 3, hus B, Universitetsvägen 10, Stockholm, 10:00
Opponent
Supervisors
Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18Bibliographically approved
List of papers
1. Bayesian Inference for a Mixture Moddel using the Gibbs Sampler
Open this publication in new window or tab >>Bayesian Inference for a Mixture Moddel using the Gibbs Sampler
(English)Manuscript (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-25083 (URN)
Note

Part of urn:nbn:se:su:diva-7686

Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18
2. Classification with the Possibility of a Deviant Group: An Approach to Twelve-Year-Old Students
Open this publication in new window or tab >>Classification with the Possibility of a Deviant Group: An Approach to Twelve-Year-Old Students
(English)In: Multivariate Behavioral ResearchArticle in journal (Refereed) Submitted
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-25084 (URN)
Note

Part of urn:nbn:se:su:diva-7686

Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18
3. Successive Clustering of Longitudinal Data: A Bayesian Approach
Open this publication in new window or tab >>Successive Clustering of Longitudinal Data: A Bayesian Approach
(English)Manuscript (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-25085 (URN)
Note

Part of urn:nbn:se:su:diva-7686

Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18
4. Longitudinal, Model-Based Clustering with Missing Data
Open this publication in new window or tab >>Longitudinal, Model-Based Clustering with Missing Data
(English)Manuscript (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-25086 (URN)
Note

Part of urn:nbn:se:su:diva-7686

Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18
5. Implementation of the MBCA Matlab Program for Model-Based Cluster Analysis
Open this publication in new window or tab >>Implementation of the MBCA Matlab Program for Model-Based Cluster Analysis
(English)Manuscript (Other academic)
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-25087 (URN)
Note

Part of urn:nbn:se:su:diva-7686

Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18

Open Access in DiVA

fulltext(232 kB)3536 downloads
File information
File name FULLTEXT01.pdfFile size 232 kBChecksum SHA-1
21d04d29f3bfd96dcaa44a86bbe2df3057f0185e60fca54a1ff41af225bf5654fbe2b3d2
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Franzén, Jessica
By organisation
Department of Statistics
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 3536 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1371 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf