Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Classification with the Possibility of a Deviant Group: An Approach to Twelve-Year-Old Students
Stockholm University, Faculty of Social Sciences, Department of Statistics.
(English)In: Multivariate Behavioral ResearchArticle in journal (Refereed) Submitted
National Category
Probability Theory and Statistics
Identifiers
URN: urn:nbn:se:su:diva-25084OAI: oai:DiVA.org:su-25084DiVA: diva2:198848
Note

Part of urn:nbn:se:su:diva-7686

Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18
In thesis
1. Bayesian Cluster Analysis: Some Extensions to Non-standard Situations
Open this publication in new window or tab >>Bayesian Cluster Analysis: Some Extensions to Non-standard Situations
2008 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The Bayesian approach to cluster analysis is presented. We assume that all data stem from a finite mixture model, where each component corresponds to one cluster and is given by a multivariate normal distribution with unknown mean and variance. The method produces posterior distributions of all cluster parameters and proportions as well as associated cluster probabilities for all objects. We extend this method in several directions to some common but non-standard situations. The first extension covers the case with a few deviant observations not belonging to one of the normal clusters. An extra component/cluster is created for them, which has a larger variance or a different distribution, e.g. is uniform over the whole range. The second extension is clustering of longitudinal data. All units are clustered at all time points separately and the movements between time points are modeled by Markov transition matrices. This means that the clustering at one time point will be affected by what happens at the neighbouring time points. The third extension handles datasets with missing data, e.g. item non-response. We impute the missing values iteratively in an extra step of the Gibbs sampler estimation algorithm. The Bayesian inference of mixture models has many advantages over the classical approach. However, it is not without computational difficulties. A software package, written in Matlab for Bayesian inference of mixture models is introduced. The programs of the package handle the basic cases of clustering data that are assumed to arise from mixture models of multivariate normal distributions, as well as the non-standard situations.

Place, publisher, year, edition, pages
Stockholm: Statistiska institutionen, 2008. 162 p.
Keyword
Cluster analysis, Clustering, Classification, Mixture model, Gaussian, Bayesian inference, MCMC, Gibbs sampler, Deviant group, Longitudinal, Missing data, Multiple imputation
National Category
Probability Theory and Statistics
Research subject
Statistics
Identifiers
urn:nbn:se:su:diva-7686 (URN)978-91-7155-645-5 (ISBN)
Public defence
2008-06-04, hörsal 3, hus B, Universitetsvägen 10, Stockholm, 10:00
Opponent
Supervisors
Available from: 2008-05-13 Created: 2008-05-09 Last updated: 2016-11-18Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Franzén, Jessica
By organisation
Department of Statistics
Probability Theory and Statistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 90 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf