Multivariate Data Clustering for the Gaussian Mixture Model
Volume 16, Issue 1 (2005), pp. 61–74
Pub. online: 5 August 2022
Type: Research Article
Open Access
Received
1 June 2004
1 June 2004
Published
5 August 2022
5 August 2022
Abstract
This paper discusses a soft sample clustering problem for multivariate independent random data satisfying the mixture model of the Gaussian distribution. The theory recommends to estimate the parameters of model by the maximum likelihood method and to use “plug-in” approach for data clustering. Unfortunately, the calculation problem of the maximum likelihood estimate is not completely solved in multivariate case. This work proposes a new constructive a few stage procedure to solve this task. This procedure includes statistical distribution analysis of a large number of the univariate projections of observations, geometric clustering of a multivariate sample and application of EM algorithm. The results of the accuracy analysis of the proposed methods is made by means of Monte-Carlo simulation.