Informatica logo


Login Register

  1. Home
  2. Issues
  3. Volume 20, Issue 2 (2009)
  4. On a Minimal Spanning Tree Approach in t ...

Informatica

Information Submit your article For Referees Help ATTENTION!
  • Article info
  • Related articles
  • Cited by
  • More
    Article info Related articles Cited by

On a Minimal Spanning Tree Approach in the Cluster Validation Problem
Volume 20, Issue 2 (2009), pp. 187–202
Zeev Barzily   Zeev Volkovich   Başak Akteke-Öztürk   Gerhard-Wilhelm Weber  

Authors

 
Placeholder
https://doi.org/10.15388/Informatica.2009.245
Pub. online: 1 January 2009      Type: Research Article     

Received
1 August 2008
Accepted
1 December 2008
Published
1 January 2009

Abstract

In this paper, a method for the study of cluster stability is purposed. We draw pairs of samples from the data, according to two sampling distributions. The first distribution corresponds to the high density zones of data-elements distribution. Thus it is associated with the clusters cores. The second one, associated with the cluster margins, is related to the low density zones. The samples are clustered and the two obtained partitions are compared. The partitions are considered to be consistent if the obtained clusters are similar. The resemblance is measured by the total number of edges, in the clusters minimal spanning trees, connecting points from different samples. We use the Friedman and Rafsky two sample test statistic. Under the homogeneity hypothesis, this statistic is normally distributed. Thus, it can be expected that the true number of clusters corresponds to the statistic empirical distribution which is closest to normal. Numerical experiments demonstrate the ability of the approach to detect the true number of clusters.

Related articles Cited by PDF XML
Related articles Cited by PDF XML

Copyright
No copyright data available.

Keywords
clustering cluster validation minimal spanning tree two sample test

Metrics
since January 2020
777

Article info
views

0

Full article
views

544

PDF
downloads

188

XML
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

INFORMATICA

  • Online ISSN: 1822-8844
  • Print ISSN: 0868-4952
  • Copyright © 2023 Vilnius University

About

  • About journal

For contributors

  • OA Policy
  • Submit your article
  • Instructions for Referees
    •  

    •  

Contact us

  • Institute of Data Science and Digital Technologies
  • Vilnius University

    Akademijos St. 4

    08412 Vilnius, Lithuania

    Phone: (+370 5) 2109 338

    E-mail: informatica@mii.vu.lt

    https://informatica.vu.lt/journal/INFORMATICA
Powered by PubliMill  •  Privacy policy