Volume 32, Issue 1 (2021), pp. 195–216
In this paper, the CODAS (Combinative Distance-based Assessment) is utilized to address some MAGDM issues by using picture 2-tuple linguistic numbers (P2TLNs). At first, some essential concepts of picture 2-tuple linguistic sets (P2TLSs) are briefly reviewed. Then, the CODAS method with P2TLNs is constructed and all calculating procedures are simply depicted. Eventually, an empirical application of green supplier selection has been offered to demonstrate this novel method and some comparative analysis between the CODAS method with P2TLNs and several methods are also made to confirm the merits of the developed method.
Pub. online:1 Jan 2017Type:Research ArticleOpen Access
Volume 28, Issue 2 (2017), pp. 303–328
Clustering high-dimensional data is a challenging task in data mining, and clustering high-dimensional categorical data is even more challenging because it is more difficult to measure the similarity between categorical objects. Most algorithms assume feature independence when computing similarity between data objects, or make use of computationally demanding techniques such as PCA for numerical data. Hierarchical clustering algorithms are often based on similarity measures computed on a common feature space, which is not effective when clustering high-dimensional data. Subspace clustering algorithms discover feature subspaces for clusters, but are mostly partition-based; i.e. they do not produce a hierarchical structure of clusters. In this paper, we propose a hierarchical algorithm for clustering high-dimensional categorical data, based on a recently proposed information-theoretical concept named holo-entropy. The algorithm proposes new ways of exploring entropy, holo-entropy and attribute weighting in order to determine the feature subspace of a cluster and to merge clusters even though their feature subspaces differ. The algorithm is tested on UCI datasets, and compared with several state-of-the-art algorithms. Experimental results show that the proposed algorithm yields higher efficiency and accuracy than the competing algorithms and allows higher reproducibility.