A weighted Cramér’s V Index for the assessment of stability in the fuzzy clustering of class C G protein-coupled receptors

Other authors

Universitat Politècnica de Catalunya. Departament de Ciències de la Computació

Universitat Politècnica de Catalunya. SOCO - Soft Computing

Publication date

2015

Abstract

After decades of intensive use, K-Means is still a common choice for crisp data clustering in real-world applications, particularly in biomedicine and bioinformatics. It is well-known that different initializations of the algorithm can lead to different solutions, precluding replicability. It has also been reported that even solutions with very similar errors may widely differ. A criterion for the choice of clustering solutions according to a combination of error and stability measures has recently been suggested. It is based on the use of Cramér’s V index, calculated from contingency tables, which is valid only for crisp clustering. Here, this criterion is extended to fuzzy and probabilistic clustering by first defining weighted contingency tables and a corresponding weighted Cramér’s V index. The proposed method is illustrated using Fuzzy C-Means in a proteomics problem.


Peer Reviewed


Postprint (author's final draft)

Document Type

Conference report

Language

English

Publisher

Springer

Related items

http://link.springer.com/chapter/10.1007/978-3-319-16483-0_52

Recommended citation

This citation was generated automatically.

Rights

Open Access

This item appears in the following Collection(s)

E-prints [73032]