UPM Institutional Repository

A maximal-clique-based clustering approach for multi-observer multi-view data by using k-nearest neighbor with S-pseudo-ultrametric induced by a fuzzy similarity


Citation

Khameneh, Azadeh Zahedi and Ghaznavi, Mehrdad and Kilicman, Adem and Mahad, Zahari and Mardani, Abbas (2024) A maximal-clique-based clustering approach for multi-observer multi-view data by using k-nearest neighbor with S-pseudo-ultrametric induced by a fuzzy similarity. Neural Computing and Applications, 36 (16). pp. 9525-9550. ISSN 0941-0643; eISSN: 1433-3058

Abstract

Partitioning multi-view data is a recent challenge in clustering methods, which traditionally consider single-view data. In clustering techniques, finding the similarity or distance between objects, handled by metrics in Rn, plays a central role in community detection. Under this framework, different algorithms have been developed where the output relies on an exact distance calculated based on the objects’ features. As feature information might be qualitative data defined in an ambiguous environment, this study offers a new class of metrics, so-called S-distance, as a dual of a fuzzy T-similarity, which successfully produces a collective distance based on all views/observers and provides a more flexible framework to define distance under uncertainty. Besides, most existing approaches handle multi-view clustering by aggregating each view’s clusters or using an iterative optimization method; both are time-consuming. Here, by transforming the multi-view clustering problem into node clustering, we suggest a new approach without iteration for multi-view and multi-observer data. Our proposed method, GMSkNN, uses an attribute-structural similarity relation between nodes to get more coherent clusters. To this end, we first build a k-nearest neighbor (kNN) directed graph using the proposed S-distance, then transform it into an undirected graph based on the neighborhood information of the nodes so that the resultant graph is characterized based on nodes interactions and initial features information of the nodes. Next, a new maximal-clique-based clustering is designed to complete the node partitioning. The proposed clustering algorithm is programmed and tested on synthetic and four real-world datasets using the R software. The clustering results are analyzed based on several indexes. This analysis shows the efficiency of the proposed algorithm compared to the traditional clustering methods. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.


Download File

Full text not available from this repository.

Additional Metadata

Item Type: Article
Divisions: Institute for Mathematical Research
DOI Number: https://doi.org/10.1007/s00521-024-09560-x
Publisher: Springer Science and Business Media Deutschland GmbH
Keywords: Decision-making; Fuzzy T-similarity; k-Nearest neighbor digraph; Maximal clique; Multi-observer multi-view clustering; S-pseudo ultrametric
Depositing User: Ms. Azian Edawati Zakaria
Date Deposited: 28 Oct 2024 04:08
Last Modified: 28 Oct 2024 04:08
Altmetrics: http://www.altmetric.com/details.php?domain=psasir.upm.edu.my&doi=10.1007/s00521-024-09560-x
URI: http://psasir.upm.edu.my/id/eprint/112041
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item