An advanced visualization of self-organizing maps by determining data clusters

Pavel Stefanovič; Olga Kurasova

doi:10.15388/namc.2025.30.43803


Pavel Stefanovič

Vilnius Gediminas Technical University

Olga Kurasova

Vilnius University

https://orcid.org/0000-0002-0570-1741

Published 2025-10-14



Keywords

self-organizing maps
u-matrix
similarity distances
visualization
data clustering
number of clusters

How to Cite

Stefanovič, P. and Kurasova, O. (2025) “An advanced visualization of self-organizing maps by determining data clusters”, Nonlinear Analysis: Modelling and Control, 30(6), pp. 1163–1185. doi:10.15388/namc.2025.30.43803.


Abstract

This paper proposes a novel approach that improves the visualization capabilities of self-organizing maps and facilitates the identification of the resulting clusters. Unlike other clustering algorithms, self-organizing maps offer no way to specify the number of clusters in advance, and cluster boundaries are not explicitly represented on the map. The main advantage of the proposed approach is that it allows the desired number of clusters to be selected. The experimental investigation was performed using four datasets with different characteristics. The improved visualization was tested with various similarity distances to assess their impact on performance. The clustering results of the proposed approach were compared with those of the well-known k-means and hierarchical clustering methods, which also allow the desired number of clusters to be selected. Additionally, the visualization results obtained by the proposed approach were compared with those produced by the Orange Data Mining tool, where the u-matrix is applied to visualize a self-organizing map, and the advantage of our approach over the u-matrix visualization is highlighted. Clustering performance was measured as the ratio of data items correctly assigned to clusters, for datasets in which the clusters are predefined. The results showed that the most effective similarity distances are the cosine and correlation distances, which help to correctly detect the predefined clusters in the visualization of self-organizing maps.
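As an illustration of the quantities named in the abstract, the following minimal Python sketch computes the cosine and correlation distances between two vectors and a ratio of correctly assigned data items. It is not the authors' implementation. The function names are hypothetical, the vectors are assumed to be non-zero (and non-constant for the correlation distance), and mapping each cluster to its most frequent predefined class is only one possible way of counting "correct" assignments; the paper may define the measure differently.

```python
import math
from collections import Counter


def cosine_distance(x, y):
    """1 minus the cosine of the angle between x and y (assumes non-zero vectors)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return 1.0 - dot / (norm_x * norm_y)


def correlation_distance(x, y):
    """1 minus the Pearson correlation: cosine distance of mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine_distance([a - mean_x for a in x], [b - mean_y for b in y])


def correct_assignment_ratio(true_labels, cluster_ids):
    """Share of items whose predefined class is the majority class of their cluster.

    Assumption: each cluster is labeled with its most frequent predefined class.
    """
    class_counts = {}
    for label, cluster in zip(true_labels, cluster_ids):
        class_counts.setdefault(cluster, Counter())[label] += 1
    correct = sum(c.most_common(1)[0][1] for c in class_counts.values())
    return correct / len(true_labels)


if __name__ == "__main__":
    print(cosine_distance([1, 0], [0, 1]))
    print(correlation_distance([1, 2, 3], [3, 2, 1]))
    print(correct_assignment_ratio(list("aaabbb"), [0, 0, 1, 1, 1, 1]))
```

Under these assumptions, orthogonal vectors have a cosine distance of 1, perfectly anti-correlated vectors have a correlation distance of 2, and in the last call five of the six items fall into a cluster dominated by their own class.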


This work is licensed under a Creative Commons Attribution 4.0 International License.

