Obstructive sleep apnea is a heterogeneous sleep disorder with varying phenotypes. Several studies have already performed cluster analyses to discover various obstructive sleep apnea phenotypic clusters. However, the selection of the clustering method might affect the outputs. Consequently, it is unclear whether similar obstructive sleep apnea clusters can be reproduced using different clustering methods. In this study, we applied four well-known clustering methods: Agglomerative Hierarchical Clustering; K-means; Fuzzy C-means; and Gaussian Mixture Model to a population of 865 suspected obstructive sleep apnea patients. By creating five clusters with each method, we examined the effect of clustering methods on forming obstructive sleep apnea clusters and the differences in their physiological characteristics. We utilized a visualization technique to indicate the cluster formations, Cohen's kappa statistics to find the similarity and agreement between clustering methods, and performance evaluation to compare the clustering performance. As a result, two out of five clusters were distinctly different with all four methods, while three other clusters exhibited overlapping features across all methods. In terms of agreement, Fuzzy C-means and K-means had the strongest (κ = 0.87), and Agglomerative hierarchical clustering and Gaussian Mixture Model had the weakest agreement (κ = 0.51) between each other. The K-means showed the best clustering performance, followed by the Fuzzy C-means in most evaluation criteria. Moreover, Fuzzy C-means showed the greatest potential in handling overlapping clusters compared with other methods. In conclusion, we revealed a direct impact of clustering method selection on the formation and physiological characteristics of obstructive sleep apnea clusters. In addition, we highlighted the capability of soft clustering methods, particularly Fuzzy C-means, in the application of obstructive sleep apnea phenotyping.
- A comparative analysis of unsupervised machine-learning methods in PSG-related phenotyping
- Mohammadreza Ghorvei (Author)Tuomas Karhu (Author)Salla Hietakoste (Author)Daniela Ferreira-Santos (Author)Harald Hrubos-Strøm (Author)Anna Sigridur Islind (Author)Luka Biedebach (Author)Sami Nikkonen (Author)Timo Leppänen (Author)Matias Rusanen (Author)
- Journal of sleep research, Vol.34(3), pp.1-13
- 2025
- John Wiley and Sons; Hoboken
- 09621105; 1365-2869
- 13
- 5041828; 5041790; 5041798 / Wellbeing Service County of North Savo Kuopion Yliopistollinen Sairaala (http://data.elsevier.com/vocabulary/SciValFunders/501100004092) MIAI@Grenoble Alpes Hengityssairauksien Tutkimussäätiö (501100008309) Agence Nationale de la Recherche (http://data.elsevier.com/vocabulary/SciValFunders/501100001665) Tampereen Tuberkuloosisäätiö (501100006706) 965417; 5041809; 5041794 / Horizon 2020 Framework Programme (http://data.elsevier.com/vocabulary/SciValFunders/100010661) Orionin Tutkimussäätiö (http://data.elsevier.com/vocabulary/SciValFunders/501100007083) Horizon 2020 Framework Programme (http://data.elsevier.com/vocabulary/SciValFunders/100010661) Kuopion Seudun Hengityssäätiö (http://data.elsevier.com/vocabulary/SciValFunders/501100013506)
- English
- Tutti i diritti sono riservati ai legittimi detentori del copyright. L'Università Carlo Cattaneo - LIUC pubblica i dati relativi alle pubblicazioni di ricerca realizzate dai propri affiliati. La presenza nell'archivio ARL di testi completi non determina in alcun modo la libera riproduzione degli stessi, ma esclusivamente la possibilità della loro consultazione sul sito di ARL.
- Università Carlo Cattaneo - LIUC
- Journal article
- 991001124375405126