Clustering ranked data using copulas

Marta Nai Ruscone

Conference proceeding

Clustering ranked data using copulas

Marta Nai Ruscone

16th conference of the International federation of classification societies: 26-29 August 2019, Thessaloniki Concert Hall, Thessaloniki, Greece, #IFCS2019: abstract book, pp.127-127

16th conference of the International federation of classification societies (Thessaloniki, Greece, 26/08/2019 - 29/08/2019)

2019

Abstract

Copulas

Ranked data

Dissimilarity

Cluster analysis

Clustering of ranking data aims at the identification of groups of subjects with a homogenous, common, preference behavior. Human beings naturally tend to rank objects in the everyday life such as shops, one’s place of living, choice of occupations, singers and football teams, according to their preferences. More generally, ranking data occurs when a number of subjects are asked to rank a list of objects according to their personal preference order. The input in cluster analysis is a dissimilarity matrix quantifying the differences between rankings of two subjects. The choice of the dissimilarity dramatically affects the classification outcome and therefore the computation of an appropriate dissimilarity matrix is an issue. Several distance measures have been proposed for ranking data. We propose generalizations of this kind of distance using copulas adapted to the case of missing data. We consider the case of the extreme list where only the top-k and/or bottom-k ranks are known. We discuss an optimistic and a pessimistic imputation of missing values and show its effect on the classification. Those generalizations provide a more flexible instrument to model different types of data dependence structures and consider different situations in the classification process. Simulated and real data are used to illustrate the performance and the importance of our proposal.

Files and links (1)

pdf

bitstream_6a519e73-4f32-47da-be31-cc48dd92760c799.29 kB

Ask the Library / Chiedi alla Biblioteca Restricted Access

Metrics

31 Record Views

Details

Title: Clustering ranked data using copulas
Creators: Marta Nai Ruscone (Author)
Publication Details: 16th conference of the International federation of classification societies: 26-29 August 2019, Thessaloniki Concert Hall, Thessaloniki, Greece, #IFCS2019: abstract book, pp.127-127
Date published: 2019
Publisher: Artion conferences & events; Kalamaria
Conference: 16th conference of the International federation of classification societies (Thessaloniki, Greece, 26/08/2019 - 29/08/2019)
Format: Online
Language: English
Copyright: Tutti i diritti sono riservati ai legittimi detentori del copyright. L'Università Carlo Cattaneo - LIUC pubblica i dati relativi alle pubblicazioni di ricerca realizzate dai propri affiliati. La presenza nell'archivio ARL di testi completi non determina in alcun modo la libera riproduzione degli stessi, ma esclusivamente la possibilità della loro consultazione sul sito di ARL.
Academic Unit: Area di ricerca Scienze economiche e statistiche
Resource Type: Conference proceeding
Identifiers: 991000853489205126