Initially, a mixture of multivariate contaminated normal distributions was developed for model-based clustering. In addition to the parameters of the classical normal mixture, this model has, for each cluster, a parameter controlling the proportion of mild outliers and one specifying the degree of contamination. Parsimony is introduced via eigen-decomposition of the component covariance matrices. R software was developed to make the approach publicly available. This model was then extended in several ways to account for different situations that may arise.
Impact
Because the parameters controlling the proportion of mild outliers and the degree of contamination, respectively, do not have to be specified a priori, this work has added flexibility to the ongoing problem of robust clustering.
Student Experience
A former Ph.D. student was involved in the extension to binary data.
Countries
Italy
Impact
Research
Institutional Partner(s)
University of Catania
Community Partner(s)
Industry Partner(s)
Key Outcomes
Publications
Sponsorship
Federal
Sponsorship Details
Canada Research Chairs program. NSERC Discovery grant.