TY - JOUR
T1 - Archetypal Analysis for Nominal Observations
AU - Seth, Sohan
AU - Eugster, Manuel J A
N1 - VK: Kaski, S.; COIN; HIIT; Triton
PY - 2016/5/1
Y1 - 2016/5/1
N2 - Archetypal analysis is a popular exploratory tool that explains a set of observations as compositions of few 'pure' patterns. The standard formulation of archetypal analysis addresses this problem for real valued observations by finding the approximate convex hull. Recently, a probabilistic formulation has been suggested which extends this framework to other observation types such as binary and count. In this article we further extend this framework to address the general case of nominal observations which includes, for example, multiple-option questionnaires. We view archetypal analysis in a generative framework: this allows explicit control over choosing a suitable number of archetypes by assigning appropriate prior information, and finding efficient update rules using variational Bayes'. We demonstrate the efficacy of this approach extensively on simulated data, and three real world examples: Austrian guest survey dataset, German credit dataset, and SUN attribute image dataset.
AB - Archetypal analysis is a popular exploratory tool that explains a set of observations as compositions of few 'pure' patterns. The standard formulation of archetypal analysis addresses this problem for real valued observations by finding the approximate convex hull. Recently, a probabilistic formulation has been suggested which extends this framework to other observation types such as binary and count. In this article we further extend this framework to address the general case of nominal observations which includes, for example, multiple-option questionnaires. We view archetypal analysis in a generative framework: this allows explicit control over choosing a suitable number of archetypes by assigning appropriate prior information, and finding efficient update rules using variational Bayes'. We demonstrate the efficacy of this approach extensively on simulated data, and three real world examples: Austrian guest survey dataset, German credit dataset, and SUN attribute image dataset.
KW - Archetypal analysis
KW - clustering
KW - nominal observations
KW - prototype
KW - simplex visualization
KW - variational Bayes
UR - https://www.scopus.com/pages/publications/84963831013
U2 - 10.1109/TPAMI.2015.2470655
DO - 10.1109/TPAMI.2015.2470655
M3 - Article
SN - 0162-8828
VL - 38
SP - 849
EP - 861
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 5
M1 - 7214318
ER -