TY - JOUR
T1 - DOLDA
T2 - a regularized supervised topic model for high-dimensional multi-class regression
AU - Magnusson, Måns
AU - Jonsson, Leif
AU - Villani, Mattias
PY - 2020/3/1
Y1 - 2020/3/1
N2 - Generating user interpretable multi-class predictions in data-rich environments with many classes and explanatory covariates is a daunting task. We introduce Diagonal Orthant Latent Dirichlet Allocation (DOLDA), a supervised topic model for multi-class classification that can handle many classes as well as many covariates. To handle many classes we use the recently proposed Diagonal Orthant probit model (Johndrow et al., in: Proceedings of the sixteenth international conference on artificial intelligence and statistics, 2013) together with an efficient Horseshoe prior for variable selection/shrinkage (Carvalho et al. in Biometrika 97:465–480, 2010). We propose a computationally efficient parallel Gibbs sampler for the new model. An important advantage of DOLDA is that learned topics are directly connected to individual classes without the need for a reference class. We evaluate the model’s predictive accuracy and scalability, and demonstrate DOLDA’s advantage in interpreting the generated predictions.
AB - Generating user interpretable multi-class predictions in data-rich environments with many classes and explanatory covariates is a daunting task. We introduce Diagonal Orthant Latent Dirichlet Allocation (DOLDA), a supervised topic model for multi-class classification that can handle many classes as well as many covariates. To handle many classes we use the recently proposed Diagonal Orthant probit model (Johndrow et al., in: Proceedings of the sixteenth international conference on artificial intelligence and statistics, 2013) together with an efficient Horseshoe prior for variable selection/shrinkage (Carvalho et al. in Biometrika 97:465–480, 2010). We propose a computationally efficient parallel Gibbs sampler for the new model. An important advantage of DOLDA is that learned topics are directly connected to individual classes without the need for a reference class. We evaluate the model’s predictive accuracy and scalability, and demonstrate DOLDA’s advantage in interpreting the generated predictions.
KW - Diagonal Orthant probit model
KW - Horseshoe prior
KW - Interpretable models
KW - Latent Dirichlet Allocation
KW - Text classification
UR - http://www.scopus.com/inward/record.url?scp=85067414496&partnerID=8YFLogxK
U2 - 10.1007/s00180-019-00891-1
DO - 10.1007/s00180-019-00891-1
M3 - Article
AN - SCOPUS:85067414496
SN - 0943-4062
VL - 35
SP - 175
EP - 201
JO - Computational Statistics
JF - Computational Statistics
IS - 1
ER -