Abstract
The Expectation-Maximization (EM) algorithm is routinely used for maximum likelihood estimation in latent class analysis. However, the EM algorithm comes with no global guarantees of reaching the global optimum. We study the geometry of the latent class model in order to understand the behavior of the maximum likelihood estimator. In particular, we characterize the boundary stratification of the binary latent class model with a binary hidden variable. For small models, such as for three binary observed variables, we show that this stratification allows exact computation of the maximum likelihood estimator. In this case we use simulations to study the maximum likelihood estimation attraction basins of the various strata and performance of the EM algorithm. Our theoretical study is complemented with a careful analysis of the EM fixed point ideal which provides an alternative method of studying the boundary stratification and maximizing the likelihood function. In particular, we compute the minimal primes of this ideal in the case of a binary latent class model with a binary or ternary hidden random variable.
| Original language | English |
|---|---|
| Pages (from-to) | 51-84 |
| Number of pages | 34 |
| Journal | Journal of Algebraic Statistics |
| Volume | 10 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - 2019 |
| MoE publication type | A1 Journal article-refereed |
Funding
Kubjas was supported by the European Union's Horizon 2020 research and innovation programme (Marie Sklodowska-Curie grant agreement No 748354). Zwiernik acknowledges the support of the Beatriu de Pinos fellowship of the Government of Catalonia's Secretariat for Universities and Research of the Ministry of Economy and Knowledge.
Keywords
- Maximum likelihood estimation
- Expectation Maximization
- latent class models
- fixed point ideals
- boundary stratification
- EM ALGORITHM
- RANK
- PARAMETERS