TY - JOUR
T1 - Likelihood-free inference via classification
AU - Gutmann, Michael U.
AU - Dutta, Ritabrata
AU - Kaski, Samuel
AU - Corander, Jukka
PY - 2018
Y1 - 2018
N2 - Increasingly complex generative models are being used across disciplines as they allow for realistic characterization of data, but a common difficulty with them is the prohibitively large computational cost to evaluate the likelihood function and thus to perform likelihood-based statistical inference. A likelihood-free inference framework has emerged where the parameters are identified by finding values that yield simulated data resembling the observed data. While widely applicable, a major difficulty in this framework is how to measure the discrepancy between the simulated and observed data. Transforming the original problem into a problem of classifying the data into simulated versus observed, we find that classification accuracy can be used to assess the discrepancy. The complete arsenal of classification methods becomes thereby available for inference of intractable generative models. We validate our approach using theory and simulations for both point estimation and Bayesian inference, and demonstrate its use on real data by inferring an individual-based epidemiological model for bacterial infections in child care centers.
AB - Increasingly complex generative models are being used across disciplines as they allow for realistic characterization of data, but a common difficulty with them is the prohibitively large computational cost to evaluate the likelihood function and thus to perform likelihood-based statistical inference. A likelihood-free inference framework has emerged where the parameters are identified by finding values that yield simulated data resembling the observed data. While widely applicable, a major difficulty in this framework is how to measure the discrepancy between the simulated and observed data. Transforming the original problem into a problem of classifying the data into simulated versus observed, we find that classification accuracy can be used to assess the discrepancy. The complete arsenal of classification methods becomes thereby available for inference of intractable generative models. We validate our approach using theory and simulations for both point estimation and Bayesian inference, and demonstrate its use on real data by inferring an individual-based epidemiological model for bacterial infections in child care centers.
KW - Approximate Bayesian computation
KW - Generative models
KW - Intractable likelihood
KW - Latent variable models
KW - Simulator-based models
UR - http://www.scopus.com/inward/record.url?scp=85015068579&partnerID=8YFLogxK
UR - https://arxiv.org/abs/1407.4981
U2 - 10.1007/s11222-017-9738-6
DO - 10.1007/s11222-017-9738-6
M3 - Article
AN - SCOPUS:85015068579
VL - 28
SP - 411
EP - 425
JO - STATISTICS AND COMPUTING
JF - STATISTICS AND COMPUTING
SN - 0960-3174
IS - 2
ER -