LM101-077: How to Choose the Best Model using BIC from Learning Machines 101

Learning Machines 101

Richard M. Golden, Ph.D., M.S.E.E., B.S.E.E.

Episodes

LM101-077: How to Choose the Best Model using BIC

24 minutes Posted May 1, 2019 at 8:03 pm.

0:00

24:15

Download MP3

Share at current time

Show notes

In this 77th episode of www.learningmachines101.com , we explain the proper semantic interpretation of the Bayesian Information Criterion (BIC) and emphasize how this semantic interpretation is fundamentally different from AIC (Akaike Information Criterion) model selection methods. Briefly, BIC is used to estimate the probability of the training data given the probability model, while AIC is used to estimate out-of-sample prediction error. The probability of the training data given the model is called the “marginal likelihood”. Using the marginal likelihood, one can calculate the probability of a model given the training data and then use this analysis to support selecting the most probable model, selecting a model that minimizes expected risk, and support Bayesian model averaging. The assumptions which are required for BIC to be a valid approximation for the probability of the training data given the probability model are also discussed.

← See all 85 episodes of Learning Machines 101