Create Your Own Nested Sampling Algorithm for Bayesian Parameter Fitting and Model Selection (With Python)
Level Up Coding

In today’s recreational coding exercise, we learn a more advanced and robust Monte Carlo approach for model parameter fitting, one that also allows us to calculate the Bayesian evidence of a model and perform model selection. Previously, we implemented the simpler Metropolis-Hastings MCMC algorithm, and I highly recommend the linked post as a primer on Monte Carlo methods and the Bayesian framework.

You can find the accompanying Nested Sampling Python code on GitHub.

Before we start, below is a gif of what running our model fitting algorithm looks like:

Nested Sampling in action

Nested Sampling

The Nested Sampling algorithm extends ideas from Metropolis-Hastings MCMC. Along with obtaining the posteriors of the model parameters being fit to a data set, the method calculates the Bayesian evidence (Z) of the model. The algorithm originates from John Skilling (2004).

We discussed the Bayesian framework in the Metropolis-Hastings MCMC post. To recap, recall:

Bayes’ Theorem: P(θ|D,H) = P(D|θ,H) P(θ|H) / P(D|H)

where:

  • θ is the set of model parameters
  • D is the data set
  • H is the model (‘hypothesis’)
  • P(θ|D,H) is the posterior distribution of the model parameters given the data and model
  • P(θ|H) is the prior distribution of the model parameters given the model
  • Z = P(D|H) is the Bayesian evidence

In the usual Metropolis-Hastings MCMC algorithm, the Bayesian evidence is just a normalization factor that can be ignored. Here, however, we are interested in calculating it, which is a challenge since it is a high-dimensional integral, the average of the likelihood over the prior:

Bayesian evidence: Z = ∫ P(D|θ,H) P(θ|H) dθ

Luckily, Nested Sampling gives us just the means to calculate this essential quantity.

The algorithm creates a fixed number of ‘live’ particles that sample the parameter space. The live particles are initially drawn from the prior distribution. In each step i = 1..M of the algorithm, perform the following:

  1. Find the live particle that has the lowest likelihood
  2. Replace that particle with a newly proposed higher-likelihood particle by choosing another live particle at random and applying a small number of MCMC steps to it. Repeat until a fixed number of acceptances (e.g. 10) have taken place, to ensure the new particle is uncorrelated with the live sample. In the MCMC step, accept the proposed set of parameters with a probability equal to the ratio of priors
  3. Keep a record of the live particles that are kicked out of the sample, their likelihoods L_i, and their weights w_i = exp(-i/N_live)

At the end of the algorithm, calculate the Bayesian evidence as:

numerical estimate of the Bayesian evidence: Z ≈ Σᵢ Lᵢ wᵢ

Our code does just this. In the example, we create 20 live particles, and a threshold of 10 accepted MCMC steps is used to update particles. A total of M = 600 outer iterations are taken:
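As a compact sketch of the full procedure, here is a hypothetical one-parameter example: fitting the mean of Gaussian data under a uniform prior. All names and settings are illustrative rather than taken from the article’s actual code, the iteration count is reduced for speed, and the proposal width is simply tied to the spread of the live set:

```python
import numpy as np

rng = np.random.default_rng(42)
data = rng.normal(1.0, 1.0, size=40)        # synthetic data with true mean 1

def log_likelihood(mu):
    return -0.5 * np.sum((data - mu) ** 2) - 0.5 * data.size * np.log(2 * np.pi)

def log_prior(mu):                          # uniform prior on [-5, 5]
    return 0.0 if -5.0 <= mu <= 5.0 else -np.inf

N_live, M, n_accept = 20, 300, 10
live = rng.uniform(-5, 5, N_live)           # live particles drawn from the prior
live_logL = np.array([log_likelihood(m) for m in live])
dead_logL, log_w = [], []

for i in range(1, M + 1):
    worst = np.argmin(live_logL)            # 1. lowest-likelihood live particle
    L_star = live_logL[worst]
    dead_logL.append(L_star)
    # 3. record the weight as the prior-volume shrinkage between
    #    X_{i-1} and X_i, with X_i = exp(-i / N_live)
    log_w.append(-i / N_live + np.log(np.expm1(1.0 / N_live)))

    # 2. copy a random live particle and evolve it with MCMC (accepting on
    #    the prior ratio) until n_accept moves above the likelihood floor
    theta = live[rng.integers(N_live)]
    sigma = max(live.std(), 1e-12)          # proposal width ~ live-set spread
    accepted = tries = 0
    while accepted < n_accept and tries < 2000:
        tries += 1
        prop = theta + sigma * rng.normal()
        if (np.log(rng.uniform()) < log_prior(prop) - log_prior(theta)
                and log_likelihood(prop) > L_star):
            theta, accepted = prop, accepted + 1
    live[worst] = theta                     # replace the discarded particle
    live_logL[worst] = log_likelihood(theta)

# evidence Z ~ sum_i L_i w_i, accumulated in log space to avoid underflow
terms = np.array(dead_logL) + np.array(log_w)
logZ = terms.max() + np.log(np.sum(np.exp(terms - terms.max())))
print(f"log-evidence estimate: {logZ:.2f}")
```

A production run would also add the remaining live particles’ contribution to Z and terminate on its convergence rather than after a fixed M.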

In the code above, we have added a refinement to the MCMC step to make it adaptive; we discuss this next.

Adding adaptivity to the MCMC proposal step

Adding adaptivity to the proposal step of the MCMC algorithm helps us search the good parts of the parameter space in less time. By adaptivity, we mean that at the start of the algorithm our initial guesses for the model parameters may be far from a good fit to the data, so MCMC proposals to find better ones benefit from taking large steps. Late in the algorithm, however, when we have presumably honed in on a close fit, we want to take small perturbations around it to better sample the likely fits. We would like the proposal acceptance rate to be around 50%, so we adapt the Gaussian widths of the proposal distribution accordingly:

Having the acceptance rate be around 50% makes sense: if the rate were much smaller, the algorithm would waste a lot of effort rejecting too many proposals, but we also want some rejection, which helps better sample the space.
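One common way to implement this (a heuristic sketch, not necessarily the article’s exact update rule) is to multiply the proposal width by a constant factor when the running acceptance rate exceeds the 50% target, and divide it otherwise:

```python
def adapt_width(sigma, n_accepted, n_tried, target=0.5, factor=1.1):
    """Nudge a Gaussian proposal width toward a target acceptance rate.

    Heuristic sketch: widen the proposal when acceptances are too frequent
    (steps are too timid), shrink it when rejections dominate.
    """
    rate = n_accepted / max(n_tried, 1)
    return sigma * factor if rate > target else sigma / factor

# high acceptance rate -> take bigger steps; low rate -> smaller steps
wide = adapt_width(1.0, n_accepted=8, n_tried=10)
narrow = adapt_width(1.0, n_accepted=2, n_tried=10)
```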

Resampling the Posterior Distribution

To obtain the posterior distribution of the model parameters, we need to perform a post-processing step. We simply resample the kicked-out/collected live particles with weights:

posterior resampling weights: pᵢ = Lᵢ wᵢ / Z

This is done with numpy’s random.choice function:
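For example, with made-up stand-ins for the collected particles and their log(Lᵢ wᵢ) values (not the article’s data):

```python
import numpy as np

rng = np.random.default_rng(0)

# hypothetical dead particles (1D parameter values) and their log(L_i * w_i)
dead = np.array([0.8, 0.9, 1.0, 1.1, 1.2])
log_Lw = np.array([-3.0, -1.5, -0.5, -1.5, -3.0])

# normalize to posterior probabilities p_i = L_i w_i / Z, working in log
# space so small likelihoods do not underflow
p = np.exp(log_Lw - log_Lw.max())
p /= p.sum()

# resample with numpy's random choice to get equally-weighted posterior draws
posterior_samples = rng.choice(dead, size=1000, replace=True, p=p)
```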

Model Selection using the Bayesian Evidence

If two different models (hypotheses H₁ and H₂, each with its own set of parameters) are used to fit the data set, comparing their Bayesian evidences (Z) can be used to decide whether one model is preferred over the other. The comparison is formally done by calculating the Bayes Factor B:

Bayes Factor: B = [Z₂ / Z₁] × [P(H₂) / P(H₁)]

where P(H₂)/P(H₁) is the prior probability ratio of the two models; in many contexts, without more information, it may be set to 1.

As a rule of thumb, a difference in the log of the model evidences of:

  • |Δ log Z| < 1 means inconclusive evidence to favor one model over the other
  • 1 < |Δ log Z| < 2.5 means weak evidence
  • 2.5 < |Δ log Z| < 5 means moderate evidence
  • |Δ log Z| > 5 means strong evidence

(where ‘log’ here is the natural logarithm). This heuristic can be used to help decide whether one model is strongly favored over another by the data.
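Putting the pieces together, with made-up evidence values purely for illustration:

```python
def log_bayes_factor(logZ1, logZ2, log_prior_ratio=0.0):
    """log B = log Z2 - log Z1 + log[P(H2)/P(H1)]; the prior ratio defaults
    to 1 (log ratio 0) when there is no extra information."""
    return logZ2 - logZ1 + log_prior_ratio

# hypothetical evidences from two nested-sampling runs; a positive log B
# of this size would count as strong evidence for H2 on the scale above
logB = log_bayes_factor(logZ1=-62.0, logZ2=-56.5)
```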

Nested Sampling vs Metropolis-Hastings MCMC

Nested Sampling has several advantages over Metropolis-Hastings MCMC, including:

✅ Multiple particles concurrently explore the parameter space

  • better coverage, faster convergence, and less chance of getting stuck in a local minimum

✅ Calculation of the Bayesian Evidence Z

  • allows one to quantitatively decide which model is better at explaining the data set

✅ Well-defined termination point

  • look for convergence of Z
