Create Your Own Metropolis-Hastings Markov Chain Monte Carlo Algorithm for Bayesian Inference (With Python)
Level Up Coding

In today’s recreational coding exercise, we learn how to fit model parameters to data (with error bars) and obtain the most likely distribution of model parameters that best explain the data, called the posterior distribution. We will do so in a Bayesian framework, which is a very powerful approach because it allows us to incorporate prior knowledge and uncertainties, and to update our beliefs about the model parameters as we observe more data. We will sample the posterior distribution of model parameters using a simple and general Markov Chain Monte Carlo (MCMC) method known as the Metropolis-Hastings algorithm.

You may find the accompanying Python code on GitHub.

Before we begin, below is a GIF of what running our model-fitting algorithm looks like:

MCMC fitting

Bayesian Framework

We will use a Bayesian framework for model fitting and parameter estimation. Supposing a model with a set of parameters θ and given a dataset D, we seek to find the posterior distribution P(θ|D) for the model parameters using Bayes’ theorem:

$$P(\theta \mid D) = \frac{P(D \mid \theta)\,P(\theta)}{P(D)}$$

where

  • P(θ|D) is the posterior: the probability for the parameters θ to have a given value, given that the data D are true.
  • P(D|θ) is the likelihood of the data D, assuming θ is true.
  • P(θ) is the prior: the probability that a set of parameters θ is true (before seeing any data).
  • P(D) is the marginalization, that is, the probability of the data D being true.

That’s all we’ll need for Bayesian inference. A simple, yet powerful idea.

Metropolis-Hastings Markov Chain Monte Carlo

The Metropolis-Hastings MCMC algorithm will randomly sample the posterior distribution. A Markov chain is a stochastic process in which each state in the chain depends only on the previous state. The algorithm will:

  1. Draw a random value for θ from the prior distribution; call it θ_prev.
  2. For i = 1 … N, where N is the length of the Markov chain:
  • Propose a new value θ_prop to add to the chain by adding a random perturbation to θ_prev using a proposal distribution.
  • Evaluate the posterior probabilities P_prop and P_prev of the parameters θ_prop and θ_prev.
  • Draw a random number U from a uniform distribution between 0 and 1.
  • If U < min(1, P_prop/P_prev), then add θ_prop to the Markov chain and set θ_prev = θ_prop; otherwise, add (another copy of) θ_prev to the chain.

3. Cut off the beginning of the chain (the ‘burn-in’ region), where the model parameters are far from being a good fit.

In terms of code, this looks like:
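Below is a minimal sketch of the loop, assuming user-supplied helper functions sample_prior(), propose(), and log_posterior() (illustrative names, sketched in the paragraphs that follow), and working with log-posteriors for numerical robustness:

```python
import numpy as np

def metropolis_hastings(log_posterior, sample_prior, propose, n_steps=10_000, burn_in=1_000):
    """Sample a posterior distribution with the Metropolis-Hastings algorithm."""
    theta_prev = sample_prior()              # step 1: draw initial parameters from the prior
    log_p_prev = log_posterior(theta_prev)
    chain = []
    for _ in range(n_steps):                 # step 2: grow the Markov chain
        theta_prop = propose(theta_prev)     # random perturbation of the previous state
        log_p_prop = log_posterior(theta_prop)
        # accept with probability min(1, P_prop / P_prev), evaluated in log space
        if np.log(np.random.rand()) < log_p_prop - log_p_prev:
            theta_prev, log_p_prev = theta_prop, log_p_prop
        chain.append(theta_prev)             # accepted value, or another copy of the previous one
    return np.array(chain)[burn_in:]         # step 3: cut off the burn-in region
```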

The prior distribution may, for example, be a uniform distribution with a min and max value, if not much is known about the parameters.
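As a sketch, a flat prior over a box of per-parameter bounds (the bounds array here is illustrative) might look like:

```python
import numpy as np

# illustrative (min, max) bounds, one row per model parameter
bounds = np.array([[0.0, 10.0],
                   [0.0,  5.0]])

def sample_prior():
    """Draw each parameter uniformly between its min and max."""
    return np.random.uniform(bounds[:, 0], bounds[:, 1])

def log_prior(theta):
    """Flat prior: log-prior is a constant (taken to be 0) inside the box, -inf outside."""
    inside = np.all((theta >= bounds[:, 0]) & (theta <= bounds[:, 1]))
    return 0.0 if inside else -np.inf
```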

The proposal distribution (in the ‘propose()’ function) is a random perturbation that is added to each parameter, as a way to traverse the parameter space. For example, the perturbation step can be a random value drawn from a Gaussian. The important thing to set in that case is the standard deviation of the Gaussian. It makes sense for the perturbation to be neither too large nor too small, so in this example we set it to be 2% of the parameter range. The exact step size does not matter too much, but it can affect the proposal accept/reject rate of the MCMC algorithm. If the parameters are bounded, then we also need to handle the case where a proposed set of parameters θ_prop is out of bounds; in such a case, we can reflect the value across the boundary to bring it back in bounds.
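A sketch of such a propose() function, with a Gaussian step whose standard deviation is 2% of each parameter’s range and with reflection at the bounds (reusing the illustrative bounds array and numpy import from above):

```python
def propose(theta):
    """Gaussian perturbation of each parameter, reflected at the parameter bounds."""
    widths = bounds[:, 1] - bounds[:, 0]
    theta_prop = theta + 0.02 * widths * np.random.randn(len(theta))  # std = 2% of range
    # reflect any out-of-bounds value back across the violated boundary
    theta_prop = np.where(theta_prop < bounds[:, 0], 2.0 * bounds[:, 0] - theta_prop, theta_prop)
    theta_prop = np.where(theta_prop > bounds[:, 1], 2.0 * bounds[:, 1] - theta_prop, theta_prop)
    return theta_prop
```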

Finally, at the heart of the algorithm is the calculation of the posterior. The most important part of that step is the calculation of the likelihood, which depends on the model and parameters you are fitting (more on this in the next section). Since all that matters in the MCMC algorithm is the ratio of posteriors P_prop/P_prev, we do not need to calculate the marginalization P(D), because it cancels in the ratio. Moreover, if the priors are uniform/flat, we can skip calculating those too, because they also cancel.

Note: In some cases, the likelihood can be a very small number. For numerical reasons, it can be more robust to instead calculate the log of the likelihood directly.
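In log space the acceptance test U < P_prop/P_prev becomes log U < log P_prop − log P_prev (as written in the loop above), and a log-posterior that exploits both cancellations might look like:

```python
def log_posterior(theta):
    """Unnormalized log-posterior: the marginalization P(D) cancels in the
    MCMC ratio, and a flat prior contributes only its bounds check."""
    lp = log_prior(theta)
    if not np.isfinite(lp):
        return -np.inf                     # out-of-bounds proposals are always rejected
    return lp + log_likelihood(theta)      # log_likelihood is model-specific (next section)
```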

Example: Fitting Exoplanet Orbital Parameters of a Mock Radial Velocity Dataset

We will work through a concrete example of the Metropolis-Hastings MCMC being used to fit a (mock) dataset of exoplanet radial velocity measurements and recover the orbital parameters of the model.

Radial Velocity measurements

A telescope measures the ‘wobble’ (radial velocity) of a star due to an exoplanet orbiting it. This radial velocity v(t), measured at several points in time t, depends on the orbital parameters:

$$v(t) = V - K\left[\sin\left(f(t) + \varpi\right) + e\sin\varpi\right]$$

where

  • V is the systematic velocity of the star system,
  • K is the velocity semi-amplitude of the wobble,
  • ϖ is the longitude of periastron,
  • e is the orbital eccentricity,
  • P is the orbital period,
  • χ is the orbital phase at the start of the observations,
  • f(t) is the true anomaly, obtained from e, P, and χ by solving Kepler’s equation.
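As a sketch (the phase convention for χ here is an assumption), the model can be evaluated in Python by solving Kepler’s equation for the eccentric anomaly with a simple fixed-point iteration and converting it to the true anomaly:

```python
import numpy as np

def true_anomaly(t, P, e, chi, n_iter=20):
    """True anomaly f(t): solve Kepler's equation E - e*sin(E) = M by
    fixed-point iteration (adequate for moderate eccentricities)."""
    M = 2.0 * np.pi * (t / P + chi)   # mean anomaly; chi sets the orbital phase at t = 0
    E = M                             # initial guess for the eccentric anomaly
    for _ in range(n_iter):
        E = M + e * np.sin(E)
    # convert eccentric anomaly E to true anomaly f
    return 2.0 * np.arctan2(np.sqrt(1.0 + e) * np.sin(E / 2.0),
                            np.sqrt(1.0 - e) * np.cos(E / 2.0))

def radial_velocity(t, V, K, varpi, e, P, chi):
    """Model radial velocity v(t) = V - K * (sin(f(t) + varpi) + e*sin(varpi))."""
    f = true_anomaly(t, P, e, chi)
    return V - K * (np.sin(f + varpi) + e * np.sin(varpi))
```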

Given measurements vᵢ at times tᵢ with (Gaussian) errors σᵢ, as well as an unknown stellar jitter s (another parameter that we also fit for in our analysis), the likelihood of a given set of parameters θ = [V, s, K, ϖ, e, P, χ] is:

$$\mathcal{L} = \prod_i \frac{1}{\sqrt{2\pi\left(\sigma_i^2 + s^2\right)}} \exp\left(-\frac{\left(v_i - v(t_i)\right)^2}{2\left(\sigma_i^2 + s^2\right)}\right)$$
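In code, the corresponding log-likelihood (with the data arrays t, v, and sigma assumed to be in scope, and using the radial_velocity() sketch above) could read:

```python
def log_likelihood(theta):
    """Gaussian log-likelihood with the stellar jitter s added in quadrature
    to the per-point measurement errors (t, v, sigma are the data arrays)."""
    V, s, K, varpi, e, P, chi = theta
    var = sigma**2 + s**2                                   # total variance per data point
    resid = v - radial_velocity(t, V, K, varpi, e, P, chi)  # data minus model
    return -0.5 * np.sum(np.log(2.0 * np.pi * var) + resid**2 / var)
```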
