Introduction to ML Deployment: Flask, Docker & Locust Introduction What’s “deployment” anyway? Setup Project Overview What’s Flask? Create Flask App Containerise Flask App Test Flask App Summary

Learn deploy your models in Python and measure the performance using Locust

You’ve spent a whole lot of time on EDA, rigorously crafted your features, tuned your model for days and at last have something that performs well on the test set. Now what? Now, my friend, we’d like to deploy the model. In any case, any model that stays within the notebook has a worth of zero, no matter how good it’s.

It would feel overwhelming to learn this a part of the info science workflow, especially if you happen to don’t have a whole lot of software engineering experience. Fear not, this post’s predominant purpose is to get you began by introducing one of the crucial popular frameworks for deployment in Python — Flask. As well as, you’ll learn containerise the deployment and measure its performance, two steps which are incessantly ignored.

Very first thing first, let’s make clear what I mean by deployment on this post. ML deployment is the means of taking a trained model and integrating it right into a production system (server within the diagram below), making it available to be used by end-users or other systems.

Model deployment diagram. Image by creator.

Take into accout that in point of fact, deployment process is way more complicated than simply making the model available to end-users. It also involves service integration with other systems, collection of an appropriate infrastructure, load balancing and optimisation, and robust testing of all of those components. Most of those steps are out-of-scope for this post and will ideally be handled by experienced software/ML engineers. Nevertheless, it’s essential to have some understanding around these areas which is why this post will cover containerisation, inference speed testing, and cargo handling.

All of the code could be present in this GitHub repo. I’ll show fragments from it, but make sure that to drag it and experiment with it, that’s the very best approach to learn. To run the code you’ll need — docker , flask , fastapi , and locust installed. There is perhaps some additional dependencies to put in, depending on the environment you’re running this code in.

To make the educational more practical, this post will show you an easy demo deployment of a loan default prediction model. The model training process is out of scope for this post, so already trained and serialised CatBoost model is on the market within the GitHub repo. The model was trained on the pre-processed U.S. Small Business Administration dataset (CC BY-SA 4.0 license). Be happy to explore the info dictionary to grasp what each of the columns mean.

This project focuses totally on the serving part i.e. making the model available to other systems. Hence, the model will actually be deployed in your local machine which is sweet for testing but is suboptimal for the actual world. Listed below are the predominant steps that deployments for Flask and FastAPI will follow:

Create API endpoint (using Flask or FastAPI)
Containerise the appliance (endpoint) using Docker
Run the Docker image locally, making a server
Test the server performance

Sounds exciting, right? Well, let’s start then!

Flask is a well-liked and widely adopted web framework for Python on account of its lightweight nature and minimal installation requirements. It offers a simple approach to developing REST APIs which are ideal for serving machine learning models.

The everyday workflow for Flask involves defining a prediction HTTP endpoint and linking it to specific Python functions that receive data as input and generate predictions as output. This endpoint can then be accessed by users and other applications.

For those who’re curious about simply making a prediction endpoint, it’s going to be quite easy. All you must do is to deserialise the model, create the Flask application object, and specify the prediction endpoint with POST method. More details about POST and other methods yow will discover here.