Machine Learning Model – Serverless Deployment

asha Last Updated : 14 Dec, 2020

5 min read

Introduction

Read this article on machine learning model deployment using serverless deployment. Serverless compute abstracts away provisioning, managing
severs and configuring software, simplifying model deployment.

Aimed towards becoming a Full Stack Data Scientist.

What is a serverless deployment?

Serverless is the next step in Cloud Computing. This means that servers are simply hidden from the picture. In serverless computing, this separation of server and application is managed by using a platform. The responsibility of the platform or serverless provider is to manage all the needs and configurations for your application. These platforms manage the configuration of your server behind the scenes. This is how in serverless computing, one can simply focus on the application or code itself being built or deployed.

Machine Learning Model Deployment is not exactly the same as software development. In ML models a constant stream of new data is needed to keep models working well. Models need to adjust in the real world because of various reasons like adding new categories, new levels, and many other reasons. Deploying models is just the beginning, as many times models need to retrain and check their performance. So, using serverless deployment can save time and effort and for retraining models every time, which is cool!

serverless deployment : ML workflow

Fig: ML Workflow

Models are performing worse in production than in development, and the solution needs to be sought in deployment. So, it’s easy to deploy ML models through serverless deployment.

Prerequisites to understand serverless deployment

Basic understanding of cloud computing
Basic understanding of cloud functions
Machine Learning

Serverless Deployment Models for prediction

We can deploy our ML model in 3 ways:

web hosting frameworks like Flask and Django, etc.
Serverless compute AWS lambda, Google Cloud Functions, Azure Functions
Cloud Platform specific frameworks like AWS Sagemaker, Google AI Platform, Azure Function

Servverless models for prediction

Fig: Types of ML model deployment

Serverless deployment architecture overview

cloud function

Fig: A Image is taken from google search and modified

Store models in Google Cloud Storage buckets then write Google Cloud Functions. Using Python for retrieving models from the bucket and by using HTTP JSON requests we can get predicted values for the given inputs with the help of Google Cloud Function.

Steps to start serverless model deployment

1. About Data, code, and models

Taking the movie reviews datasets for sentiment analysis, see the solution here in my GitHub repository and data, models also available in the same repository.

2. Create a storage bucket

By executing the “ServerlessDeployment.ipynb“ file you will get 3 ML models: Decision Classifier, LinearSVC, and Logistic Regression.

Click on the Browser in Storage option for creating a new bucket as shown in the image:

serverless deployment storage bucket

Fig: click Store option from GCP

3. Create a new function

Create a new bucket, then create a folder and upload the 3 models in that folder by creating 3 subfolders as shown.

Here models are my main folder name and my subfolders are:

decision_tree_model
linear_svc_model
logistic_regression_model

serverless deployment new model

Fig: Folders at Storage

4. Create a function

Then go to Google Cloud Functions and create a function, then select trigger type as HTTP and select language as Python (you can choose any language):

function

Fig: Select Cloud Function option from GCP

5. Write cloud function in the editor

Check the cloud function in my repository, here I have imported required libraries for calling models from google cloud bucket and other libraries for HTTP request GET method used to test the URL response and POST method delete default template and paste our code then pickel is used for deserialized our model google.cloud — access our cloud storage function.

If the incoming request is GET we simply return “welcome to classifier”.

If the incoming request is POST access the JSON data in the body request get JSON gives us to instantiate the storage client object and access models from the bucket, here we have 3 — classification models in the bucket.

If the user specifies “Decision Classifier” we access the model from the respective folder respectively with other models.

If the user does not specify any model, the default model is the Logistic Regression model.

The blob variable contains a reference to the model.pkl file for the correct model.

We download the .pkl file on to the local machine where this cloud function is running. Now every invocation might be running on a different VM and we only access /temp folder on the VM that’s why we save our model.pkl file.

We desterilize the model by invoking pkl.load access the prediction instances from the incoming request and call model.predict on the prediction data.

The response that will send back from the serverless function is the original text that is the review that we want to classify and our pred class.

After main.py write requirement.txt with required libraries and versions

write cloud function editor

Fig : Google Cloud Function(find detailed code in my github page)

5. Deploy the model

Fig : Green tick represent successful model deployment

6. Test the model

Fig : Give model name and review(s) for testing

Test function with other model

test function

Fig : Test the model

Code References:

My GitHub Repository : https://github.com/Asha-ai/ServerlessDeployment

Become a Full Stack Data Scientist by learning various ML Model deployments and reason behind this much explanation at initial days I struggle a lot for learning ML Model deployment, So I decided my blog should useful to data science freshers end to end

will meet you with my next blog : Deploy ML Model using “Web Hosting Framework – Flask“

asha

Have 4+ years of experience in data science field, working in telecom domain With a strong foundation in Python, machine learning, deep learning, natural language processing (NLP), and artificial intelligence (AI).

Free Courses

4.7

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

4.5

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

4.6

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

4.8

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

4.7

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

Reading list

Basics of Machine Learning

Machine Learning Lifecycle

Importance of Stats and EDA

Understanding Data

Probability

Exploring Continuous Variable

Exploring Categorical Variables

Missing Values and Outliers

Central Limit theorem

Bivariate Analysis Introduction

Continuous - Continuous Variables

Continuous Categorical

Categorical Categorical

Multivariate Analysis

Different tasks in Machine Learning

Build Your First Predictive Model

Evaluation Metrics

Preprocessing Data

Linear Models

KNN

Selecting the Right Model

Feature Selection Techniques

Decision Tree

Feature Engineering

Naive Bayes

Multiclass and Multilabel

Basics of Ensemble Techniques

Advance Ensemble Techniques

Hyperparameter Tuning

Support Vector Machine

Advance Dimensionality Reduction

Unsupervised Machine Learning Methods

Recommendation Engines

Improving ML models

Working with Large Datasets

Interpretability of Machine Learning Models

Automated Machine Learning

Model Deployment

Deploying ML Models

Embedded Devices

Machine Learning Model – Serverless Deployment

Introduction

What is a serverless deployment?

Prerequisites to understand serverless deployment

Serverless Deployment Models for prediction

Serverless deployment architecture overview

Steps to start serverless model deployment

Test function with other model

Code References:

will meet you with my next blog : Deploy ML Model using “Web Hosting Framework – Flask“

Login to continue reading and enjoy expert-curated content.

Free Courses

Generative AI - A Way of Life

Getting Started with Large Language Models

Building LLM Applications using Prompt Engineering

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Microsoft Excel: Formulas & Functions

Recommended Articles

Responses From Readers

Write for us

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID

_clck

_clsk

SRM_I

SM

CLID

SRM_B

Google (7)

_gid

_ga_#

_gat_#