Support Vector Regression Tutorial for Machine Learning

Alakh Sethi Last Updated : 04 Apr, 2025

10 min read

Support Vector Machines (SVM) are widely used in machine learning for classification problems, but they can also be applied to regression problems through Support Vector Regression (SVR). SVR uses the same principles as SVM but focuses on predicting continuous outputs rather than classifying data points. This tutorial will explore SVR’s work, emphasizing key concepts such as quadratic, radial basis function, and sigmoid kernels. By leveraging these kernels, SVR can effectively handle complex, non-linear relationships in data. We will also demonstrate how to implement SVR in Python using training samples, showcasing its practical applications in artificial intelligence.

In this article you will get understanding about the Support Vector Regression Mdoel. So, Support vector regression (SVR) is a robust machine learning method utilized for forecasting continuous results. The SVR model, unlike typical regression models, employs support vector machines (SVMs) principles to transform input features into high-dimensional spaces to locate the ideal hyperplane that accurately represents the data. This method enables support vector regression (SVR) to effectively manage both linear and non-linear relationships, rendering it a versatile tool across different fields, such as financial forecasting and scientific research. Utilizing the distinctive features of support vector machine regression allows SVR models to attain high accuracy and robustness, even when dealing with intricate datasets.

Support Vector Regression Tutorial for Machine Learning | SVR | SVM

Learning Outcomes

Grasp the fundamental concepts of Support Vector Machine Regression, including hyperplanes, margins, and how SVM separates data into different classes.
Recognize the key differences between Support Vector Machines for classification and Support Vector Regression for regression problems.
Learn about important SVR hyperparameters, such as kernel types (quadratic, radial basis function, and sigmoid), and how they influence the model’s performance.
Gain practical experience in implementing Support Vector Regression using Python, including data preprocessing, feature scaling, and model training.
Use SVR to predict continuous outputs in various contexts, demonstrating its application in fields like finance, engineering, and healthcare.
Develop skills to visualize the results of SVM for Regression, understand how to interpret the best-fit line, and understand the impact of different kernels on the model’s predictions.
Learn how to assess the performance of SVR models using appropriate metrics and techniques, ensuring accurate and reliable predictions.

What is a Support Vector Machine (SVM)?
Hyperparameters of the Support Vector Machine (SVM) Algorithm
Introduction to Support Vector Regression (SVR)
- The Idea Behind Support Vector Regression
Implementing Support Vector Regression (SVR) in Python
What is the difference between SVM and SVR?
Conclusion

What is a Support Vector Machine (SVM)?

A Support Vector Machine (SVM) is a supervised machine learning algorithm used for classification and regression tasks. SVM works by finding a hyperplane in a high-dimensional space that best separates data into different classes. It aims to maximize the margin (the distance between the hyperplane and the nearest data points of each class) while minimizing classification errors. SVM can handle both linear and non-linear classification problems by using various kernel functions. It’s widely used in tasks such as image classification, text categorization, and more.

So what exactly is Support Vector Machine (SVM)? We’ll start by understanding SVM in simple terms. Let’s say we have a plot of two label classes as shown in the figure below:

Can you decide what the separating line will be? You might have come up with this:

The line fairly separates the classes. This is what SVM essentially does – simple class separation. Now, what is the data was like this:

Here, we don’t have a simple line separating these two classes. So we’ll extend our dimension and introduce a new dimension along the z-axis. We can now separate these two classes:

When we transform this line back to the original plane, it maps to the circular boundary as I’ve shown here:

This is exactly what Support Vector Machine Regression does! It tries to find a line/hyperplane (in multidimensional space) that separates these two classes. Then it classifies the new point depending on whether it lies on the positive or negative side of the hyperplane depending on the classes to predict.

Also, Read about this article “A-Z guide to Support Vector Machine“

Hyperparameters of the Support Vector Machine (SVM) Algorithm

There are a few important parameters of SVM that you should be aware of before proceeding further:

Kernel: A kernel helps us find a hyperplane in the higher dimensional space without increasing the computational cost. Usually, the computational cost will increase if the dimension of the data increases. This increase in dimension is required when we are unable to find a separating hyperplane in a given dimension and are required to move in a higher dimension:

Hyperplane: This is basically a separating line between two data classes in SVM. But in Support Vector Regression, this is the line that will be used to predict the continuous output
Decision Boundary: A decision boundary can be thought of as a demarcation line (for simplification) on one side of which lie positive examples and on the other side lie the negative examples. On this very line, the examples may be classified as either positive or negative. This same concept of SVM will be applied in Support Vector Regression as well

To understand SVM from scratch, I recommend this tutorial: How to Use Support Vector Machines (SVM) for Data Science

Introduction to Support Vector Regression (SVR)

Support Vector Regression (SVR) is a machine learning algorithm used for regression analysis. SVR Model in Machine Learning aims to find a function that approximates the relationship between the input variables and a continuous target variable while minimizing the prediction error.

Unlike Support Vector Machines (SVMs) used for classification tasks, SVR Model seeks a hyperplane that best fits the data points in a continuous space. This is achieved by mapping the input variables to a high-dimensional feature space and finding the hyperplane that maximizes the margin (distance) between the hyperplane and the closest data points, while also minimizing the prediction error.

SVR Model can handle non-linear relationships between the input and target variables by using a kernel function to map the data to a higher-dimensional space. This makes it a powerful tool for regression tasks where complex relationships may exist.

Support Vector Regression (SVR) uses the same principle as SVM but for regression problems. Let’s spend a few minutes understanding the idea behind SVR in Machine Learning.

Here top 6 Free University Courses to Learn Machine Learning

The Idea Behind Support Vector Regression

The problem of regression is to find a function that approximates mapping from an input domain to real numbers based on a training sample. So, let’s dive deep and understand how SVR actually works.

Support Vector Regression, svm regression python

Consider these two red lines as the decision boundary and the green line as the hyperplane. When we move on with Support Vector Regression (SVR) in Machine Learning, our objective is to consider the points within the decision boundary line. Our best fit line is the hyperplane with the maximum number of points.

The first thing that we’ll understand is what is the decision boundary (the danger red line above!). Consider these lines as being at any distance, say ‘a’, from the hyperplane. So, these are the lines that we draw at distance ‘+a’ and ‘-a’ from the hyperplane. This ‘a’ in the text is basically referred to as epsilon.

Assuming that the equation of the hyperplane is as follows:

Y = wx+b (equation of hyperplane)

Then the equations of decision boundary become:

wx+b= +a

wx+b= -a

Thus, any hyperplane that satisfies our SVM for Regression Model should satisfy:

-a < Y- wx+b < +a

Our main aim here is to decide a decision boundary at ‘a’ distance from the original hyperplane such that data points closest to the hyperplane or the support vectors are within that boundary line.

Hence, we will take only those points within the decision boundary that have the least error rate or are within the Margin of Tolerance. This will give us a better-fitting model.

Implementing Support Vector Regression (SVR) in Python

Time to put on our coding hats! In this section, we’ll understand the use of Support Vector Regression with the help of a dataset. Here, we have to predict the salary of an employee, given a few independent variables. A classic HR analytics project!

Implementing Support Vector Regression in Python

If you looking for Guide here is Full Guide for Support Vector Machine (SVM) Algorithm

Step 1: Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

Step 2: Reading the dataset

dataset = pd.read_csv('Position_Salaries.csv')
X = dataset.iloc[:, 1:2].values
y = dataset.iloc[:, 2].values

Step 3: Feature Scaling

A real-world dataset contains features that vary in magnitudes, units, and range. I would suggest performing normalization when the scale of a feature is irrelevant or misleading.

Feature Scaling basically helps to normalize the data within a particular range. Normally several common class types contain the feature scaling function so that they make feature scaling automatically. However, the SVR Model in machine learning class is not a commonly used class type so we should perform feature scaling using Python.

from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
sc_y = StandardScaler()
X = sc_X.fit_transform(X)
y = sc_y.fit_transform(y)

Step 4: Fitting SVR to the dataset

from sklearn.svm import SVR
regressor = SVR(kernel = 'rbf')
regressor.fit(X, y)

Kernel is the most important feature. There are many types of kernels – linear, Gaussian, etc. Each is used depending on the dataset.

To explore this further, check out: Support Vector Machine (SVM) in Python and R

Step 5. Predicting a New Result

y_pred = regressor.predict(6.5)
y_pred = sc_y.inverse_transform(y_pred)

So, the prediction for y_pred(6, 5) will be 170,370.

Step 6. Visualizing the SVR results (for higher resolution and smoother curve)

X_grid = np.arange(min(X), max(X), 0.01) #this step required because data is feature scaled.
X_grid = X_grid.reshape((len(X_grid), 1))
plt.scatter(X, y, color = 'red')
plt.plot(X_grid, regressor.predict(X_grid), color = 'blue')
plt.title('Truth or Bluff (SVR)')
plt.xlabel('Position level')
plt.ylabel('Salary')
plt.show()

Implementing SVR in Python, svm regression python

This is what we get as output- the best fit line that has a maximum number of points. Quite accurate!

What is the difference between SVM and SVR?

Support Vector Machines (SVM) and Support Vector Regression (SVR) serve as supervised learning techniques in machine learning, each with unique functions and features.

Key Differences:

Support Vector Machine (SVM) is mostly utilized for tasks involving classification. The goal is to locate the best hyperplane that divides distinct classes within the feature space. The objective is to increase the distance between the nearest points of distinct classes, which are referred to as support vectors.

Support Vector Regression (SVR) is utilized for tasks involving regression. It forecasts values that are continual instead of distinct category labels. SVR aims to maximize the number of data points fitting within a given margin of tolerance (epsilon) while reducing errors outside this range.

Conclusion

Support Vector Regression (SVR) extends the principles of Support Vector Machines (SVM) to regression problems, offering a powerful tool for predicting continuous outputs. By leveraging various kernels such as quadratic, radial basis function, and sigmoid, SVR Model can handle complex and non-linear relationships in the data. Through this tutorial, we’ve explored the essential hyperparameters, implemented SVR in Python, and applied it to real-world datasets, demonstrating its versatility in artificial intelligence applications. Whether dealing with training samples in finance, engineering, or healthcare, SVR Model provides a robust approach to model continuous data effectively, enhancing the accuracy and reliability of predictive analytics.

Hope you like the article! Support vector regression (SVR) uses support vector machines to forecast continuous results, effectively managing linear and non-linear correlations. The SVR model shows robustness, versatility, and accuracy across different applications. If you found this information helpful, feel free to Share it.

Here is 10 YouTube Channels to Master Python

Key Takeaways

SVR extends Support Vector Machines (SVM) into regression problems, allowing for the prediction of continuous outcomes rather than classifying data into discrete categories as with a classifier.
SVR utilizes various kernel functions, such as quadratic, radial basis function, and sigmoid, to handle non-linear relationships in data, akin to how neural networks manage complex patterns.
Effective hyperparameter tuning, including choosing the right kernel and setting the epsilon parameter, is vital for maximizing SVR performance, similar to the role of gradient optimization in neural networks.
The SVR Model offers greater flexibility and robustness compared to traditional linear regression. It finds a hyperplane that best fits the data within a specified margin, making it suitable for more complex datasets.
Unlike logistic regression, primarily used for binary classification problems, Support Vector Regression (SVR) focuses on predicting continuous outcomes. SVR in Machine Learning leverages kernel functions to handle non-linear relationships in data, offering a more versatile approach for regression tasks.

Frequently Asked Questions

Q1. What are the applications of SVM regression?

A. Support Vector Regression (SVM) is a versatile algorithm used in finance, engineering, bioinformatics, natural language processing, image processing, and healthcare for accurate predictions. It commonly predicts stock prices, machine performance, protein structures, text classifications, sentiment analysis, object recognition, and medical outcomes.

Q2. How does the regularization parameter in SVM affect the regression model?

A. Regularization is a technique that avoids overfitting by penalizing large coefficients in the model. In SVM for Regression, the regularization parameter determines the trade-off between achieving a low error on the training data and minimizing the complexity of the regression model. A higher value of the regularization parameter increases the penalty for large coefficients, which helps to prevent the model from fitting the noise in the training data.

Q3. What are the benefits of using a polynomial kernel in SVM for regression?

A. A polynomial kernel helps in fitting a regression model that can capture more complex relationships in the input data. It transforms the original features into polynomial features of a given degree, thus allowing the model to learn non-linear relationships. This is especially beneficial in scenarios where the relationship between the dependent and independent variables is not linear, providing a more flexible and powerful model.

Alakh Sethi

Aspiring Data Scientist with a passion to play and wrangle with data and get insights from it to help the community know the upcoming trends and products for their better future.With an ambition to develop product used by millions which makes their life easier and better.

Free Courses

4.7

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

4.5

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

4.6

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

4.8

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

4.7

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

MUID

Used by Microsoft Clarity, to store and track visits across websites.

Expiry: 1 Year

Type: HTTP

_clck

Used by Microsoft Clarity, Persists the Clarity User ID and preferences, unique to that site, on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.

Expiry: 1 Year

Type: HTTP

_clsk

Used by Microsoft Clarity, Connects multiple page views by a user into a single Clarity session recording.

Expiry: 1 Day

Type: HTTP

SRM_I

Collects user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Years

Type: HTTP

SM

Use to measure the use of the website for internal analytics

Expiry: 1 Years

Type: HTTP

CLID

The cookie is set by embedded Microsoft Clarity scripts. The purpose of this cookie is for heatmap and session recording.

Expiry: 1 Year

Type: HTTP

SRM_B

Collected user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Months

Type: HTTP

_gid

This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected includes the number of visitors, the source where they have come from, and the pages visited in an anonymous form.

Expiry: 399 Days

Type: HTTP

_ga_#

Used by Google Analytics, to store and count pageviews.

Expiry: 399 Days

Type: HTTP

_gat_#

Used by Google Analytics to collect data on the number of times a user has visited the website as well as dates for the first and most recent visit.

Expiry: 1 Day

Type: HTTP

collect

Used to send data to Google Analytics about the visitor's device and behavior. Tracks the visitor across devices and marketing channels.

Expiry: Session

Type: PIXEL

AEC

cookies ensure that requests within a browsing session are made by the user, and not by other sites.

Expiry: 6 Months

Type: HTTP

G_ENABLED_IDPS

use the cookie when customers want to make a referral from their gmail contacts; it helps auth the gmail account.

Expiry: 2 Years

Type: HTTP

test_cookie

This cookie is set by DoubleClick (which is owned by Google) to determine if the website visitor's browser supports cookies.

Expiry: 1 Year

Type: HTTP

_we_us

this is used to send push notification using webengage.

Expiry: 1 Year

Type: HTTP

WebKlipperAuth

used by webenage to track auth of webenagage.

Expiry: Session

Type: HTTP

ln_or

Linkedin sets this cookie to registers statistical data on users' behavior on the website for internal analytics.

Expiry: 1 Day

Type: HTTP

JSESSIONID

Use to maintain an anonymous user session by the server.

Expiry: 1 Year

Type: HTTP

li_rm

Used as part of the LinkedIn Remember Me feature and is set when a user clicks Remember Me on the device to make it easier for him or her to sign in to that device.

Expiry: 1 Year

Type: HTTP

AnalyticsSyncHistory

Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

lms_analytics

Used to store information about the time a sync with the AnalyticsSyncHistory cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

liap

Cookie used for Sign-in with Linkedin and/or to allow for the Linkedin follow feature.

Expiry: 6 Months

Type: HTTP

visit

allow for the Linkedin follow feature.

Expiry: 1 Year

Type: HTTP

li_at

often used to identify you, including your name, interests, and previous activity.

Expiry: 2 Months

Type: HTTP

s_plt

Tracks the time that the previous page took to load

Expiry: Session

Type: HTTP

lang

Used to remember a user's language setting to ensure LinkedIn.com displays in the language selected by the user in their settings

Expiry: Session

Type: HTTP

s_tp

Tracks percent of page viewed

Expiry: Session

Type: HTTP

AMCV_14215E3D5995C57C0A495C55%40AdobeOrg

Indicates the start of a session for Adobe Experience Cloud

Expiry: Session

Type: HTTP

s_pltp

Provides page name value (URL) for use by Adobe Analytics

Expiry: Session

Type: HTTP

s_tslv

Used to retain and fetch time since last visit in Adobe Analytics

Expiry: 6 Months

Type: HTTP

li_theme

Remembers a user's display preference/theme setting

Expiry: 6 Months

Type: HTTP

li_theme_set

Remembers which users have updated their display / theme preferences

Expiry: 6 Months

Type: HTTP

Rahul Dev

Thanks for the article,it gave an intuitive understanding about SVR It would be really helpful if you could also include the dataset,used for the demonstration.

Venkat

The code is completely irrelevant to the dataset shown in the picture. Also this code is from Udemy course by Kiril Ermenko. Atleast give them the credit when you have plagiarized the code and content of the tutorial from elsewhere.

Junior Mukenze

Thank you for this article, is very clear and helpful. However, I have one question on the example you gave. And My question concern characteristics variables (X) and target variables (Y). How to use SVR if we have more then one (1) characteristic variables. Like if we want to consider Salary against position level and age?

Reading list

Basics of Machine Learning

Machine Learning Lifecycle

Importance of Stats and EDA

Understanding Data

Probability

Exploring Continuous Variable

Exploring Categorical Variables

Missing Values and Outliers

Central Limit theorem

Bivariate Analysis Introduction

Continuous - Continuous Variables

Continuous Categorical

Categorical Categorical

Multivariate Analysis

Different tasks in Machine Learning

Build Your First Predictive Model

Evaluation Metrics

Preprocessing Data

Linear Models

KNN

Selecting the Right Model

Feature Selection Techniques

Decision Tree

Feature Engineering

Naive Bayes

Multiclass and Multilabel

Basics of Ensemble Techniques

Advance Ensemble Techniques

Hyperparameter Tuning

Support Vector Machine

Advance Dimensionality Reduction

Unsupervised Machine Learning Methods

Recommendation Engines

Improving ML models

Working with Large Datasets

Interpretability of Machine Learning Models

Automated Machine Learning

Model Deployment

Deploying ML Models

Embedded Devices

Support Vector Regression Tutorial for Machine Learning

Learning Outcomes

Table of contents

What is a Support Vector Machine (SVM)?

Hyperparameters of the Support Vector Machine (SVM) Algorithm

Introduction to Support Vector Regression (SVR)

The Idea Behind Support Vector Regression

Implementing Support Vector Regression (SVR) in Python

Step 1: Importing the libraries

Step 2: Reading the dataset

Step 3: Feature Scaling

Step 4: Fitting SVR to the dataset

Step 5. Predicting a New Result

Step 6. Visualizing the SVR results (for higher resolution and smoother curve)

What is the difference between SVM and SVR?

Conclusion

Key Takeaways

Frequently Asked Questions

Login to continue reading and enjoy expert-curated content.

Free Courses

Generative AI - A Way of Life

Getting Started with Large Language Models

Building LLM Applications using Prompt Engineering

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Microsoft Excel: Formulas & Functions

Recommended Articles

Responses From Readers

Write for us

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID

_clck

_clsk