Let’s Start with Image Preprocessing using SKimage

kajal Last Updated : 24 Feb, 2023

9 min read

Introduction

An image is a two-dimensional representation of a visual subject, like a photograph, painting, or drawing. In digital imaging, images are stored as arrays of pixel values, where each pixel represents a sample of the image’s brightness and color. The color of each pixel can be represented by one or multiple channels, like the red, green, and blue (RGB) channels in traditional color images. In this article, you will learn various image preprocessing techniques.

Images can be processed using computer algorithms to change their appearance or extract information. Image processing techniques include operations like resizing, cropping, rotating, filtering, and thresholding. These operations are performed on the pixel values to modify the image or extract information about its content. Image processing is used in so many applications, including computer vision, medical imaging, and digital art.

By learning Image Preprocessing using SKimage, you will be able to:

Understand the importance of image preprocessing in image analysis and machine learning.
Learn how to use various SKimage functions for image filtering, enhancement, restoration, and transformation.
Apply image preprocessing techniques such as noise reduction, edge detection, and image thresholding to improve image quality.
Perform common image preprocessing tasks such as image resizing, cropping, and rotation.
Use feature extraction techniques to extract meaningful information from images.
Implement object detection algorithms for image analysis applications.

This article was published as a part of the Data Science Blogathon.

Understanding the Image data

Let’s go with an image that can be broken down into a matrix of numbers, where each number represents the strength. And this strength could be anywhere between 0 (representing black) and 255 (representing white). So, a monochromatic image can be represented by a single matrix.

But what can we do when it is a colored image below?

Image Preprocessing

If we have to represent an image, we will break it into three images of three different colors: red, green, and blue. We can store the intensities of each color in two single matrices.

So the image will be broken down into three matrices: one for the red color, one for the green color, and one for the blue color so that we can represent the image in an N*M*3 matrix.

Any image that is n*m pixels wide can always be defined as a matrix N*M*3 anywhere in the computer.

When processing image data, it is common to convert the image into a numerical representation, like a matrix, so that computer algorithms can process it. The numerical representation of the image is called a digital image, and the data in the digital image can be manipulated using mathematical operations to perform different image-processing tasks.

Overall, understanding image data is necessary for working with image processing algorithms and extracting information from images.

Steps Involved in Processing an Image

Here are the common steps involved in processing an image in Python:

Importing libraries: You need to import the libraries that you will use to process the image, like NumPy and OpenCV.
Loading the image: You can load the image using the imread function in OpenCV.
Pre-processing: Depending on the image and the desired outcome, you may need to perform pre-processing steps like resizing, grayscaling, or thresholding.
Manipulating pixels: You can manipulate the pixels of the image using NumPy arrays to perform operations like cropping, rotating, and making color-based selections.
Filtering: You can use different filters to smooth or sharpen the image, like Gaussian or median filters.
Edge detection: Edge detection, which can be performed using methods such as Canny, is used to identify boundaries between objects in an image.

Understanding Transformations in Images

Transformations in images refer to mathematical operations applied to an image to change its appearance or extract useful information from it. Many types of transformations can be applied to images, including:

Geometric transformations: These transformations change the spatial relationship between pixels in an image, like rotation, scaling, and translation.
Color transformations: These transformations change the color properties of an image, like brightness, contrast, and saturation.
Filtering: Filtering refers to the process of removing noise from an image or enhancing its features, like smoothing or sharpening.
Edge detection: Edge detection, which can be performed using methods such as Canny, is used to identify boundaries between objects in an image.
Feature extraction: This refers to the process of extracting meaningful information from an image, like corners or key points, using algorithms like Harris corner detection or SIFT.

These transformations are performed using mathematical algorithms and can be implemented in software like Python or MATLAB. Transformations are essential to image processing and are crucial in applications like computer vision, medical image analysis, and facial recognition.

Now we will start to load the image and do some operations on the image using the Scikit Image Library.

How to Load an Image?

Depending on the programming language and tools being used, there are several ways to load an image. Here are a few common ways:

Using an image processing library: Many image processing libraries, like OpenCV, Pillow, and Scikit-Image, provide functions to load images into memory. For example, you can use the imread function to load an image in OpenCV.
Using an image file reader: You can use a function or class specific to a file format (like JPEG or PNG) to read the image data from a file. For example, you can use the imageio library in Python to read image files.
Loading from a URL: You can download an image from a URL and then load it into memory.

Once the image is uploaded into memory, you can perform various operations on it, like resizing, cropping, color conversion, and filtering, using the functions provided by the image processing library.

Here is an example of how to load an image using the scikit-image (skimage) library in Python:

originating from Skimage Import IO
# Open the image
io.imread("image.jpg") = image
# Show the image
io.imshow(image)
io.show()

In this example, the imread function from the io module of the skimage library is used to load an image file called image.jpg into memory. The resulting image data is then displayed using the imshow function from the io module, followed by a call to “io.show()” to display the image. The imshow function automatically adjusts the image for display and handles issues like color channels and aspect ratio.

How to Visualize an Image?

Visualizing an image involves displaying the image data on a screen or output device. Visualizing an image depends on the programming language and tools being used. Here are a few common ways:

Using an image processing library: Many image processing libraries, like OpenCV, Pillow, and scikit-image, provide functions to display images. For example, you can use the imshow function to display an image in OpenCV.
Using a plotting library: You can use a plotting library, like Matplotlib in Python, to display an image. For example, you can use the imshow function from the Matplotlib Pyplot module to display an image in Python.

Once the image is displayed, you can interact with it by zooming in and out, panning, and readjusting the display settings.

Here is an example of how to visualize an image using the scikit-image (skimage) library in Python:

from skimage import io
import matplotlib.pyplot as  plt
# Load the image
image = io.imread("image.jpg")
# Display the image
plt.imshow(image)
plt.show()

In this example, the imread function from the "io ” module of the skimage library is used to load an image file called image.jpg into memory. The resulting image data is then displayed using the imshow function from the matplotlib.pyplot module, followed by a call to plt.show() to display the image.

Image Preprocessing – Resizing Images

Python’s “scikit-image” (skimage) library provides several functions for resizing images. One commonly used function for this purpose is “resize” from the “transform” module.

Here is an example of using skimage to resize an image:

import skimage

from skimage import io, transform

# Load the image
image = io.imread(“example.jpg”)
resized_image = transform.resize(image, (300, 300))

# Save the resized image
io.imsave(“resized_image.jpg”, resized_image)

Image Preprocessing

In this example, the original image is read using the imread function from the io module. The resize the function is then used to resize the image to a size of (300 300) pixels. Finally, the resized image is saved using the imsave function.

Image Preprocessing – Reshaping an Image

In Python, the “scikit-image” (skimage) library provides several functions for reshaping images. Here is an example of using skimage to reshape an image:

# import colour sub-module
from skimage import color

# reading the image
image = imread('index.png')

# converting image to grayscale
grayscale_image = color.rgb2gray(image)
grayscale_image.shape

import numpy as np
new_shape = (grayscale_image.shape[0]*grayscale_image.shape[1])

# reshape 
image2 = np.reshape(grayscale_image, new_shape)
image2.shape

If converting a 4 by 4 2-D image to 1-D, we will have 4×4=16 values.

Image Preprocessing – Image Rotation

Image rotation involves rotating an image about its center by a specified angle. In scikit-image, you can use the rotate function from the transform module to rotate an image. Here’s an example in Python:

import numpy as np
from skimage import io
from skimage.transform import rotate

# Load an image
image = io.imread("image.jpg")

# Rotate the image by 180 degrees
rotated_image = rotate(image, angle=180, resize=True)

# Save the rotated image
io.imsave("rotated_image.jpg", rotated_image)

Image Preprocessing

In this example, the rotate function is used to rotate the input image by 180 degrees. The first argument to the rotate function is the input image, and the angle argument specifies the rotation angle in degrees.

Image Preprocessing – Image Cropping

Image cropping involves extracting a portion of an image by specifying a crop region. In scikit-image, you can use slicing and indexing to crop an image. Here’s an example in Python:

import numpy as np
from skimage import io

# Load an image
image = io.imread("image.jpg")
rows, cols = image.shape[:2]
cropped_image = image[rows//4:-rows//4, cols//4:-cols//4]

# Save the cropped image
io.imsave("cropped_image.jpg", cropped_image)

In this example, the input image is first loaded using the imread function. The crop region is specified by slicing the image along both dimensions, such that the first and last quarter of the rows and columns are removed.

Image Preprocessing – Image Flipping

Image flipping in Python can be performed using the “cv2.flip” function from the OpenCV library. The “cv2.flip” function takes two arguments: the input image and a flip code. The flip code specifies the flipping to be performed and can be one of the following values:

cv2.FLIP_HORIZONTAL: Flip the image horizontally
cv2.FLIP_VERTICAL: Flip the image vertically
cv2.FLIP_BOTH: Flip the image both horizontally and vertically

Flipping can be considered an extension of rotation, allowing left-right and up-down image flipping.

import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline

# reading the image
image = imread('index.png')
image = np.array(image)

imshow(image)
plt.title('Original Image')

Now, what do you do if you have to flip the images, again read the image and now in order to flip it? let’s say we are doing left to right flip. I can easily do that using the “fliplr()” function.

# flip image left-to-right
flipLR = np.fliplr(image)

plt.imshow(flipLR)
plt.title('Left to Right Flipped')

Image Preprocessing

And these are the ways in which you can flip the images.

Image Preprocessing – Brightness Manipulation

Brightness manipulation in Python can be performed using the image library. The image library provides the exposure module, which includes the function of adjusting gamma that can be used to change the brightness of an image.

Images with different brightness can be used to make the model robust to changes in lighting conditions; this is important for systems that work in outdoor lightings, like CCTV cameras on traffic signals.

from skimage.exposure import adjust_gamma

# read the image
image = imread('index.png')

plt.title('Original Image')
imshow(image)

Image Preprocessing

I am going to change the gamma value, and that changes the strength of the image. So, this is my bright image.

# brighten the image
bright = adjust_gamma(image,gamma=0.5,gain=1)

imshow(bright)
plt.title('Brightened IMage')

Image Preprocessing

So, approximately I can make all of these changes to the image.

Conclusion

Scikit-image is a popular Python library for image processing that provides tools and functions for working with images. Here’s a summary of some of the critical features of image for image processing:

Image I/O: image provides functions for reading and writing images to disk, including imread for reading an image and imsave for saving an image.
Image restoration: image provides algorithms for restoring degraded images, including functions for removing noise and correcting for blurring or distortion.
Image analysis: image provides functions for analyzing image properties, including histograms, gradient magnitude, and texture analysis.
Image visualization: image provides functions for visualizing images and their properties, including plotting and displaying images, histograms, and other visual representations of image data.

Overall, image is a comprehensive and well-documented library for image processing that is widely used in scientific, medical, and industrial applications.

The media shown in this article is not owned by Analytics Vidhya and is used at the Author’s discretion.

kajal

Hi, I am Kajal Kumari. have completed my Master’s from IIT(ISM) Dhanbad in Computer Science & Engineering. As of now, I am working as Machine Learning Engineer in Hyderabad.
hope that you have enjoyed the article. If you like it, share it with your friends also. Please feel free to comment if you have any thoughts that can improve my article writing.

If you want to read my previous blogs, you can read Previous Data Science Blog posts here. Connect with me

Free Courses

4.7

Generative AI - A Way of Life

Explore Generative AI for beginners: create text and images, use top AI tools, learn practical skills, and ethics.

4.5

Getting Started with Large Language Models

Master Large Language Models (LLMs) with this course, offering clear guidance in NLP and model training made simple.

4.6

Building LLM Applications using Prompt Engineering

This free course guides you on building LLM apps, mastering prompt engineering, and developing chatbots with enterprise data.

4.8

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Explore practical solutions, advanced retrieval strategies, and agentic RAG systems to improve context, relevance, and accuracy in AI-driven applications.

4.7

Microsoft Excel: Formulas & Functions

Master MS Excel for data analysis with key formulas, functions, and LookUp tools in this comprehensive course.

MUID

Used by Microsoft Clarity, to store and track visits across websites.

Expiry: 1 Year

Type: HTTP

_clck

Used by Microsoft Clarity, Persists the Clarity User ID and preferences, unique to that site, on the browser. This ensures that behavior in subsequent visits to the same site will be attributed to the same user ID.

Expiry: 1 Year

Type: HTTP

_clsk

Used by Microsoft Clarity, Connects multiple page views by a user into a single Clarity session recording.

Expiry: 1 Day

Type: HTTP

SRM_I

Collects user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Years

Type: HTTP

SM

Use to measure the use of the website for internal analytics

Expiry: 1 Years

Type: HTTP

CLID

The cookie is set by embedded Microsoft Clarity scripts. The purpose of this cookie is for heatmap and session recording.

Expiry: 1 Year

Type: HTTP

SRM_B

Collected user data is specifically adapted to the user or device. The user can also be followed outside of the loaded website, creating a picture of the visitor's behavior.

Expiry: 2 Months

Type: HTTP

_gid

This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected includes the number of visitors, the source where they have come from, and the pages visited in an anonymous form.

Expiry: 399 Days

Type: HTTP

_ga_#

Used by Google Analytics, to store and count pageviews.

Expiry: 399 Days

Type: HTTP

_gat_#

Used by Google Analytics to collect data on the number of times a user has visited the website as well as dates for the first and most recent visit.

Expiry: 1 Day

Type: HTTP

collect

Used to send data to Google Analytics about the visitor's device and behavior. Tracks the visitor across devices and marketing channels.

Expiry: Session

Type: PIXEL

AEC

cookies ensure that requests within a browsing session are made by the user, and not by other sites.

Expiry: 6 Months

Type: HTTP

G_ENABLED_IDPS

use the cookie when customers want to make a referral from their gmail contacts; it helps auth the gmail account.

Expiry: 2 Years

Type: HTTP

test_cookie

This cookie is set by DoubleClick (which is owned by Google) to determine if the website visitor's browser supports cookies.

Expiry: 1 Year

Type: HTTP

_we_us

this is used to send push notification using webengage.

Expiry: 1 Year

Type: HTTP

WebKlipperAuth

used by webenage to track auth of webenagage.

Expiry: Session

Type: HTTP

ln_or

Linkedin sets this cookie to registers statistical data on users' behavior on the website for internal analytics.

Expiry: 1 Day

Type: HTTP

JSESSIONID

Use to maintain an anonymous user session by the server.

Expiry: 1 Year

Type: HTTP

li_rm

Used as part of the LinkedIn Remember Me feature and is set when a user clicks Remember Me on the device to make it easier for him or her to sign in to that device.

Expiry: 1 Year

Type: HTTP

AnalyticsSyncHistory

Used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

lms_analytics

Used to store information about the time a sync with the AnalyticsSyncHistory cookie took place for users in the Designated Countries.

Expiry: 6 Months

Type: HTTP

liap

Cookie used for Sign-in with Linkedin and/or to allow for the Linkedin follow feature.

Expiry: 6 Months

Type: HTTP

visit

allow for the Linkedin follow feature.

Expiry: 1 Year

Type: HTTP

li_at

often used to identify you, including your name, interests, and previous activity.

Expiry: 2 Months

Type: HTTP

s_plt

Tracks the time that the previous page took to load

Expiry: Session

Type: HTTP

lang

Used to remember a user's language setting to ensure LinkedIn.com displays in the language selected by the user in their settings

Expiry: Session

Type: HTTP

s_tp

Tracks percent of page viewed

Expiry: Session

Type: HTTP

AMCV_14215E3D5995C57C0A495C55%40AdobeOrg

Indicates the start of a session for Adobe Experience Cloud

Expiry: Session

Type: HTTP

s_pltp

Provides page name value (URL) for use by Adobe Analytics

Expiry: Session

Type: HTTP

s_tslv

Used to retain and fetch time since last visit in Adobe Analytics

Expiry: 6 Months

Type: HTTP

li_theme

Remembers a user's display preference/theme setting

Expiry: 6 Months

Type: HTTP

li_theme_set

Remembers which users have updated their display / theme preferences

Expiry: 6 Months

Type: HTTP

Reading list

Introduction to Computer Vision

Getting Started with Image Data

Introduction to CNN and Implementation

Introduction to CNN and implementation

Introduction to Transfer Learning

CNN Visualization

Overview of Pretrained Models

Inception

ResNets

DenseNets

CSRNet

Introduction to Object Detection

Region Based Convolutional Neural Network

Single Stage Networks

Transformed Based Object Detection Models

Face Detection

Object Tracking

Pose Estimation

Introduction to Image Segmentation

Understanding Deep Learning Architectures for Image Segmentation

Video Classification

Introduction to Image Generation

Experiments with Generative Adversarial Networks

Zero and Few Shot Learning

Model Deployment

Let’s Start with Image Preprocessing using SKimage

Introduction

Table of Contents

Understanding the Image data

Steps Involved in Processing an Image

Understanding Transformations in Images

How to Load an Image?

How to Visualize an Image?

Image Preprocessing – Resizing Images

Image Preprocessing – Reshaping an Image

Image Preprocessing – Image Rotation

Image Preprocessing – Image Cropping

Image Preprocessing – Image Flipping

Image Preprocessing – Brightness Manipulation

Conclusion

Login to continue reading and enjoy expert-curated content.

Free Courses

Generative AI - A Way of Life

Getting Started with Large Language Models

Building LLM Applications using Prompt Engineering

Improving Real World RAG Systems: Key Challenges & Practical Solutions

Microsoft Excel: Formulas & Functions

Recommended Articles

Responses From Readers

Write for us

Analytics Vidhya (4)

brahmaid

csrftoken

Identityid

sessionid

Google (1)

g_state

Microsoft (7)

MUID

_clck

_clsk

SRM_I

SM

CLID

SRM_B

Google (7)

_gid

_ga_#

_gat_#

collect

AEC

G_ENABLED_IDPS

test_cookie

Webengage (2)

_we_us

WebKlipperAuth

LinkedIn (16)

ln_or

JSESSIONID