Digital Image Processing Real-Life Applications and getting started in Python

Prateek Majumder Last Updated : 21 Oct, 2024


This article was published as a part of the Data Science Blogathon

Introduction

Digital Image Processing consists of the various techniques and methods involved in manipulating images on a computer. The different operations performed on images, taken together, constitute Digital Image Processing.

Understanding what an Image actually is

An image is basically a two-dimensional signal. The signal function is f(x, y), where x and y are the coordinates of a point and the value of f at that point is the pixel intensity there. In digital form, a grayscale image is a two-dimensional array of numbers, typically between 0 and 255 for 8-bit images. A colour image has one such array per colour channel.
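To make the idea concrete, here is a minimal sketch that builds a tiny grayscale image by hand as a NumPy array. The array name tiny and its pixel values are made up for illustration and do not come from any real picture.

```python
import numpy as np

# A hypothetical 3x4 grayscale image: 3 rows (height) and 4 columns (width).
# Each entry is an 8-bit intensity, 0 = black and 255 = white.
tiny = np.array(
    [[0, 64, 128, 255],
     [32, 96, 160, 224],
     [255, 128, 64, 0]],
    dtype=np.uint8,
)

print(tiny.shape)   # (rows, columns)
print(tiny.dtype)   # uint8, so values are limited to 0..255

# f(x, y): NumPy indexes as [row, column], i.e. [y, x]
x, y = 2, 1
print(tiny[y, x])
```

Note the index order: the row (y) comes first and the column (x) second, which is easy to mix up when moving between image coordinates and array indices.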

Image processing is carried out with a few main goals in mind.

Image Processing helps in:

1. Improving the digital information we store.

2. Automating work with images.

3. Optimizing images for efficient storage and transmission.

Over the years, image processing has improved a lot, and there are a lot of modern commercial applications of image processing.

Image Processing Uses :

1. Image Correction, Sharpening, and Resolution Correction

Often, we wish we could make old images better. And that is possible nowadays. Zooming, sharpening, edge detection, high dynamic range edits all fall under this category. All these steps help in enhancing the image. Most editing software and Image correction code can do these things easily.
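As a rough illustration of sharpening and edge detection, the sketch below applies two of Pillow's built-in filters, ImageFilter.SHARPEN and ImageFilter.FIND_EDGES. The tiny synthetic image is an assumption used in place of a real photo, so the example runs without any file on disk.

```python
import numpy as np
from PIL import Image, ImageFilter

# Synthetic 8x8 grayscale test image (an assumption, used instead of a photo):
# the left half is black, the right half is a light grey.
pixels = np.zeros((8, 8), dtype=np.uint8)
pixels[:, 4:] = 200
photo = Image.fromarray(pixels)  # 2D uint8 array -> single-channel image

# Built-in Pillow filters: each returns a new image of the same size.
sharpened = photo.filter(ImageFilter.SHARPEN)
edges = photo.filter(ImageFilter.FIND_EDGES)

edge_values = np.asarray(edges)
# Interior pixels of one row: the response is non-zero at the intensity
# jump and zero where the neighbourhood is flat.
print(edge_values[3, 1:7])
```

With a real photograph you would replace the synthetic array with Image.open on your own file and then apply the same filter calls.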

2. Filters on Editing Apps and Social Media

Most editing apps and social media apps provide filters these days.

Above is an example of the original Image and filtered Image. Filters make the image look more visually appealing. Filters are usually a set of functions that change the colors and other aspects in an image that make the image look different. Filters are an interesting application of Image processing.
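A filter of this kind can be sketched as a function that rescales the colour channels. The example below is only an illustration: the function name warm_filter and the gain values are hypothetical and are not taken from any particular app.

```python
import numpy as np

def warm_filter(rgb, red_gain=1.2, blue_gain=0.8):
    """Return a copy of an (H, W, 3) uint8 image with a warmer tone.

    The gains are illustrative values, not taken from any particular app.
    """
    gains = np.array([red_gain, 1.0, blue_gain])
    scaled = rgb.astype(np.float64) * gains   # per-channel scaling
    # Round, then clip so values stay in the valid 8-bit range.
    return np.clip(np.rint(scaled), 0, 255).astype(np.uint8)

# A hypothetical 1x2 image: one mid-grey pixel and one bright red pixel.
sample = np.array([[[100, 100, 100], [250, 10, 10]]], dtype=np.uint8)
filtered = warm_filter(sample)
print(filtered)
```

Boosting red and reducing blue shifts the picture towards warmer tones. Clipping matters because a scaled value such as 250 × 1.2 would otherwise overflow the 0 to 255 range.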

3. Medical Technology :

In the medical field, Image Processing is used for various tasks like PET scan, X-Ray Imaging, Medical CT, UV imaging, Cancer Cell Image processing, and much more. The introduction of Image Processing to the medical technology field has greatly improved the diagnostics process.

( Image Source – https://axisimagingnews.com/radiology-products/imaging-equipment/x-ray/image-processing-software-mimics-grid-use-improve-image-quality )

The image on the left is the original image. The image on the right is the processed image. We can see that the processed image is far better and can be used for better diagnostics.

4. Computer / Machine Vision :

One of the most interesting and useful applications of Image Processing is in Computer Vision. Computer Vision is used to make the computer see, identify things, and process its entire environment. Important uses of Computer Vision include self-driving cars and drones, where it helps in obstacle detection, path recognition, and understanding the environment.

( Source: Paris streets in the eyes of Tesla Autopilot https://youtu.be/_1MHGUC_BzQ)

This is how typical Computer Vision works for Car Autopilots. The computer takes in live footage and analyses other cars, the road, and other obstacles.

5. Pattern recognition:

Pattern recognition is a part of Image Processing that involves AI and Machine Learning. Image processing is used to find out various patterns and aspects in images. Pattern Recognition is used for Handwriting analysis, Image recognition, Computer-aided medical diagnosis, and much more.

6. Video Processing:

Video is basically a rapid sequence of images, called frames. Various image processing techniques are used in Video Processing. Some methods of Video Processing are noise removal, image stabilization, frame rate conversion, detail enhancement, and much more.
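Because a video is a stack of frames, some video operations reduce to array operations along the frame axis. The sketch below shows one simple form of noise removal, temporal averaging, on simulated frames. It assumes a static scene and made-up noise levels, and it is only one of many possible denoising approaches.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the sketch is repeatable

# A hypothetical static scene: a 16x16 grey frame with constant value 120.
clean = np.full((16, 16), 120.0)

# Simulate 10 video frames of that scene, each with independent sensor noise.
frames = clean + rng.normal(0.0, 10.0, size=(10, 16, 16))

# Temporal averaging: average each pixel across the frame axis.
denoised = frames.mean(axis=0)

noise_before = np.std(frames[0] - clean)
noise_after = np.std(denoised - clean)
print(noise_before, noise_after)
```

Averaging only works this directly when nothing in the scene moves. For moving content, the frames would first need to be aligned.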

Getting started with Image Processing in Python:

Let us get started with some basic Image related tasks in Python. We will make use of PIL.

PIL

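The Python Imaging Library (PIL) is used for various image processing tasks. Its maintained fork, Pillow, is the package we install, and it is still imported under the name PIL.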

Installation:

pip install pillow

With PIL installed, we can now move to the code.

First, we work with some matplotlib functions.

import matplotlib.image as img
import matplotlib.pyplot as plt
import numpy as np
%matplotlib inline

The following image will be read. It is named image1.jpg.

Python Code:

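import matplotlib.image as mpimg
import matplotlib.pyplot as plt

# reading the jpg image into a NumPy array
# (the module is imported as mpimg so that the variable img does not shadow it)
img = mpimg.imread('image1.jpg')
plt.imshow(img)
plt.show()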

The image is read.

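# selecting the first (red) channel, which gives a 2D array
lum1 = img[:, :, 0]
plt.imshow(lum1)

Now the image is a single-channel 2D array: its shape has changed from (height, width, 3) to (height, width).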
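The effect of this slicing can be checked on a small stand-in array. The sketch below uses a hypothetical 2x2 RGB image, and it also shows a weighted grey value per pixel using the BT.601 weights, which is an alternative to taking a single channel and is not what the code above does.

```python
import numpy as np

# Hypothetical 2x2 RGB image standing in for the array returned by imread.
rgb = np.array(
    [[[255, 0, 0], [0, 255, 0]],
     [[0, 0, 255], [255, 255, 255]]],
    dtype=np.uint8,
)

red = rgb[:, :, 0]            # same slicing as used for lum1
print(rgb.shape, red.shape)   # the channel axis is dropped

# Weighted grey value per pixel (BT.601 weights, an alternative to one channel)
weights = np.array([0.299, 0.587, 0.114])
luma = np.rint(rgb @ weights).astype(np.uint8)
print(luma)
```

Taking only the red channel ignores green and blue entirely, while the weighted sum accounts for all three, which is usually closer to perceived brightness.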


Now we will display it with the “hot” colourmap, which maps each pixel value to a colour.

plt.imshow(lum1, cmap='hot')
plt.colorbar()

Image output:

Now we try a different colormap.

imgplot = plt.imshow(lum1)
imgplot.set_cmap('nipy_spectral')

Image output:

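Colormaps matter because lum1 holds a single number per pixel, and the colormap decides which colour each number is drawn with. A consistent, perceptually uniform colormap makes it easier to compare values within an image and between plots. Read more about Colourmaps: Choosing Colormaps in Matplotlib.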
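What a colormap does can be seen by calling it directly on a few numbers. This is a small sketch with made-up values already scaled to the range 0 to 1. It uses plt.get_cmap to look up the colormap by name.

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical scalar data already scaled to the range 0..1.
values = np.array([[0.0, 0.25, 0.5],
                   [0.5, 0.75, 1.0]])

cmap = plt.get_cmap("hot")   # look up a colormap by name
rgba = cmap(values)          # each scalar becomes an (R, G, B, A) tuple

print(rgba.shape)              # one extra axis of length 4
print(rgba[0, 0], rgba[1, 2])  # colours for the lowest and highest value
```

When imshow is given a 2D array, it first normalizes the data to this 0 to 1 range and then applies the colormap in the same way.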

Now let us have a look at why we called an image a 2D array.

#data type of lum1

print(type(lum1))

Output: <class 'numpy.ndarray'>

print(lum1)

[[ 92  91  89 … 169 168 169]
 [110 110 110 … 168 166 167]
 [100 103 108 … 164 163 164]
 …
 [ 97  96  95 … 144 147 147]
 [ 99  99  98 … 145 139 138]
 [102 102 103 … 149 137 137]]

The dots are just there to show that there are many more data points in between. One thing is certain: it is all numeric data.

Let us find the size of the array.

len(lum1)

Output: 320

len(lum1[300])

Output: 658

This gives us the dimensions of the image: 320 rows (height) by 658 columns (width), which is 320 × 658 = 210,560 pixels.

We will also verify this later.

Now, we work with PIL.

from PIL import Image

We will use this image file, named people.jpg.

img2 = Image.open('people.jpg')
plt.imshow(img2)

The image is read.

Now, we resize the image.

img2.thumbnail((50, 50), Image.LANCZOS)  # resizes image in-place; Image.ANTIALIAS was removed in Pillow 10
imgplot = plt.imshow(img2)
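The thumbnail call changes the image in place and keeps its aspect ratio. The sketch below contrasts that with resize, using a blank in-memory image as a stand-in for people.jpg (the 200x100 size and the colour are arbitrary assumptions).

```python
from PIL import Image

# A blank 200x100 image created in memory (a stand-in for people.jpg).
original = Image.new("RGB", (200, 100), color=(30, 120, 200))

# thumbnail() shrinks the image in place and keeps the aspect ratio,
# so the result fits inside the 50x50 box rather than filling it.
thumb = original.copy()
thumb.thumbnail((50, 50))
print(thumb.size)

# resize() returns a new image with exactly the requested size,
# which distorts the picture if the aspect ratio differs.
squashed = original.resize((50, 50))
print(squashed.size, original.size)
```

Working on a copy, as done here, keeps the full-size image available, since thumbnail would otherwise overwrite it.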

imgplot1 = plt.imshow(img2, interpolation="nearest")

imgplot2 = plt.imshow(img2, interpolation="bicubic")

Bicubic interpolation makes the enlarged thumbnail look smooth, almost blurred, compared with nearest-neighbour interpolation. But why do we purposefully blur images in Image Processing? Pattern Recognition and Computer Vision algorithms can struggle with very sharp images, because noise and fine texture show up as many spurious details. Blurring smooths these out, and it also makes the colour transitions across an image more gradual.
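Explicit blurring can be sketched with Pillow's ImageFilter.GaussianBlur. The noisy test image below is simulated, and the radius of 2 is an arbitrary choice for illustration.

```python
import numpy as np
from PIL import Image, ImageFilter

rng = np.random.default_rng(1)  # fixed seed for a repeatable example

# Hypothetical noisy image: a flat grey patch plus random noise.
noisy = np.clip(128 + rng.normal(0, 25, size=(32, 32)), 0, 255).astype(np.uint8)
noisy_img = Image.fromarray(noisy)

# Gaussian blur with a 2-pixel radius (the radius is an arbitrary choice).
smooth_img = noisy_img.filter(ImageFilter.GaussianBlur(radius=2))
smooth = np.asarray(smooth_img)

# Blurring averages neighbouring pixels, so pixel-to-pixel variation drops.
print(noisy.std(), smooth.std())
```

A larger radius removes more noise but also removes more genuine detail, so the value is usually tuned per task.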

Now, let us verify the dimensions of the car image we worked on earlier.

# read the image size with PIL
file = 'image1.jpg'
with Image.open(file) as image:
    width, height = image.size
# PIL reports the size as (width, height)
print(width, height)

These are the same dimensions we got earlier. PIL reports (width, height), so a width of 658 and a height of 320 match the 320 rows and 658 columns of the array.
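The two conventions are easy to confuse, so here is a short sketch that puts them side by side. It uses a blank in-memory image with the same dimensions as image1.jpg, which is an assumption made so the example does not need the file.

```python
import numpy as np
from PIL import Image

# In-memory stand-in with the same dimensions as image1.jpg in the article.
picture = Image.new("RGB", (658, 320))   # Image.new takes (width, height)

width, height = picture.size             # PIL order: (width, height)
rows, cols, channels = np.asarray(picture).shape  # NumPy order: (rows, cols, channels)

print(width, height)
print(rows, cols, channels)
```

In short, PIL puts the width first, while the NumPy array puts the number of rows (the height) first.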

Let us also try rotating and transposing the image.

#Relative Path 
img3 = Image.open("image1.jpg")  
#Angle given 
img_rot= img3.rotate(180)  
#Saved in the same relative location 
img_rot.save("rotated_picture.jpg")

This is the rotated image.

#transposing image  
transposed_img = img3.transpose(Image.FLIP_LEFT_RIGHT)
#Saved in the same relative location 
transposed_img.save("transposed_img.jpg")

This is the transposed image. With FLIP_LEFT_RIGHT, it is a mirror image of the original, flipped from left to right.
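What rotate and transpose do to the pixels is easiest to see on a very small image. The sketch below uses a hypothetical 2x3 grayscale array with distinct values so that each pixel can be followed.

```python
import numpy as np
from PIL import Image

# Hypothetical 2x3 grayscale image with distinct values so moves are visible.
grid = np.array([[1, 2, 3],
                 [4, 5, 6]], dtype=np.uint8)
small = Image.fromarray(grid)

rotated = np.asarray(small.rotate(180))
mirrored = np.asarray(small.transpose(Image.FLIP_LEFT_RIGHT))

print(rotated)    # rows and columns both reversed
print(mirrored)   # each row reversed, row order unchanged
```

A 180 degree rotation reverses both axes, whereas the left-right flip reverses only the columns, which is why the two saved pictures look different.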

Final Words:

Image Processing has various important applications and with time the methods and processes will also improve.

About me:

Prateek Majumder

Data Science and Analytics | Digital Marketing Specialist | SEO | Content Creation

Connect with me on Linkedin.

Thank You.

The media shown in this article on Digital Image Processing are not owned by Analytics Vidhya and are used at the Author’s discretion.

Prateek Majumder

Prateek is a dynamic professional with a strong foundation in Artificial Intelligence and Data Science, currently pursuing his PGP at Jio Institute. He holds a Bachelor's degree in Electrical Engineering and has hands-on experience as a System Engineer at TCS Digital, where he excelled in API management and data integration. Prateek also has a background in product marketing and analytics from his time with start-ups like AppleX and Milkie Way, Inc., where he was involved in growth campaigns and technical blog management. Recognized for his structured thinking and problem-solving abilities, he has received accolades like the Dr. Sudarshan Chakraborty Award for Best Student Performance. Fluent in multiple languages and passionate about technology, Prateek continues to expand his expertise in the rapidly evolving AI and tech landscape.
