“How did your neural network produce this result?” This question has sent many data scientists into a tizzy. It’s easy to explain how a simple neural network works, but what happens when you increase the layers 1000x in a computer vision project?
Our clients or end users require interpretability – they want to know how our model got to the final result. We can’t take a pen and paper to explain how a deep neural network works. So how do we shed this “black box” image of neural networks?
By visualizing them! The clarity that comes with visualizing the different features of a neural network is unparalleled. This is especially true when we’re dealing with a convolutional neural network (CNN) trained on thousands or even millions of images.
In this article, we will look at different techniques for visualizing convolutional neural networks. We will also work on extracting insights from these visualizations to tune our CNN model.
Note: This article assumes you have a basic understanding of Neural Networks and Convolutional Neural Networks. Below are three helpful articles to brush up or get started with this topic:
Why Should we use Visualization to Decode Neural Networks?
Setting up the Model Architecture
Accessing Individual Layers of a CNN
Filters – Visualizing the Building Blocks of CNNs
Activation Maximization – Visualizing what a Model Expects
Occlusion Maps – Visualizing what’s important in the Input
Saliency Maps – Visualizing the Contribution of Input Features
Class Activation Maps
Layerwise Output Visualization – Visualizing the Process
Why Should we use Visualization to Decode Neural Networks?
It’s a fair question. There are a number of ways to understand how a neural network works, so why take the less-travelled path of visualization?
Let’s answer this question through an example. Consider a project where we need to classify images of animals, like snow leopards and Arabian leopards. Intuitively, we can differentiate between these animals using the image background, right?
Both animals live in starkly contrasting habitats. The majority of the snow leopard images will have snow in the background while most of the Arabian leopard images will have a sprawling desert.
Here’s the problem – the model will end up classifying snow versus desert images. So, how do we make sure our model has correctly learned the distinguishing features between these two leopard types? The answer lies in visualization.
Visualization helps us see what features are guiding the model’s decision for classifying an image.
There are multiple ways to visualize a model, and we will try to implement some of them in this article.
Setting up the Model Architecture
I believe the best way of learning is by coding the concept. Hence, this is a very hands-on guide and I’m going to dive into the Python code straight away.
We will be using the VGG16 architecture with pretrained weights on the ImageNet dataset in this article. Let’s first import the model into our program and understand its architecture.
We will visualize the model architecture using the ‘model.summary()’ function in Keras. This is a very important step before we get to the model building part. We need to make sure the input and output shapes match our problem statement, hence we visualize the model summary.
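Here is a minimal sketch of this step, assuming the tf.keras API (the exact imports may differ depending on your Keras version):

```python
# Load VGG16 with pretrained ImageNet weights and print its architecture
from tensorflow.keras.applications import VGG16

model = VGG16(weights="imagenet", include_top=True)
model.summary()
```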
Running the above code prints the model summary.
We have a detailed architecture of the model along with the number of trainable parameters at every layer. I want you to spend a few moments going through the above output to understand what we have at hand.
This is important when we are training only a subset of the model layers (feature extraction). We can generate the model summary and ensure that the number of non-trainable parameters matches the layers that we do not want to train.
Also, we can use the total number of trainable parameters to check whether our GPU will be able to allocate sufficient memory for training the model. That’s a familiar challenge for most of us working on our personal machines!
Accessing Individual Layers
Now that we know how to get the overall architecture of a model, let’s dive deeper and try to explore individual layers.
It’s actually fairly easy to access the individual layers of a Keras model and extract the parameters associated with each layer. This includes the layer weights and other information like the number of filters.
Now, we will create dictionaries that map the layer name to its corresponding characteristics and layer weights:
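A minimal sketch of how these dictionaries can be built from model.layers:

```python
# Map each layer's name to its configuration (filter count, kernel size, trainable flag, ...)
layer_config = {layer.name: layer.get_config() for layer in model.layers}

# Map each layer's name to its weights (a list of NumPy arrays: kernel and bias)
layer_weights = {layer.name: layer.get_weights() for layer in model.layers}

# Inspect one convolutional layer
print(layer_config["block5_conv1"])
```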
Did you notice that the trainable parameter for our layer ‘block5_conv1‘ is True? This means we can update the layer weights by training the model further.
Visualizing the Building Blocks of CNNs – Filters
Filters are the basic building blocks of any Convolutional Neural Network. Different filters extract different kinds of features from an image. The below GIF illustrates this point really well:
As you can see, every convolutional layer is composed of multiple filters. Check out the output we generated in the previous section – the ‘block5_conv1‘ layer consists of 512 filters. Makes sense, right?
Let’s plot the first filter of the first convolutional layer of every VGG16 block:
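A minimal sketch that reads the kernels out of the weight dictionary built above; plotting the slice for input channel 0 is an assumption made for illustration:

```python
import matplotlib.pyplot as plt

blocks = ["block1_conv1", "block2_conv1", "block3_conv1",
          "block4_conv1", "block5_conv1"]

fig, axes = plt.subplots(1, len(blocks), figsize=(15, 3))
for ax, name in zip(axes, blocks):
    kernels = layer_weights[name][0]     # shape: (3, 3, in_channels, out_channels)
    first_filter = kernels[:, :, 0, 0]   # 3x3 slice of the first filter
    ax.imshow(first_filter, cmap="gray")
    ax.set_title(name)
    ax.axis("off")
plt.show()
```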
We can see the filters of different layers in the above output. All the filters are of the same shape since VGG16 uses only 3×3 filters.
Visualizing what a Model Expects – Activation Maximization
Let’s use the image below to understand the concept of activation maximization:
Which features do you feel will be important for the model to identify the elephant? Some major ones I can think of:
Tusks
Trunk
Ears
That’s how we instinctively identify elephants, right? Now, let’s see what we get when we try to optimize a random image to be classified as that of an elephant.
We know that every convolutional layer in a CNN looks for similar patterns in the output of the previous layer. The activation of a convolutional layer is maximized when the input consists of the pattern that it is looking for.
In the activation maximization technique, we start from a random input and iteratively update it so that the activation of the chosen unit (here, the output neuron for a class) is maximized. Equivalently, we minimize the negative of that activation as a loss.
How do we do this? We calculate the gradient of this loss with respect to the input, and then update the input accordingly:
Here’s the code for doing this:
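Here is a minimal sketch using plain gradient ascent in tf.keras; the class index, learning rate, and number of steps are assumptions:

```python
import numpy as np
import tensorflow as tf

class_idx = 385  # 'Indian elephant' in the standard ImageNet ordering (assumption)

# Start from a random image and repeatedly nudge it towards the target class
input_img = tf.Variable(np.random.uniform(-1, 1, (1, 224, 224, 3)).astype("float32"))
optimizer = tf.keras.optimizers.Adam(learning_rate=1.0)

for step in range(200):
    with tf.GradientTape() as tape:
        preds = model(input_img)
        loss = -preds[:, class_idx]   # maximizing the class score = minimizing its negative
    grads = tape.gradient(loss, input_img)
    optimizer.apply_gradients([(grads, input_img)])

# Rescale the optimized input to [0, 1] for display
result = input_img.numpy()[0]
result = (result - result.min()) / (result.max() - result.min())
plt.imshow(result)
plt.axis("off")
plt.show()
```

In practice, swapping the final softmax for a linear activation and maximizing the raw class logit tends to give cleaner visualizations than maximizing the softmax probability.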
Our model generated the below output using a random input for the class corresponding to Indian Elephant:
From the above image, we can observe that the model expects structures like a tusk, large eyes, and a trunk. This information is very important for checking the sanity of our dataset. For example, let’s say the model was focusing on background features like trees or long grass because Indian elephants are generally found in such habitats.
Then, using activation maximization, we can figure out that our dataset is probably not sufficient for the task and we need to add images of elephants in different habitats to our training set.
Visualizing what’s Important in the Input – Occlusion Maps
Activation maximization is used to visualize what the model expects in an image. Occlusion maps, on the other hand, help us find out which part of the image is important for the model.
Now, to understand how occlusion maps work, consider a model that classifies cars according to their manufacturer, such as Toyota or Audi:
Can you figure out which company manufactured the above car? Probably not because the part where the company logo is placed has been occluded in the image. That part of the image is clearly important for our classification purposes.
Similarly, to generate an occlusion map, we occlude part of the image and then calculate the probability of the image belonging to its class. If the probability decreases, the occluded part of the image is important for that class; otherwise, it is not.
Here, we record that probability as the value for the corresponding region of the image and then standardize these values to generate a heatmap:
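A simplified sketch of such a function (the 32-pixel patch size and the grey fill value are assumptions):

```python
import numpy as np

def iter_occlusion(image, size=32):
    """Slide a grey square of side `size` over the image and yield
    (x, y, occluded_image) for every position."""
    occlusion_value = 127.5   # grey patch, assuming pixel values in [0, 255]
    height, width = image.shape[0], image.shape[1]
    for y in range(0, height, size):
        for x in range(0, width, size):
            occluded = image.copy()
            occluded[y:y + size, x:x + size, :] = occlusion_value
            yield x, y, occluded
```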
The above code defines a function iter_occlusion that yields copies of the image with different portions masked.
Now, let’s load the image and plot it:
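A minimal sketch; the filename 'car.png' is an assumption:

```python
from tensorflow.keras.preprocessing import image as keras_image

img = keras_image.load_img("car.png", target_size=(224, 224))
img = keras_image.img_to_array(img)   # float array with values in [0, 255]

plt.imshow(img.astype("uint8"))
plt.axis("off")
plt.show()
```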
Next, we calculate the class probability for each masked version of the image and plot the resulting heatmap.
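Here is a minimal sketch that does both: it records the probability of the originally predicted class for every patch position and then plots the standardized heatmap (the patch size matches the generator defined above):

```python
from tensorflow.keras.applications.vgg16 import preprocess_input

occlusion_size = 32

# Class predicted for the unoccluded image
class_idx = np.argmax(model.predict(preprocess_input(img[np.newaxis].copy())))

heatmap = np.zeros((224 // occlusion_size, 224 // occlusion_size))
for x, y, occluded in iter_occlusion(img, size=occlusion_size):
    prob = model.predict(preprocess_input(occluded[np.newaxis]), verbose=0)[0][class_idx]
    heatmap[y // occlusion_size, x // occlusion_size] = prob

# Standardize the probabilities before plotting
heatmap = (heatmap - heatmap.mean()) / (heatmap.std() + 1e-8)
plt.imshow(heatmap, cmap="viridis")
plt.colorbar()
plt.show()
```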
Really interesting. We will now create a mask using the standardized heatmap probabilities and plot it:
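A minimal sketch of the mask; the threshold of -0.5 on the standardized scores is an assumption:

```python
# Regions whose occlusion caused a large drop in probability (low standardized
# score) are the ones the model relies on
mask = (heatmap < -0.5).astype("float32")

# Upsample the coarse 7x7 mask back to the 224x224 input resolution
mask = np.kron(mask, np.ones((occlusion_size, occlusion_size)))

plt.imshow(mask, cmap="gray")
plt.axis("off")
plt.show()
```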
Finally, we will apply the mask to our input image and plot that as well:
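And a minimal sketch of applying that mask to the input image:

```python
# Keep only the pixels inside the mask; everything else is blacked out
masked_image = img * mask[..., np.newaxis]

plt.imshow(masked_image.astype("uint8"))
plt.axis("off")
plt.show()
```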
Can you guess why we’re seeing only certain parts? That’s right – only those parts of the input image that had a significant contribution to its output class probability are visible. That, in a nutshell, is what occlusion maps are all about.
Visualizing the Contribution of Input Features – Saliency Maps
Saliency maps calculate the effect of every pixel on the output of the model. This involves calculating the gradient of the output with respect to every pixel of the input image.
This tells us how the output category changes with respect to small changes in the input image pixels. A positive gradient means that a small increase in that pixel’s value will increase the output value:
These gradients, which have the same shape as the image (the gradient is calculated with respect to every pixel), give us an intuition of where the model’s attention lies.
Let’s see how to generate saliency maps for any image. First, we will read the input image using the below code segment.
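A minimal sketch; the filename 'dog.jpg' is an assumption:

```python
img = keras_image.load_img("dog.jpg", target_size=(224, 224))
img = keras_image.img_to_array(img)

plt.imshow(img.astype("uint8"))
plt.axis("off")
plt.show()
```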
Input Image
Now, we will generate the saliency map for the image using the VGG16 model:
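A minimal sketch that computes a vanilla saliency map directly with a GradientTape:

```python
import tensorflow as tf

x = tf.convert_to_tensor(preprocess_input(img[np.newaxis].copy()))

with tf.GradientTape() as tape:
    tape.watch(x)
    preds = model(x)
    top_class_score = tf.reduce_max(preds[0])   # score of the top predicted class

# Gradient of the top class score with respect to every input pixel
grads = tape.gradient(top_class_score, x)[0]

# Take the maximum gradient magnitude across the colour channels
saliency = tf.reduce_max(tf.abs(grads), axis=-1).numpy()

plt.imshow(saliency, cmap="jet")
plt.axis("off")
plt.show()
```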
We see that the model focuses more on the facial part of the dog. Now, let’s look at the results with guided backpropagation:
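A sketch of guided backpropagation, assuming we can swap the activations of a second VGG16 copy in place: every ReLU is replaced with a version whose gradient is zeroed wherever either the forward activation or the incoming gradient is negative, and the input gradient is then computed exactly as above:

```python
@tf.custom_gradient
def guided_relu(x):
    def grad(dy):
        # Pass the gradient back only where both the activation and the
        # incoming gradient are positive
        return tf.cast(dy > 0, "float32") * tf.cast(x > 0, "float32") * dy
    return tf.nn.relu(x), grad

# Build a second copy of VGG16 and swap its ReLUs for the guided version
guided_model = VGG16(weights="imagenet", include_top=True)
for layer in guided_model.layers:
    if hasattr(layer, "activation") and layer.activation is tf.keras.activations.relu:
        layer.activation = guided_relu

x = tf.convert_to_tensor(preprocess_input(img[np.newaxis].copy()))
with tf.GradientTape() as tape:
    tape.watch(x)
    score = tf.reduce_max(guided_model(x)[0])

guided_grads = tape.gradient(score, x)[0].numpy()
guided_saliency = np.max(np.abs(guided_grads), axis=-1)

plt.imshow(guided_saliency, cmap="jet")
plt.axis("off")
plt.show()
```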
Guided backpropagation truncates all the negative gradients to 0, which means that only the pixels with a positive influence on the class probability are highlighted.
Class Activation Maps (Gradient Weighted)
Class activation maps are another neural network visualization technique, based on the idea of weighting the activation maps according to their gradients, i.e., their contribution to the output.
The following excerpt from the Grad-CAM paper gives the gist of the technique:
Gradient-weighted Class Activation Mapping (Grad-CAM), uses the gradients of any target concept (say logits for ‘dog’ or even a caption), flowing into the final convolutional layer to produce a coarse localization map highlighting the important regions in the image for predicting the concept.
In essence, we take the feature map of the final convolutional layer and weigh (multiply) every filter with the gradient of the output with respect to the feature map. Grad-CAM involves the following steps:
Take the output feature map of the final convolutional layer. The shape of this feature map is 14x14x512 for VGG16
Calculate the gradient of the output with respect to the feature maps
Apply Global Average Pooling to the gradients
Multiply each feature map by its corresponding pooled gradient, sum the weighted maps over the channel dimension, and apply a ReLU to obtain the class activation map
We can see the input image and its corresponding Class Activation Map below:
Now, let’s generate the Class Activation Map for the above image.
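Here is a minimal sketch of Grad-CAM on VGG16’s last convolutional layer, reusing the image loaded in the previous section; the layer name and the 16x upsampling factor follow from the 14x14x512 feature map shape mentioned above:

```python
conv_layer = model.get_layer("block5_conv3")   # last conv layer, output shape 14x14x512
grad_model = tf.keras.models.Model(model.inputs, [conv_layer.output, model.output])

x = tf.convert_to_tensor(preprocess_input(img[np.newaxis].copy()))
with tf.GradientTape() as tape:
    conv_output, preds = grad_model(x)
    class_score = tf.reduce_max(preds[0])      # score of the top predicted class

# Step 2: gradient of the class score with respect to the feature maps
grads = tape.gradient(class_score, conv_output)

# Step 3: global average pooling of the gradients gives one weight per filter
weights = tf.reduce_mean(grads, axis=(1, 2))   # shape (1, 512)

# Step 4: weighted sum of the feature maps, followed by a ReLU
cam = tf.reduce_sum(conv_output * weights[:, tf.newaxis, tf.newaxis, :], axis=-1)[0]
cam = tf.nn.relu(cam).numpy()
cam /= (cam.max() + 1e-8)

# Upsample the 14x14 map to 224x224 and overlay it on the input image
cam = np.kron(cam, np.ones((16, 16)))
plt.imshow(img.astype("uint8"))
plt.imshow(cam, cmap="jet", alpha=0.5)
plt.axis("off")
plt.show()
```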
Visualizing the Process – Layerwise Output Visualization
The starting layers of a CNN generally look for low-level features like edges. The features change as we go deeper into the model.
Visualizing the output at different layers of the model helps us see what features of the image are highlighted at the respective layer. This step is particularly important to fine-tune an architecture for our problems. Why? Because we can see which layers give what kind of features and then decide which layers we want to use in our model.
For example, visualizing layer outputs can help us compare the performance of different layers in the neural style transfer problem.
Let’s see how we can get the output at different layers of a VGG16 model:
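A minimal sketch that exposes the output of the first convolutional layer in every block and plots one channel from each; reloading the car image and picking channel 0 are assumptions:

```python
# Reload the car image used earlier in the article
img = keras_image.load_img("car.png", target_size=(224, 224))
img = keras_image.img_to_array(img)

conv_layers = [layer for layer in model.layers if "conv1" in layer.name]
activation_model = tf.keras.models.Model(model.inputs,
                                         [layer.output for layer in conv_layers])

activations = activation_model.predict(preprocess_input(img[np.newaxis].copy()))

fig, axes = plt.subplots(1, len(conv_layers), figsize=(15, 3))
for ax, act, layer in zip(axes, activations, conv_layers):
    ax.imshow(act[0, :, :, 0], cmap="viridis")   # first channel of the layer's output
    ax.set_title(layer.name)
    ax.axis("off")
plt.show()
```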
The above output shows the different features that are extracted from the image by every layer of VGG16 (except block 5). We can see that the starting layers correspond to low-level features like edges, whereas the later layers look at features like the roof, exhaust, etc. of the car.
End Notes
Visualization never ceases to amaze me. There are multiple ways to understand how a technique works, but visualizing it makes it a whole lot more fun. Here are a couple of resources you should check out:
The process of feature extraction in neural networks is an active research area and has led to the development of awesome tools like TensorSpace and Activation Atlases.
TensorSpace is also a neural network visualization tool that supports multiple model formats. It lets you load your model and visualize it interactively. TensorSpace also has a playground where multiple architectures are available for visualization, which you can play around with.
Let me know if you have any questions or feedback on this article. I’ll be happy to get into a discussion!