Data Structures in Python

Aniruddha Bhandari Last Updated : 03 Apr, 2025

Data structures in Python are fundamental constructs used to organize, store, and manage data efficiently. Python offers built-in data structures like lists, tuples, sets, and dictionaries, each serving unique purposes. Lists are ordered and mutable, tuples are ordered and immutable, sets are unordered collections of unique elements, and dictionaries store key-value pairs. These structures enable efficient data manipulation, retrieval, and storage, making Python a powerful tool for programming and problem-solving. Understanding data structures is essential for writing optimized and scalable code. In this article, you will get to know all about data structures in Python.

What are Data Structures?
Data Structure #1: Lists in Python
Data Structure #2: Tuples in Python
Data Structure #3: Dictionary in Python
Data Structure #4: Sets in Python
User-Defined Data Structures
Conclusion
Frequently Asked Questions

What are Data Structures?

Data structures are a way of storing and organizing data efficiently. This will allow you to access and perform operations on the data easily.

There is no one-size-fits-all kind of model when it comes to data structures. You will want to store data in different ways to cater to the needs of the hour. Maybe you want to store all types of data together, or you want something for faster searching of data, or maybe something that stores only distinct data items.

Luckily, Python has a host of in-built data structures that help us to organize our data easily. Therefore, it becomes imperative to get acquainted with these first so that when dealing with data, we know exactly which data structure will solve our purpose effectively.

Data Structure #1: Lists in Python

Lists in Python are the most versatile data structure. They are used to store heterogeneous data items, from integers to strings or even another list! They are also mutable, meaning their elements can be changed even after creating the list.

Creating Lists

Lists are created by enclosing elements within [square] brackets, and each item is separated by a comma:

Python Code:

# Creating a list
alist = ['science', 'math', 'english']
print(type(alist))
print(alist)

Since each element in a list has its own distinct position, having duplicate values in a list is not a problem:
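
Here is a minimal sketch of that point. The list name `subjects` and its values are made up for illustration.

```python
# Each element sits at its own position, so repeated values are allowed.
subjects = ['science', 'math', 'science', 'english', 'math']

print(subjects)                   # duplicates are kept, in the order entered
print(subjects.count('science'))  # how many times 'science' appears
```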

Accessing List Elements

To access elements of a list, we use Indexing. Each element in a list has an index related to it depending on its position in the list. The first element of the list has the index 0, the next element has index 1, and so on. The last element of the list has an index of one less than the length of the list.

But indexes don’t always have to be positive; they can be negative too. What do you think negative indexes indicate?

While positive indexes return elements from the start of the list, negative indexes return values from the end of the list. This saves us from the trivial calculation we would have to perform otherwise if we wanted to return the nth element from the end of the list. So instead of trying to return List_name[len(List_name)-1] element, we can simply write List_name[-1].

Using negative indexes, we can return the nth element from the end of the list easily. The last element has the index -1, the second-last element has the index -2, and so on. Remember, the index 0 still refers to the very first element in the list.

But what if we wanted to return a range of elements between two positions in the lists? This is called Slicing. All we have to do is specify the start and end index within which we want to return all the elements – List_name[start : end].

One important thing to remember is that the element at the end index is never included. Only elements from the start index to the index equaling end-1 will be returned.
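
The indexing and slicing rules above can be tried out with a short sketch. The list `letters` is an assumed example, not taken from the original figures.

```python
letters = ['a', 'b', 'c', 'd', 'e']

first = letters[0]         # positive indexes count from the start
last = letters[-1]         # same element as letters[len(letters) - 1]
second_last = letters[-2]  # negative indexes count from the end
middle = letters[1:4]      # start index 1 is included, end index 4 is not

print(first, last, second_last)  # a e d
print(middle)                    # ['b', 'c', 'd']
```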

Appending Values in Lists

We can add new elements to an existing list using the append() or insert() methods:

append() – Adds an element to the end of the list
insert() – Adds an element to a specific position in the list that needs to be specified along with the value
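
A small sketch of both methods, using an assumed list called `marks`:

```python
marks = [70, 85, 90]

marks.append(95)     # goes to the end of the list
marks.insert(1, 60)  # goes to index 1; later elements shift one place right

print(marks)  # [70, 60, 85, 90, 95]
```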

Removing Elements from Lists

Removing elements from a list is as easy as adding them and can be done using the remove() or pop() methods:

remove() – Removes the first occurrence from the list that matches the given value
pop() – This is used when we want to remove an element at a specified index from the list. However, if we don’t provide an index value, the last element will be removed from the list
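
The difference between the two methods is easier to see in a sketch. The list `colors` is an assumed example.

```python
colors = ['red', 'green', 'blue', 'green', 'yellow']

colors.remove('green')           # removes only the first 'green'
popped_at_index = colors.pop(1)  # removes and returns the element at index 1
popped_last = colors.pop()       # no index given, so the last element is removed

print(popped_at_index, popped_last)  # blue yellow
print(colors)                        # ['red', 'green']
```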

Sorting Lists

Sorting is one of the most common operations you will perform on a list, so it is essential to know about the sort() method. It lets you sort list elements in place, in either ascending or descending order.

But where things get a bit tricky is when you want to sort a list containing string elements. How do you compare two strings? Well, strings are compared using the Unicode code points of their characters (for plain English text, these are the same as the ASCII values). Each character in the string has an integer value associated with it. We use these values to sort the strings.

On comparing two strings, we compare the integer values of each character from the beginning. If we encounter the same characters in both strings, we compare the next character until we find two differing characters. It is, of course, done internally, so you don’t have to worry about it!
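
To make this concrete, here is a hedged sketch of sort() on numbers and on strings. The sample lists are assumptions chosen for illustration, and `ord()` is used only to reveal the integer value behind a character.

```python
numbers = [4, 1, 3, 2]
numbers.sort()              # ascending order, in place
ascending = list(numbers)
numbers.sort(reverse=True)  # descending order, in place
descending = list(numbers)

words = ['banana', 'Apple', 'cherry', 'apple']
words.sort()                # compared character by character

print(ascending, descending)  # [1, 2, 3, 4] [4, 3, 2, 1]
print(ord('A'), ord('a'))     # 65 97, so 'Apple' sorts before 'apple'
print(words)                  # ['Apple', 'apple', 'banana', 'cherry']
```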

Concatenating Lists

We can even concatenate two or more lists using the + symbol. This will return a new list containing elements from both the lists:
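
For instance, with two assumed lists `a` and `b`:

```python
a = [1, 2]
b = [3, 4]

combined = a + b  # a new list; a and b themselves are left unchanged

print(combined)  # [1, 2, 3, 4]
print(a, b)      # [1, 2] [3, 4]
```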

List Comprehensions

A very interesting application of Lists is List Comprehension, which provides a neat way of creating new lists. These new lists are created by applying an operation on each element of an existing list. It will be easy to see their impact if we first check out how it can be done using the good old for-loops:
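
As a sketch of the loop-based approach, assume we want the square of every number in a list (squaring is just an example operation chosen here):

```python
numbers = [1, 2, 3, 4]

# Build the new list one element at a time.
squares_loop = []
for n in numbers:
    squares_loop.append(n ** 2)

print(squares_loop)  # [1, 4, 9, 16]
```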

Now, we will see how we can concisely perform this operation using list comprehensions:
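
A minimal sketch, again assuming the operation is squaring each number in a list:

```python
numbers = [1, 2, 3, 4]

# The expression, the loop variable, and the source list fit on one line.
squares = [n ** 2 for n in numbers]

print(squares)  # [1, 4, 9, 16]
```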

See the difference? List comprehensions are a useful asset for any data scientist because you have to write concise and readable code on a daily basis!

Stacks & Queues using Lists

A list is an in-built data structure in Python. But we can use it to create user-defined data structures. Two very popular user-defined data structures built using lists are Stacks and Queues.

Stacks are a list of elements in which elements are added or deleted from the end of the list. Think of it as a stack of books: whenever you need to add or remove a book, you do it from the top. It uses the simple concept of Last-In-First-Out.

Queues, on the other hand, are a list of elements in which elements are added at the end of the list, but the deletion of elements takes place from the front of the list. You can think of it as a queue in the real world. The queue becomes shorter when people from the front exit the queue. The queue becomes longer when someone new adds to the queue from the end. It uses the concept of First-In-First-Out.

Now, as a data scientist or an analyst, you might not be employing this concept every day, but knowing it will surely help you when you have to build your own algorithm!

Data Structure #2: Tuples in Python

Tuples are another very popular in-built data structure in Python. These are quite similar to Lists except for one difference – they are immutable. This means that no value can be added, deleted, or edited once a tuple is generated.

We will explore this further, but let’s first see how to create a Python Tuple!

Creating Tuples in Python

Tuples can be generated by writing values within (parentheses), and each element is separated by a comma. But even if you write many values without any parenthesis and assign them to a variable, you will still have a tuple! Have a look for yourself:
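
The snippet below is a small sketch of both ways of writing a tuple. The planet names are assumed sample values, and the single-element case is added because its trailing comma is easy to miss.

```python
with_parens = ('Mercury', 'Venus', 'Earth')
without_parens = 'Mercury', 'Venus', 'Earth'  # the commas make the tuple
single = ('Earth',)                           # one element needs a trailing comma

print(type(with_parens))              # <class 'tuple'>
print(with_parens == without_parens)  # True
print(type(single), len(single))      # <class 'tuple'> 1
```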

Now that we know how to create tuples let’s talk about immutability.

Immutability of Tuples

Anything that cannot be modified after creation is immutable in Python. Python objects can be divided into mutable and immutable ones.

Lists, dictionaries, and sets (we will explore these in further sections) are mutable objects, meaning they can be modified after creation. On the other hand, integers, floating values, boolean values, strings, and even tuples are immutable objects. But what makes them immutable?

Everything in Python is an object. So, we can use the in-built id() function, which returns the identity of an object (in CPython, this is the object's memory location). Let’s create a list and determine the location of the list and its elements:
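
A quick sketch with an assumed list called `grades`. The actual numbers printed by id() differ from run to run, so only the comparison matters.

```python
grades = [10, 20, 30]

list_id = id(grades)        # identity of the list object
element_id = id(grades[0])  # identity of the object stored at index 0

print(list_id, element_id)    # two different numbers; exact values vary
print(list_id == element_id)  # False
```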

As you can see, both the list and its element have different locations in memory. Since we know lists are mutable, we can alter the value of its elements. Let’s do that and see how it affects the location values:
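
The sketch below assumes a list called `scores` and records the identities before and after changing its first element:

```python
scores = [10, 20, 30]
list_id_before = id(scores)
element_id_before = id(scores[0])

scores[0] = 99  # replace the first element

list_id_after = id(scores)
element_id_after = id(scores[0])

print(list_id_before == list_id_after)        # True: still the same list object
print(element_id_before == element_id_after)  # False: index 0 now holds another object
```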

The location of the list did not change, but that of the element did. This means a new object was created for the element and saved in the list. This is what is meant by mutable. A mutable object can change its state or contents after creation, but an immutable object cannot.

But we can call tuples pseudo-immutable because even though they are immutable, they can contain mutable objects whose values can be modified!

As you can see from the example above, we could change the values of a mutable object, a list, contained within a tuple.
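
A hedged sketch of this behaviour, using an assumed tuple named `record` that holds a string and a list:

```python
record = ('Alice', [90, 85])

record[1].append(70)  # allowed: the list inside the tuple is mutable

reassign_failed = False
try:
    record[0] = 'Bob'  # not allowed: tuple slots cannot be reassigned
except TypeError:
    reassign_failed = True

print(record)           # ('Alice', [90, 85, 70])
print(reassign_failed)  # True
```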

Tuple Assignment

Tuple packing and unpacking are useful operations that let you assign several values in a single line.

We already saw tuple packing when we made our planet tuple. Tuple unpacking is just the opposite, assigning the values of a tuple to individual variables:

It is handy for swapping values in a single line. Honestly, this was one of the first things that got me excited about Python: being able to do so much with such little coding!
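
Both ideas fit in a few lines. The names `planets`, `a`, and `b` below are assumed for illustration.

```python
planets = 'Mercury', 'Venus', 'Earth'  # packing: three values into one tuple
first, second, third = planets         # unpacking: one tuple into three variables

a, b = 1, 2
a, b = b, a                            # swap without a temporary variable

print(first, second, third)  # Mercury Venus Earth
print(a, b)                  # 2 1
```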

Changing Tuple Values

Although I said that tuple values cannot be changed, you can actually make changes to it by converting it to a list using list(). When you are done making the changes, you can again convert it back to a tuple using tuple().

This change, however, is expensive as it involves making a copy of the tuple. But tuples come in handy when you don’t want others to change the content of the data structure.
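
The round trip through list() and tuple() described above can be sketched as follows, with assumed sample values:

```python
original = (1, 2, 3)

temp = list(original)  # copy the values into a mutable list
temp[0] = 100
temp.append(4)
updated = tuple(temp)  # build a new tuple from the edited list

print(original)  # (1, 2, 3), the original tuple is untouched
print(updated)   # (100, 2, 3, 4)
```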

Data Structure #3: Dictionary in Python

A dictionary is another Python data structure for storing heterogeneous objects. Dictionaries are mutable, and since Python 3.7 they preserve insertion order, so iterating over a dictionary returns the items in the order in which you inserted them. In older versions of Python, this order was not guaranteed.

But what sets dictionaries apart from lists is how elements are stored. Elements in a dictionary are accessed via their key values instead of their index, as we did in a list. So, dictionaries contain key-value pairs instead of just single elements.

Generating Dictionary

Dictionaries are generated by writing keys and values within { curly } brackets, with each key separated from its value by a colon. Each key-value pair is separated by a comma:
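
A minimal sketch of creating a dictionary. The name `student` and its contents are assumed sample data.

```python
# Keys and values can be of different types.
student = {'name': 'Asha', 'age': 21, 'subjects': ['math', 'science']}

print(type(student))  # <class 'dict'>
print(len(student))   # 3 key-value pairs
```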

Using the key of the item, we can easily extract the associated value of the item:
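
For example, with an assumed dictionary of capitals. The get() call is shown as an optional extra: it returns None instead of raising an error when the key is missing.

```python
capitals = {'India': 'New Delhi', 'Japan': 'Tokyo'}

print(capitals['Japan'])     # Tokyo
print(capitals.get('Peru'))  # None, because 'Peru' is not a key
```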

These keys are unique. If you write a dictionary with the same key more than once, only one entry is kept, and its value is the one given last:
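
A short sketch with an assumed dictionary in which the key 'a' is written twice:

```python
repeated = {'a': 1, 'b': 2, 'a': 3}

print(repeated)       # {'a': 3, 'b': 2}
print(len(repeated))  # 2, since the second 'a' overwrote the first
```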

Dictionaries are handy to access items quickly because, unlike lists and tuples, a dictionary does not have to iterate over all the items to find a value. Dictionary uses the item key to find the item value quickly. This concept is called hashing.

Accessing Keys and Values

You can access the keys from a dictionary using the keys() method and the values using the values() method. These we can view using a for-loop or turn them into a list using list():

We can even access these values simultaneously using the items() method, which returns the respective key and value pair for each element of the dictionary.
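
The three methods can be compared side by side in a sketch. The dictionary `stock` is an assumed example.

```python
stock = {'apples': 3, 'pears': 5}

keys = list(stock.keys())      # ['apples', 'pears']
values = list(stock.values())  # [3, 5]
pairs = list(stock.items())    # [('apples', 3), ('pears', 5)]

# items() yields (key, value) tuples, which unpack neatly in a for-loop.
for fruit, count in stock.items():
    print(fruit, count)
```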

Data Structure #4: Sets in Python

Sometimes, you don’t want multiple occurrences of the same element in your list or tuple. It is here that you can use a set data structure. A Set is an unordered but mutable collection of elements that contains only unique values.
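
Here is a small sketch of building a set from a list with repeated values. The fruit names are assumed sample data.

```python
entered = ['pear', 'apple', 'pear', 'fig', 'apple']

unique_fruits = set(entered)  # duplicates are dropped

print(unique_fruits)       # each value appears once; the order may vary
print(len(unique_fruits))  # 3
```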

You will see that the values are not in the same order as entered in the set. This is because sets are unordered.

Add and Remove Elements from a Set

To add values to a set, use the add() method. It lets you add any hashable value, which rules out mutable objects such as lists, dictionaries, and other sets:
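
A hedged sketch of add() with an assumed set named `tags`, including what happens when a list is passed in:

```python
tags = {'python', 'data'}

tags.add('lists')   # a new value is added
tags.add('python')  # already present, so nothing changes
tags.add((1, 2))    # a tuple of numbers is fine

list_rejected = False
try:
    tags.add([1, 2])  # a list is mutable and cannot go into a set
except TypeError:
    list_rejected = True

print(len(tags))      # 4
print(list_rejected)  # True
```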

To remove values from a set, you have two options to choose from:

The first is the remove() method, which gives an error if the element is not present in the Set
The second is the discard() method, which removes elements but gives no error when the element is not present in the Set

If the value does not exist, remove() will give an error, but discard() won’t.
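
The contrast between the two methods can be sketched like this, using an assumed set called `primes`:

```python
primes = {2, 3, 5}

primes.discard(7)  # 7 is absent; discard() silently does nothing

remove_raised = False
try:
    primes.remove(7)  # 7 is absent; remove() raises KeyError
except KeyError:
    remove_raised = True

primes.remove(5)  # 5 is present, so it is removed normally

print(remove_raised)  # True
print(primes == {2, 3})  # True
```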

Set Operations

Using Python Sets, you can perform operations like union, intersection, and difference between two sets, just like you would in mathematics.

The Union of two sets gives values from both sets. But the values are unique. So if both the sets contain the same value, only one copy will be returned:

The Intersection of two sets returns only those values that are common to both sets:

The Difference of a set and another gives only those values from the first set that are not present in the second:
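
All three operations can be sketched with the set operators. The sets `a` and `b` are assumed examples, and the equivalent methods are union(), intersection(), and difference().

```python
a = {1, 2, 3, 4}
b = {3, 4, 5}

union = a | b         # every value from both sets, each only once
intersection = a & b  # values common to both sets
difference = a - b    # values in a that are not in b

print(union == {1, 2, 3, 4, 5})  # True
print(intersection == {3, 4})    # True
print(difference == {1, 2})      # True
```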

User-Defined Data Structures

User-defined data structures refer to data structures that are created by the programmer based on their specific requirements and needs. These data structures are not built-in to the programming language but are designed and implemented by the programmer to store and organize data in a way that suits their application. User-defined data structures allow programmers to tailor the data storage and manipulation to match the problem they are trying to solve. Let’s look at the different types of user-defined data structures in Python.

Arrays

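Arrays are a fundamental data structure that stores elements of the same data type in contiguous memory locations. In many languages they have a fixed size, and they provide constant-time access to elements by index. Python has no built-in fixed-size array type, so the sample below uses a list as a stand-in.

Sample Code:

# Using a Python list as a simple stand-in for an array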
numbers = [10, 20, 30, 40, 50]

# Accessing elements of an array
print(numbers[2]) # Output: 30

# Modifying an element
numbers[1] = 25
print(numbers) # Output: [10, 25, 30, 40, 50]
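
If you need every element to share one type, Python's standard-library array module is one option. The sketch below assumes the 'i' type code (signed integers); note that, unlike the fixed-size arrays described above, an array.array can still grow.

```python
from array import array

numbers = array('i', [10, 20, 30, 40, 50])  # 'i' means signed int elements

print(numbers[2])  # 30
numbers[1] = 25    # fine: the new value is also an int

rejected = False
try:
    numbers[0] = 1.5  # a float does not fit an integer array
except TypeError:
    rejected = True

print(numbers.tolist())  # [10, 25, 30, 40, 50]
print(rejected)          # True
```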

Lists

Lists, also known as dynamic arrays, are similar to arrays but can grow or shrink in size dynamically. They’re implemented using arrays and provide more flexibility.

Sample Code:

# Creating a list in Python
names = ["Alice", "Bob", "Charlie"]

# Adding an element to the end of the list
names.append("David")
print(names) # Output: ['Alice', 'Bob', 'Charlie', 'David']

# Removing an element from the list
names.remove("Bob")
print(names) # Output: ['Alice', 'Charlie', 'David']

Stack

A stack is a linear data structure that follows the Last In First Out (LIFO) principle. Elements are added and removed from the top of the stack.

Sample Code:

# Implementing a stack using Python's list
stack = []

# Pushing elements onto the stack
stack.append(10)
stack.append(20)
stack.append(30)

# Popping elements from the stack
print(stack.pop()) # Output: 30
print(stack.pop()) # Output: 20

Queue

A queue is a linear data structure that follows the First In First Out (FIFO) principle. Elements are added at the rear and removed from the front.

Sample Code:

# Implementing a queue using Python's collections module
from collections import deque

queue = deque()

# Enqueue elements
queue.append(5)
queue.append(10)
queue.append(15)

# Dequeue elements
print(queue.popleft()) # Output: 5
print(queue.popleft()) # Output: 10

Trees

A tree is a hierarchical data structure consisting of nodes connected by edges. Each node has a parent (except the root) and zero or more children.

Sample Code:

# Defining a simple binary tree node
class TreeNode:
    def __init__(self, value):
        self.value = value
        self.left = None   # left child
        self.right = None  # right child

# Creating a binary tree
root = TreeNode(10)
root.left = TreeNode(5)
root.right = TreeNode(15)

Linked Lists

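A linked list is a linear data structure where each element (node) points to the next element. Nodes can be inserted or removed without shifting the other elements, and the list grows or shrinks one node at a time, though each node needs extra memory to store its pointer.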

Sample Code:

# Defining a linked list node
class ListNode:
    def __init__(self, value):
        self.value = value
        self.next = None  # reference to the next node

# Creating a linked list
head = ListNode(10)
head.next = ListNode(20)
head.next.next = ListNode(30)

Graphs

A graph is a collection of nodes (vertices) connected by edges. Graphs can be directed (edges have a direction) or undirected.

Sample Code:

# Using Python's NetworkX library to create a simple undirected graph
import networkx as nx
import matplotlib.pyplot as plt

G = nx.Graph()
G.add_nodes_from([1, 2, 3])
G.add_edges_from([(1, 2), (2, 3)])

nx.draw(G, with_labels=True, font_weight='bold')
plt.show()

HashMaps (Dictionaries)

A hashmap (or dictionary) is a data structure that stores key-value pairs. It provides fast access to values using keys.

Sample Code:

# Creating a dictionary in Python
phonebook = {
    "Alice": "123-456-7890",
    "Bob": "987-654-3210",
    "Charlie": "555-123-4567"
}

# Accessing values using keys
print(phonebook["Alice"]) # Output: 123-456-7890

Conclusion

Isn’t Python a beautiful language? It provides you with many different options to handle your data more efficiently. Learning about data structures in Python is a key aspect of your own learning journey. This article should serve as a good introduction to the in-built data structures in Python. If it got you interested in Python, and you are itching to know more about it in detail and how to use it in your everyday data science or analytics work, I recommend exploring further articles and courses on the topic.

Frequently Asked Questions

Q1. What are the data structures in Python?

Ans. Data structures in Python are ways to organize and store data. Common ones include lists, tuples, dictionaries, sets, and more advanced ones like stacks, queues, and linked lists.

Q2. What are the 4 types of data structure?

Ans. The 4 main types of data structures are:
Linear: Data is arranged in a sequence (e.g., lists, arrays, stacks, queues).
Non-linear: Data is not in a sequence (e.g., trees, graphs).
Homogeneous: All elements are of the same type (e.g., arrays).
Heterogeneous: Elements can be of different types (e.g., lists, dictionaries).

Q3. What are the basic built-in data types in Python?

Ans. Python's basic built-in data types include Integer (int), Float (float), String (str), and Boolean (bool).

Q4. What are the structural types in Python?

Ans. The 2 main types of data structures are primitive data structures and non-primitive or composite data structures.
