What are Joint, Marginal, and Conditional Probability?

Janvi Kumari Last Updated : 28 Dec, 2024
4 min read

Probability is a cornerstone of statistics and data science, providing a framework to quantify uncertainty and make predictions. Understanding joint, marginal, and conditional probability is critical for analyzing events in both independent and dependent scenarios. This article unpacks these concepts with clear explanations and examples.

What are Joint, Marginal, and Conditional Probability?

What is Probability?

Probability measures the likelihood of an event occurring, expressed as a value between 0 and 1:

  • 0: The event is impossible.
  • 1: The event is certain.

For example, flipping a fair coin has a probability of 0.5 for landing heads.

Joint Probability

Joint probability refers to the probability of two (or more) events occurring simultaneously. For events A and B, it is denoted as:

Joint Probability

Formula:

P(A∩B)=P(A∣B)⋅P(B)=P(B∣A)⋅P(A)

Example

Consider rolling a die and flipping a coin:

  • Event A: Rolling a 4 (probability = 1\6​)
  • Event B: Flipping a head (probability = 1\2​)

If the events are independent:

Joint Probability

Marginal Probability

Marginal probability is the probability of a single event occurring, regardless of other events. It is derived by summing over the joint probabilities involving that event.

For event A:

Marginal Probability

Example

Consider a dataset of students:

  • 60% are male (P(Male)=0.6).
  • 30% play basketball (P(Basketball)=0.3).
  • 20% are males who play basketball (P(Male∩Basketball)=0.2).

The marginal probability of being male:

P(Male)=0.6

Conditional Probability

Conditional probability measures the probability of one event occurring given that another event has already occurred. For events A and B, it is denoted as:

Conditional Probability

Example

From the student dataset:

  • P(Male∩Basketball)=0.2P
  • P(Basketball)=0.3

The probability that a student is male given they play basketball:

P(Male∣Basketball)=P(Male∩Basketball)/P(Basketball)=0.2/0.3=0.67

This means 67% of basketball players are male.

Relationships Between Joint, Marginal, and Conditional Probabilities

  1. Joint Probability and Marginal Probability
    • Joint probability considers multiple events together.
    • Marginal probability considers a single event, often summing over joint probabilities.
  2. Joint Probability and Conditional Probability
    • Joint probability can be expressed using conditional probability: 
      P(A∩B)=P(A∣B)⋅P(B)
  3. Marginal and Conditional Probability
    • Marginal probability can help calculate conditional probabilities and vice versa.

Python Implementation

Here’s a Python implementation of joint, marginal, and conditional probability using simple examples:

# Import necessary library
import numpy as np
import pandas as pd

# Example 1: Joint and Marginal Probabilities
# Simulating a dataset of students
data = {
    'Gender': ['Male', 'Male', 'Male', 'Female', 'Female', 'Female'],
    'Basketball': ['Yes', 'No', 'Yes', 'Yes', 'No', 'No']
}

# Create a DataFrame
df = pd.DataFrame(data)

# Frequency table (Joint Probability Table)
joint_prob_table = pd.crosstab(df['Gender'], df['Basketball'], normalize='all')
print("Joint Probability Table:")
print(joint_prob_table)

# Marginal probabilities
marginal_gender = joint_prob_table.sum(axis=1)
marginal_basketball = joint_prob_table.sum(axis=0)

print("\nMarginal Probability (Gender):")
print(marginal_gender)

print("\nMarginal Probability (Basketball):")
print(marginal_basketball)

# Example 2: Conditional Probability
# P(Male | Basketball = Yes)
joint_male_yes = joint_prob_table.loc['Male', 'Yes']  # P(Male and Basketball = Yes)
prob_yes = marginal_basketball['Yes']  # P(Basketball = Yes)

conditional_prob_male_given_yes = joint_male_yes / prob_yes
print(f"\nConditional Probability P(Male | Basketball = Yes): {conditional_prob_male_given_yes:.2f}")

# Example 3: Joint Probability for Independent Events
# Rolling a die and flipping a coin
P_roll_4 = 1/6  # Probability of rolling a 4
P_flip_heads = 1/2  # Probability of flipping heads

joint_prob_roll_and_heads = P_roll_4 * P_flip_heads
print(f"\nJoint Probability of Rolling a 4 and Flipping Heads: {joint_prob_roll_and_heads:.2f}")
Output

Applications in Real Life

  1. Medical Diagnosis
    • Joint Probability: The probability of having a disease and showing specific symptoms.
    • Marginal Probability: The overall probability of having the disease.
    • Conditional Probability: The probability of having the disease given the symptoms.
  2. Machine Learning
    • Used in Naive Bayes Classifiers, where conditional probabilities are calculated for classification tasks.
  3. Risk Analysis
    • Understanding dependencies between events, such as in financial markets or insurance.

Conclusion

Grasping joint, marginal, and conditional probabilities is crucial for solving real-world problems involving uncertainty and dependencies. These concepts form the foundation for advanced topics in statistics, machine learning, and decision-making under uncertainty. Mastery of these principles enables effective analysis and informed conclusions.

Frequently Asked Questions

Q1. What is joint probability?

Ans. Joint probability is the likelihood of two or more events occurring simultaneously. For example, in a dataset of students, the probability that a student is male and plays basketball is a joint probability.

Q2. How do you calculate joint probability?

Ans. For events A and B, joint probability is calculated as:
P(A∩B)=P(A∣B)⋅P(B)
If A and B are independent, then:
P(A∩B)=P(A)⋅P(B)

Q3. What is marginal probability?

Ans. Marginal probability is the probability of a single event occurring, regardless of other events. For example, the probability that a student plays basketball, irrespective of gender.

Q4. When should I use joint, marginal, and conditional probability?

Ans. Use joint probability when analyzing the likelihood of multiple events happening together.
Use marginal probability when focusing on a single event without considering others.
Use conditional probability when analyzing the likelihood of an event given the occurrence of another event.

Q5. What is the difference between joint and conditional probability?

Ans. Joint probability considers both events happening together (P(A∩B)).
Conditional probability considers the likelihood of one event happening given that another event has occurred (P(A∣B)).

Hi, I am Janvi, a passionate data science enthusiast currently working at Analytics Vidhya. My journey into the world of data began with a deep curiosity about how we can extract meaningful insights from complex datasets.

Responses From Readers

Clear

We use cookies essential for this site to function well. Please click to help us improve its usefulness with additional cookies. Learn about our use of cookies in our Privacy Policy & Cookies Policy.

Show details