Solving Real World Problems using Reinforcement Learning

10 AUGUST 2024 | 09:30 AM - 05:30 PM | RENAISSANCE, Race Course Rd, Madhava Nagar Extension

About the workshop

In this workshop, you'll journey through Reinforcement Learning (RL), starting with fundamental concepts and advancing to complex techniques, focusing on real-world applications. Time permitting, you'll also explore how Large Language Models (LLMs) can optimize RL reward functions in a human-centric manner. Whether you're a seasoned AI professional or just beginning, this workshop equips you with the skills and knowledge to tackle real-world challenges using RL.

Discover how cutting-edge technologies use RL to achieve groundbreaking results! For instance, the generative model ChatGPT relies on RL behind the scenes: it is fine-tuned with Reinforcement Learning from Human Feedback (RLHF), which aligns LLMs with human preferences. This demonstrates the potential of RL to solve real-world problems and transform industries. Join our workshop to harness the power of RL and become a part of the AI revolution!

Instructor

Mayank Baranwal

Senior Scientist, Data and Decision Sciences

Modules

Module 1: Mathematical Foundations of Reinforcement Learning

Markov Decision Processes
Bellman Equation and Dynamic Programming
Value Iteration
Policy Iteration
Hands-on Experience: Jupyter Notebook with a simple NumPy-based tutorial on Value Iteration and Policy Iteration, including solutions

Expected Outcome: The emphasis will be on understanding the mathematical tools that RL requires, which lays the foundation for the modules that follow. We will look at some basic examples of optimizing an organization's decision-making and solve them using the value iteration and policy iteration algorithms.
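
For a flavour of what value iteration looks like in practice, here is a minimal sketch. It is not taken from the workshop notebook: the four-state chain, the reward of 1 for reaching the last state, the discount factor of 0.9, and all function names are assumptions made for illustration, and it uses plain Python rather than NumPy to stay dependency-free. Each sweep applies the Bellman optimality backup to every non-terminal state until the values stop changing.

```python
# Hypothetical 4-state chain MDP: states 0..3, state 3 is terminal.
N_STATES = 4
ACTIONS = (-1, +1)   # move left or move right
GAMMA = 0.9          # discount factor (assumed for this sketch)


def step(state, action):
    """Deterministic model: return (next_state, reward)."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    return next_state, reward


def value_iteration(tol=1e-8):
    """Repeat Bellman optimality backups until the largest change is below tol."""
    values = [0.0] * N_STATES
    while True:
        delta = 0.0
        for s in range(N_STATES - 1):  # the terminal state keeps value 0
            outcomes = (step(s, a) for a in ACTIONS)
            best = max(r + GAMMA * values[s2] for s2, r in outcomes)
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            return values


def greedy_policy(values):
    """Pick, in each non-terminal state, the action with the best one-step lookahead."""
    policy = []
    for s in range(N_STATES - 1):
        lookahead = {}
        for a in ACTIONS:
            s2, r = step(s, a)
            lookahead[a] = r + GAMMA * values[s2]
        policy.append(max(lookahead, key=lookahead.get))
    return policy


values = value_iteration()
policy = greedy_policy(values)
print(values, policy)
```

In this toy chain the greedy policy moves right in every state, and each state's value is the discounted reward for reaching the goal. Policy iteration would reach the same policy by alternating full policy evaluation with greedy improvement.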

Module 2: Introduction to Reinforcement Learning

Temporal Difference (TD) Learning and Monte Carlo (MC) Methods
RL Framework: OpenAI Gym Environment
Exploration vs Exploitation in RL
Actor-Only, Critic-Only, and Actor-Critic Algorithms
Q-Learning
SARSA
REINFORCE
Hands-on Experience: Jupyter Notebook tutorial on TD, MC, Q-Learning, SARSA, and REINFORCE, including solutions

Expected Outcome: This module introduces basic RL algorithms. We will build a better understanding of objects such as state-action value functions (the Q in Q-learning), and we will cover different types of RL algorithms and use them to solve some toy problems.
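
As an illustration of the state-action value function, here is a minimal tabular Q-learning sketch with epsilon-greedy exploration. It is a toy written for this description, not the workshop's notebook code: the four-state chain environment, the hyperparameters, and the function names are all assumptions, and it uses only the Python standard library rather than a Gym environment.

```python
import random

N_STATES = 4          # hypothetical chain: states 0..3, state 3 is terminal
ACTIONS = (-1, +1)    # action index 0 moves left, index 1 moves right
GAMMA, ALPHA, EPSILON = 0.9, 0.5, 0.2   # assumed hyperparameters


def step(state, action_index):
    """Toy environment: return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q_row, rng):
    """Epsilon-greedy: explore with probability EPSILON, else act greedily."""
    if rng.random() < EPSILON:
        return rng.randrange(len(ACTIONS))
    best = max(q_row)
    # Break ties at random so the untrained agent does not get stuck.
    return rng.choice([a for a, q in enumerate(q_row) if q == best])


def q_learning(episodes=500, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = choose_action(q[state], rng)
            next_state, reward, done = step(state, action)
            # Q-learning bootstraps from the best next action (off-policy).
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


q_table = q_learning()
greedy = [row.index(max(row)) for row in q_table[:-1]]
print(q_table, greedy)
```

SARSA differs only in the target: it bootstraps from the value of the action the agent actually takes in the next state, rather than from the maximum over actions.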

Module 3: Reinforcement Learning with Function Approximation

Introduction to Linear Function Approximation
DQN and Variants
PPO and Variants
DDPG, TD3, SAC
Hands-on Experience: Jupyter Notebook tutorial with Stable-Baselines3-based solutions

Expected Outcome: Most practical problems suffer from the curse of dimensionality, which is why we need to combine RL with Deep Learning (DL), a combination popularly known as Deep Reinforcement Learning (DRL). We will explore multiple state-of-the-art algorithms and their applications in Gym environments.

Module 4: Practical Applications of Reinforcement Learning

When to Use RL
RL for Real-Time Control of Power Networks
RL for Real-Time Inventory Management
Decision-Making in the Presence of Delays
Incorporating Delays in Inventory Management

Expected Outcome: This is when we will dive deeper into solving practical, real-world problems using RL.

Module 5: Human-Centric Reward Design Using LLMs (Time Permitting)

Prompting Using LLMs
RL with Human Feedback (RLHF)
Direct Preference Optimization (DPO)
Reward Restructuring Using LLMs
Hands-on Experience: Jupyter Notebook tutorial on learning anthropomorphic policies in autonomous navigation

Expected Outcome: We will look to generate human-centric behavior in RL-learned policies using the power of LLMs.

Pre-requisites

Strong foundation in mathematics, particularly linear algebra, calculus, and probability theory
Proficiency in Python programming
Basic understanding of machine learning concepts
Familiarity with deep learning frameworks (e.g., PyTorch or TensorFlow)
Experience with data analysis and visualization libraries (e.g., NumPy, Pandas, Matplotlib)
Basic knowledge of optimization techniques
Familiarity with Jupyter Notebooks and version control systems (e.g., Git)

*Note: These are tentative details and are subject to change.

Certificate of Participation

Receive a digital (blockchain-enabled) and physical certificate to showcase your accomplishment to the world

Modules

Module 1: Mathematical Foundations of Reinforcement Learning

Module 2: Introduction to Reinforcement Learning

Module 3: Reinforcement Learning with Function Approximation

Module 4: Practical Applications of Reinforcement Learning

Module 5: Human-Centric Reward Design Using LLMs (Time Permitting)

Module 6: Pre-requisites
