Solving Real World Problems using Reinforcement Learning

10 AUGUST 2024 | 09:30AM - 05:30PM | Location: RENAISSANCE, Race Course Rd, Madhava Nagar Extension

About the workshop

In this workshop, you'll journey through Reinforcement Learning (RL), starting with fundamental concepts and advancing to complex techniques, focusing on real-world applications. Time permitting, you'll also explore how Large Language Models (LLMs) can optimize RL reward functions in a human-centric manner. Whether you're a seasoned AI professional or just beginning, this workshop equips you with the skills and knowledge to tackle real-world challenges using RL. 

Discover how cutting-edge technologies use Reinforcement Learning (RL) to achieve groundbreaking results. The generative model ChatGPT, for instance, is fine-tuned with Reinforcement Learning from Human Feedback (RLHF), a technique that aligns Large Language Models (LLMs) with human preferences. This illustrates the potential of RL to solve real-world problems and transform industries. Join our workshop to harness the power of RL and become a part of the AI revolution!

Modules

  • Markov Decision Processes
  • Bellman Equation and Dynamic Programming
  • Value Iteration
  • Policy Iteration
  • Hands-on Experience: Jupyter notebook with a simple numpy-based tutorial on Value Iteration and Policy Iteration, including solutions

Expected Outcome: The emphasis will primarily be on understanding the mathematical tools needed for RL, which lay the foundation for the modules that follow. We will look at some basic examples of optimizing an organization's decision-making and solve them using the value iteration and policy iteration algorithms.
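
To give a flavor of value iteration, here is a minimal sketch in plain Python (the workshop notebook itself is numpy-based). The two-state machine-maintenance MDP, its rewards, and its transition probabilities are made up for illustration and are not taken from the workshop material.

```python
# Hypothetical MDP: state 0 = machine working, state 1 = machine broken.
# Actions: 0 = operate, 1 = repair.
# transitions[state][action] is a list of (probability, next_state, reward).
MAINTENANCE_MDP = [
    [  # state 0: working
        [(0.8, 0, 10.0), (0.2, 1, 10.0)],  # operate: earn 10, may break
        [(1.0, 0, 6.0)],                   # repair (maintain): earn 6, stays working
    ],
    [  # state 1: broken
        [(1.0, 1, 0.0)],                   # operate: earns nothing, stays broken
        [(1.0, 0, -5.0)],                  # repair: pay 5, back to working
    ],
]


def action_values(transitions, values, state, gamma):
    """Expected return of each action in `state` under the current value estimate."""
    return [
        sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)
        for outcomes in transitions[state]
    ]


def value_iteration(transitions, gamma=0.9, tol=1e-8):
    """Apply the Bellman optimality backup until the values stop changing."""
    n_states = len(transitions)
    values = [0.0] * n_states
    while True:
        new_values = [
            max(action_values(transitions, values, s, gamma))
            for s in range(n_states)
        ]
        delta = max(abs(a - b) for a, b in zip(new_values, values))
        values = new_values
        if delta < tol:
            break
    # Greedy policy with respect to the converged values
    policy = []
    for s in range(n_states):
        q = action_values(transitions, values, s, gamma)
        policy.append(q.index(max(q)))
    return values, policy


values, policy = value_iteration(MAINTENANCE_MDP)
print("values:", values)
print("policy:", policy)
```

In this toy problem the greedy policy operates the machine while it works and repairs it once it breaks. Policy iteration would reach the same policy by alternating policy evaluation with greedy improvement.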

  • Temporal Difference (TD) Learning and Monte Carlo (MC) Methods
  • RL Framework: OpenAI Gym Environment
  • Exploration vs Exploitation in RL
  • Actor-Only, Critic-Only, and Actor-Critic Algorithms
  • Q-Learning
  • SARSA
  • REINFORCE
  • Hands-on Experience: Jupyter Notebook tutorial on TD, MC, Q-Learning, SARSA, and REINFORCE, including solutions

Expected Outcome: This is where we start looking at basic RL algorithms. We will build a better understanding of objects such as the state-action value function (the Q in Q-learning). Different types of RL algorithms will be covered and used to solve some toy problems.
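
As an illustration of the state-action value function, the sketch below runs tabular Q-learning on a tiny chain problem. The five-state chain, the hyperparameters, and the helper names are assumptions chosen for this example, not the workshop's exercises.

```python
import random

N_STATES = 5       # states 0..4; state 4 is the terminal goal
ACTIONS = (0, 1)   # 0 = step left, 1 = step right


def env_step(state, action):
    """Hypothetical chain environment: reward 1 only on reaching the goal."""
    next_state = max(state - 1, 0) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def choose_action(q_row, epsilon, rng):
    """Epsilon-greedy selection with random tie-breaking."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    best = max(q_row)
    return rng.choice([a for a in ACTIONS if q_row[a] == best])


def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.2,
               max_steps=200, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):
            action = choose_action(q[state], epsilon, rng)
            next_state, reward, done = env_step(state, action)
            # Off-policy target: bootstrap from the best action in the next state
            target = reward if done else reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
            if done:
                break
    return q


q_table = q_learning()
# Greedy action in each non-terminal state (1 means "step right")
greedy_policy = [1 if row[1] > row[0] else 0 for row in q_table[:-1]]
print(greedy_policy)
```

SARSA differs only in the target: it bootstraps from the action actually taken in the next state rather than from the best one.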

  • Introduction to Linear Function Approximation
  • DQN and Variants
  • PPO and Variants
  • DDPG, TD3, SAC
  • Hands-on Experience: Jupyter Notebook tutorial with Stable-Baselines3-based solutions

Expected Outcome: Most practical problems suffer from the curse of dimensionality, which is where we need to combine RL with Deep Learning (DL), an approach popularly known as Deep Reinforcement Learning (DRL). We will explore multiple state-of-the-art algorithms and their applications in Gym environments.
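
A quick back-of-the-envelope calculation shows why tables stop scaling. The sketch below counts the entries a Q-table would need if every state variable were discretized into a fixed number of bins; the choice of 10 bins and the function name are assumptions made for illustration.

```python
def q_table_entries(bins_per_dim, n_state_dims, n_actions):
    """Number of Q-values in a table over a uniformly discretized state space."""
    return (bins_per_dim ** n_state_dims) * n_actions


# Table size grows exponentially with the number of state variables
for n_dims in (2, 4, 8, 16):
    print(n_dims, q_table_entries(10, n_dims, 2))
```

A CartPole-sized problem (4 state variables, 2 actions) already needs 20,000 entries at this resolution, and each added state variable multiplies the table by another factor of 10. Function approximation replaces the table with a parameterized model whose size does not grow this way.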

  • When to Use RL
  • RL for Real-Time Control of Power Networks
  • RL for Real-Time Inventory Management
  • Decision-Making in the Presence of Delays
  • Incorporating Delays in Inventory Management

Expected Outcome: This is when we will dive deeper into solving practical, real-world problems using RL.
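
One common way to handle delays is to fold the orders still in transit into the state, so the problem stays Markov. The sketch below assumes a fixed lead time; the class name, the lead time, and the quantities are hypothetical and only illustrate the idea.

```python
from collections import deque


class DelayedInventory:
    """Toy inventory model in which an order arrives `lead_time` periods later."""

    def __init__(self, lead_time=2, initial_stock=0):
        self.stock = initial_stock
        # pipeline[0] is the order that arrives next
        self.pipeline = deque([0] * lead_time)

    def state(self):
        # Augmented state: on-hand stock plus every order still in transit
        return (self.stock, *self.pipeline)

    def step(self, order_qty, demand):
        self.pipeline.append(order_qty)     # new order joins the back of the queue
        self.stock += self.pipeline.popleft()  # oldest order is delivered
        sold = min(self.stock, demand)      # unmet demand is lost
        self.stock -= sold
        return self.state(), sold


env = DelayedInventory(lead_time=2)
print(env.step(order_qty=5, demand=0))
print(env.step(order_qty=0, demand=0))
print(env.step(order_qty=0, demand=3))
```

The order placed in the first period only becomes available in the third, so an agent that sees on-hand stock alone cannot tell whether a replenishment is already on its way. Exposing the pipeline in the state removes that ambiguity.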

  • Prompting Using LLMs
  • RL with Human Feedback (RLHF)
  • Direct Preference Optimization (DPO)
  • Reward Restructuring Using LLMs
  • Hands-on Experience: Jupyter Notebook tutorial on learning anthropomorphic policies in autonomous navigation

Expected Outcome: We will look to generate human-centric behavior in RL-learned policies using the power of LLMs.
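
For Direct Preference Optimization, the per-example loss can be written in a few lines. The sketch below computes it for a single preference pair from sequence log-probabilities; the numeric log-probabilities and the value of beta are invented for illustration.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair: -log(sigmoid(margin))."""
    chosen_gain = policy_chosen_logp - ref_chosen_logp        # log-ratio, preferred answer
    rejected_gain = policy_rejected_logp - ref_rejected_logp  # log-ratio, rejected answer
    margin = beta * (chosen_gain - rejected_gain)
    return softplus(-margin)  # identical to -log(sigmoid(margin))


# Hypothetical log-probabilities of two responses to the same prompt
same_as_reference = dpo_loss(-5.0, -7.0, -5.0, -7.0)
prefers_chosen = dpo_loss(-4.0, -8.0, -5.0, -7.0)
print(same_as_reference, prefers_chosen)
```

When the policy matches the reference model the margin is zero and the loss equals log 2. The loss falls as the policy shifts probability toward the preferred response relative to the reference, which is how DPO learns from preferences without training a separate reward model.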

Prerequisites

  • Strong foundation in mathematics, particularly linear algebra, calculus, and probability theory
  • Proficiency in Python programming
  • Basic understanding of machine learning concepts
  • Familiarity with deep learning frameworks (e.g., PyTorch or TensorFlow)
  • Experience with data analysis and visualization libraries (e.g., NumPy, Pandas, Matplotlib)
  • Basic knowledge of optimization techniques
  • Familiarity with Jupyter Notebooks and version control systems (e.g., Git)
*Note: These are tentative details and are subject to change.

Certificate of Participation

Receive a digital (blockchain-enabled) and physical certificate to showcase your accomplishment to the world

  • Earn your certificate
  • Share your achievement