Demystify Parallel Programming: Hands-on with CUDA for GenAI

About

This session offers an in-depth exploration of leveraging CUDA to get the most out of NVIDIA hardware, which is essential for the explosive growth of Generative AI applications. Generative AI models, which include image and text generation and code completion, rely heavily on accelerated hardware for efficient training and inference. While high-level frameworks like PyTorch and TensorFlow simplify the process, true optimization and control are unlocked through CUDA, NVIDIA’s low-level parallel programming platform that interfaces directly with GPUs.

The session begins with a thorough review of C programming fundamentals, ensuring a solid base. It then demystifies core CUDA concepts, including threads, blocks, grids, and memory hierarchies, teaching participants to think in parallel for efficient GPU utilization. The course is designed to be highly interactive, with practical sessions guiding learners through writing their own kernels, the essential workhorses of CUDA programs. This hands-on approach gives participants real-world experience in parallel programming, ensuring they understand and can effectively harness the power of parallel processing for Generative AI.
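To give a flavor of the concepts named above, here is a minimal, illustrative CUDA kernel (not part of the session materials): a vector addition in which each thread handles one element, with the grid sized from a chosen block size. Names like `vecAdd` and the choice of 256 threads per block are assumptions for the sketch; compiling requires NVIDIA's `nvcc` and a CUDA-capable GPU.

```cuda
#include <cuda_runtime.h>
#include <cstdio>

// Kernel: each thread computes one element of c = a + b.
__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;  // global thread index
    if (i < n)                                      // guard: grid may overshoot n
        c[i] = a[i] + b[i];
}

int main() {
    const int n = 1 << 20;                // 1M elements (illustrative size)
    size_t bytes = n * sizeof(float);

    float *a, *b, *c;
    cudaMallocManaged(&a, bytes);         // unified memory, visible to CPU and GPU
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; i++) { a[i] = 1.0f; b[i] = 2.0f; }

    int threadsPerBlock = 256;            // a common, assumed block size
    int blocks = (n + threadsPerBlock - 1) / threadsPerBlock;  // grid size
    vecAdd<<<blocks, threadsPerBlock>>>(a, b, c, n);
    cudaDeviceSynchronize();              // wait for the GPU to finish

    printf("c[0] = %f\n", c[0]);
    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```

The launch configuration `<<<blocks, threadsPerBlock>>>` is where the "thinking in parallel" happens: instead of a loop over `n` elements, the grid of blocks and threads covers the whole array at once.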

Key Takeaways:

  • Confidently navigate the world of CUDA programming. 
  • Write your own CUDA kernels to leverage the power of GPUs. 
  • Approach problems from a parallel programming perspective for optimal performance. 

This session is your gateway to unlocking the true potential of Generative AI. Join us and take your skills to the next level!

Stay informed about DHS 2025
