speaker detail

Anshuman Mishra

Machine Learning Engineer

Anshuman Mishra is a talented Machine Learning Engineer and Google Developer Expert (GDE) in ML (GenAI) with a diverse background in software engineering and data science. Currently, he is contributing his expertise as a full-time Machine Learning Engineer at Flip AI, where he plays a pivotal role in developing cutting-edge models and software services. Despite being the youngest member of the team, Anshuman is valued for his contributions and enjoys equal respect and responsibilities.

This session offers an in-depth exploration of leveraging CUDA to optimize for NVIDIA hardware, essential given the explosive growth of Generative AI applications. Generative AI models, which power image and text generation and code completion, rely heavily on accelerated hardware for efficient training and inference. While high-level frameworks like PyTorch and TensorFlow simplify the process, true optimization and control are unlocked through CUDA, NVIDIA's low-level programming interface that talks directly to its GPUs.

The session begins with a thorough review of C programming fundamentals, ensuring a solid base. It then demystifies core CUDA concepts, including threads, blocks, grids, and memory hierarchies, teaching participants to think in parallel for efficient GPU utilization. The session is designed to be highly interactive, with practical exercises guiding learners through writing their own kernels, the essential workhorses of CUDA programs. This hands-on approach gives participants real-world experience in parallel programming, ensuring they understand and can effectively harness the power of parallel processing for Generative AI.
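To illustrate the thread/block/grid model and kernel writing mentioned above, here is a minimal, self-contained sketch (not taken from the session materials): a vector-addition kernel launched over a grid of thread blocks, using CUDA unified memory for simplicity.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Each thread computes one output element. blockIdx, blockDim, and
// threadIdx together give the thread's global index within the grid.
__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)                 // guard: the grid may be larger than n
        c[i] = a[i] + b[i];
}

int main() {
    const int n = 1 << 20;
    const size_t bytes = n * sizeof(float);

    float *a, *b, *c;          // unified memory: accessible from CPU and GPU
    cudaMallocManaged(&a, bytes);
    cudaMallocManaged(&b, bytes);
    cudaMallocManaged(&c, bytes);
    for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

    int threadsPerBlock = 256;                                // one block
    int blocks = (n + threadsPerBlock - 1) / threadsPerBlock; // the grid
    vecAdd<<<blocks, threadsPerBlock>>>(a, b, c, n);          // kernel launch
    cudaDeviceSynchronize();   // wait for the GPU to finish

    printf("c[0] = %f\n", c[0]);
    cudaFree(a); cudaFree(b); cudaFree(c);
    return 0;
}
```

The `<<<blocks, threadsPerBlock>>>` launch configuration is exactly the "thinking in parallel" the session teaches: instead of a loop over elements, you define what one thread does and let the grid cover the data.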

Managing and scaling ML workloads has never been a bigger challenge. Data scientists are looking to collaborate on, build, train, and iterate over thousands of AI experiments. On the flip side, ML engineers are looking for distributed training, artifact management, and automated deployment for high performance.

