Gemini: Google's AI That Reads, Sees, Hears, and Understands

About

Gemini is Google's most capable model. Built from the ground up for multimodality, it combines and reasons over text, code, images, audio, and video. The session opens with a coding demo of Gemini's multimodal capabilities on Google Cloud Vertex AI. We then build agents with Gemini, demonstrating context switching and tool calling, and show how Gemini handles large inputs. The session concludes by building a multimodal RAG application with Gemini.

 

Key Takeaways:

  • Understanding of Multimodality
  • Hands-on guidance for using Gemini on Google Cloud Vertex AI
  • Deep dive into Gemini's multimodal capabilities
  • Practical exposure to building agents with Gemini
  • Building scalable solutions with Gemini Models
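
For readers new to the idea, the retrieval step behind a multimodal RAG application can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the code shown in the session: the `Item` class, the `retrieve` and `build_prompt` helpers, and the toy three-dimensional embeddings are all hypothetical. A real application would obtain embeddings from an embedding model and send the assembled prompt to Gemini; no API calls are made here.

```python
import math
from dataclasses import dataclass


@dataclass
class Item:
    modality: str            # "text" or "image"
    content: str             # a text chunk, or a file name standing in for an image
    embedding: list[float]   # toy vector; a real system would use an embedding model


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either has zero length)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query_embedding: list[float], index: list[Item], k: int = 2) -> list[Item]:
    """Return the k items most similar to the query, regardless of modality."""
    ranked = sorted(index, key=lambda it: cosine(query_embedding, it.embedding), reverse=True)
    return ranked[:k]


def build_prompt(question: str, retrieved: list[Item]) -> str:
    """Assemble the retrieved items and the question into one prompt string."""
    context = "\n".join(f"[{it.modality}] {it.content}" for it in retrieved)
    return f"Context:\n{context}\n\nQuestion: {question}"


# Hypothetical toy index: text and image entries share one embedding space.
INDEX = [
    Item("text", "toy chunk about revenue", [0.9, 0.1, 0.0]),
    Item("image", "revenue_chart.png", [0.8, 0.2, 0.1]),
    Item("text", "toy chunk about office move", [0.0, 0.1, 0.9]),
]

if __name__ == "__main__":
    top = retrieve([1.0, 0.0, 0.0], INDEX, k=2)
    print(build_prompt("How did revenue change?", top))
```

Because text and image entries share one embedding space in this sketch, a single similarity ranking can return both kinds of context for the same question.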

Speaker

Stay informed about DHS 2025
