Freelance Data Scientist Develops Gemma-Based Telugu Model, Nandi

K.C. Sabreena Basheer Last Updated : 23 Apr, 2024

Artificial intelligence (AI) and AI-driven language models for regional languages are booming in India. The latest addition to this evolving landscape is Nandi, a Telugu language model built by freelance data scientist Bharadwaj Swarna. Rooted in cultural symbolism and aiming for linguistic precision, Nandi represents a significant stride towards inclusivity and accessibility in AI. Let’s delve into the creation and implications of this new Indic language model.

The Creation of Nandi AI

Bharadwaj Swarna, known for his expertise in AI and natural language processing, embarked on a journey fueled by a passion for democratizing access to information. Drawing from his cultural heritage, Swarna conceptualized Nandi with a vision to bridge language barriers, particularly for non-English speakers.

His commitment to linguistic diversity and inclusivity shines through as Nandi aims to facilitate seamless translation and comprehension for the Telugu-speaking community. Through meticulous fine-tuning and ongoing enhancements using Direct Preference Optimization (DPO), Nandi promises accurate and nuanced responses to Telugu language queries.
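
For readers unfamiliar with DPO, the core idea can be sketched in a few lines of Python. This is a minimal illustration of the standard DPO objective for a single preference pair, not Nandi's actual training code: the function name dpo_loss, the example log-probabilities, and the value beta = 0.1 are all assumptions made for illustration. The loss rewards the model being tuned (the policy) for preferring the chosen response over the rejected one more strongly than a frozen reference model does.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair.

    Each argument is the summed log-probability a model assigns to a
    full response. beta (illustrative value) scales how strongly the
    policy is pushed away from the reference model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(logits)), written in a numerically stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))

# Hypothetical log-probabilities for one Telugu prompt with two candidate answers.
baseline = dpo_loss(-5.0, -7.0, -5.0, -7.0)  # policy identical to reference
improved = dpo_loss(-4.0, -8.0, -5.0, -7.0)  # policy favors the chosen answer more
print(baseline, improved)
```

When the policy matches the reference, the loss equals log 2; it falls below that value as the policy shifts probability towards the preferred answer, which is the signal that preference tuning optimizes.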

Inspiration, Training, and Architecture

Nandi draws inspiration from the pioneering work of Ramsri Goutham Golla and Ravi Theja Desetty of Telugu LLM Labs. Built on Zephyr-7b-Gemma, it is trained on a dataset curated by Telugu LLM Labs. It reflects the collaborative spirit driving linguistic exploration and stands as a testament to the rising tide of Indic language models coming from the developer community in India.

Exploring the Landscape of Linguistic Diversity

In the same vein, Telugu LLM Labs has introduced Navarasa 2.0, following their earlier Telugu Llama model. The new Gemma 7B and 2B instruction-tuned models are capable of processing content in 15 Indian languages along with English. Meanwhile, Swarna’s plans for Nandi include expanding the datasets used for DPO and refining the tokenizer, which shows his dedication to continual improvement in Telugu language processing. As the AI landscape evolves, initiatives like these highlight a concerted push towards linguistic inclusivity.

Our Say

The unveiling of Nandi and the continued efforts of passionate individuals like Bharadwaj Swarna mark a transformative juncture in the intersection of AI and linguistic diversity. With each model and initiative, the horizon of linguistic exploration expands, fostering a more inclusive and accessible digital landscape. As we celebrate the richness of language and cultural heritage, let us embrace the journey towards linguistic equity. Let us work towards a future where every regional voice finds resonance in the digital sphere.

Sabreena Basheer is an architect-turned-writer who's passionate about documenting anything that interests her. She's currently exploring the world of AI and Data Science as a Content Manager at Analytics Vidhya.
