Apple’s New MGIE Model Lets You Edit Images Through Descriptions

K.C. Sabreena Basheer Last Updated : 08 Feb, 2024

Apple has unveiled an AI model named MGIE that lets users edit images simply by describing the changes they want in natural language. Developed in collaboration with the University of California, Santa Barbara, MGIE aims to streamline photo editing by replacing manual tool work with text prompts.


The MGIE Model

Apple’s latest model, Multimodal Large Language Model-Guided Image Editing (MGIE), uses a multimodal large language model to interpret user instructions and perform pixel-level manipulations. Unlike conventional editing software, MGIE is driven by text prompts rather than manual editing tools.


How MGIE Works

MGIE works by integrating Multimodal Large Language Models (MLLMs) into the image editing process. These models interpret the user's prompt and generate a visual representation of the desired edit, which then guides the pixel-level manipulation of the image. Because the model works out what a brief or ambiguous instruction means before any pixels change, users can describe the result they want rather than the steps needed to reach it.
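
The two-stage flow described above, interpreting the instruction and then applying a pixel-level edit, can be illustrated with a toy Python sketch. This is not MGIE's code or API: the `interpret` function is a hypothetical keyword-matching stand-in for the MLLM, `EditPlan` is an invented structure, and the "image" is just a small grid of grayscale values. It only shows how a text prompt can be turned into a structured edit that is then executed on pixels.

```python
from dataclasses import dataclass

# A tiny grayscale "image": rows of pixel intensities in the range 0..255.
Image = list[list[int]]


@dataclass
class EditPlan:
    """Hypothetical structured edit produced by the interpretation stage."""
    operation: str   # "brighten", "darken", or "flip"
    amount: int = 0  # intensity change used by brighten/darken


def interpret(prompt: str) -> EditPlan:
    """Stand-in for the MLLM stage: simple keyword matching, not a real model."""
    text = prompt.lower()
    if "bright" in text or "lighter" in text:
        return EditPlan("brighten", 40)
    if "dark" in text:
        return EditPlan("darken", 40)
    if "flip" in text or "mirror" in text:
        return EditPlan("flip")
    raise ValueError(f"unsupported instruction: {prompt!r}")


def apply_edit(image: Image, plan: EditPlan) -> Image:
    """Pixel-level stage: returns a new image and leaves the input untouched."""
    if plan.operation == "brighten":
        return [[min(255, p + plan.amount) for p in row] for row in image]
    if plan.operation == "darken":
        return [[max(0, p - plan.amount) for p in row] for row in image]
    if plan.operation == "flip":
        return [row[::-1] for row in image]
    raise ValueError(f"unknown operation: {plan.operation}")


def edit(image: Image, prompt: str) -> Image:
    """Run both stages: interpret the prompt, then apply the resulting plan."""
    return apply_edit(image, interpret(prompt))


photo = [[10, 120, 240],
         [0, 60, 200]]
print(edit(photo, "Make the photo brighter"))  # [[50, 160, 255], [40, 100, 240]]
print(edit(photo, "Mirror it left to right"))  # [[240, 120, 10], [200, 60, 0]]
```

In the real system, the interpretation stage is a learned model rather than a lookup, which is what allows it to handle instructions that no fixed keyword list could anticipate.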


Functionality and Capabilities

MGIE offers a wide range of editing functions, from simple color adjustments to complex object manipulations. Users can crop, resize, rotate, flip, and apply filters to images, all through natural language commands. MGIE also supports both global photo optimization, which adjusts the image as a whole, and local editing, which targets specific regions or objects.

Figure: Features and capabilities of Apple's new MGIE model

Open-Source Initiative and Industry Impact

Apple has released MGIE as an open-source project on GitHub, making AI-driven image editing more accessible to developers and researchers. By sharing its work with the developer community, Apple aims to foster innovation and collaboration in AI research. The release also signals Apple’s commitment to strengthening its own AI capabilities.


Our Say

MGIE offers a more intuitive and efficient approach to photo manipulation. By combining natural language processing with image editing techniques, it has the potential to change the way users interact with digital media. As Apple continues to invest in AI, we can expect further advances in creative tools that let users realize their ideas with less effort.


Sabreena Basheer is an architect-turned-writer who's passionate about documenting anything that interests her. She's currently exploring the world of AI and Data Science as a Content Manager at Analytics Vidhya.
