Multimodal Agentic Framework to Create Real Estate Brochures

Nibedita Dutta Last Updated : 24 Jan, 2025


Multimodal agentic frameworks represent a cutting-edge approach in artificial intelligence, integrating various data types—such as text, images, audio, and video—to enhance the capabilities of intelligent systems. These frameworks utilize intelligent agents that can autonomously process and analyze diverse information sources, enabling more nuanced understanding and decision-making. By combining multimodality with agentic functionalities, these systems can adapt in real time to dynamic environments and user interactions. This integration not only improves operational efficiency across industries but also enriches human-computer interactions, making them more intuitive and context-aware. As such, multimodal agentic frameworks are poised to transform how we engage with technology in numerous applications.

Learning Objectives

Understanding Agentic AI with Image Generation
Exploring Camel AI Functionalities
Developing a Multimodal Agentic System with CAMEL AI
Benefits to Real Estate Businesses

This article was published as a part of the Data Science Blogathon.

Table of contents

MultiModal Agentic AI: Agents with Image Generation
What is Camel AI?
Toolkits in Camel AI
Hands-On Implementation of a Multi-Modal Agentic System
Outputs
Conclusion
Frequently Asked Questions

MultiModal Agentic AI: Agents with Image Generation

Agentic AI represents a significant evolution in artificial intelligence, characterized by its autonomy and advanced decision-making capabilities. Integrating agentic frameworks with image generation offers several advantages:

Enhanced Creativity: These systems can assist in creative processes by generating unique visual content, enabling artists, designers, and marketers to explore new ideas and concepts efficiently.
Personalization: By generating tailored images based on user preferences or data inputs, agentic systems can create personalized experiences in marketing, advertising, and entertainment.
Rapid Prototyping: Agentic systems can quickly produce visual prototypes for products or concepts, facilitating faster iterations and feedback during the design process.
Data Visualization: They can transform complex data sets into intuitive visual representations, aiding in better understanding and communication of information across various fields such as business analytics and scientific research.
Accessibility: These systems can democratize access to high-quality visual content, allowing individuals and organizations without extensive design resources to create professional-grade images.
Automation of Repetitive Tasks: By automating the image generation process, agentic systems reduce the time and resources spent on routine design tasks, allowing human creators to focus on more strategic initiatives.

What is Camel AI?

Camel AI (short for Communicative Agents for Mind Exploration of Large-Scale Language Model Society) is an innovative framework dedicated to the development and research of autonomous, communicative agents. Its primary goal is to examine how AI systems interact and collaborate, reducing the need for human involvement in various tasks. Focusing on the analysis of behaviors, abilities, and potential risks within multi-agent systems, Camel AI is an open-source project designed to foster collaboration and drive innovation within the AI research community.

Core Modules in Camel AI

The CAMEL framework is designed for the creation and management of multi-agent systems, incorporating several key components. It includes Models for defining agent intelligence, Messages for communication, and Memory systems for data storage and retrieval. The framework also integrates Tools for specialized tasks, Prompts to guide agent behavior, and Tasks to manage workflows. The Workforce module enables the formation of agent teams for collaboration, while the Society module facilitates interaction among agents. Together, these components enable the development of dynamic, collaborative multi-agent environments.

Toolkits in Camel AI

A key strength of Camel AI is its integration with a diverse set of toolkits that extend what the agents in a multi-agent system can do. Key toolkits include:

Function Tool: This toolkit allows agents to call functions and interact with various APIs, facilitating complex task execution and integration with external services.
Reddit Toolkit: This toolkit enables agents to interact with the Reddit API, allowing them to collect top posts, perform sentiment analysis on comments, and monitor discussions across subreddits.
Retrieval Toolkit: Designed for information retrieval, this toolkit allows agents to query local vector storage systems, retrieving relevant information based on user queries.
Media Tools: This includes functionalities for processing images and audio, enabling agents to handle multimedia content effectively.
Document Tools: This toolkit provides capabilities for processing documents in various formats (e.g., PDF, Word) and includes web scraping features.
Web Tools: These tools enable agents to access and interact with web services, such as search engines and APIs like DuckDuckGo and Wikipedia.
DALL-E Integration: Camel AI also supports integration with image generation models like DALL-E, allowing agents to create images based on textual descriptions, enhancing their creative capabilities.
Search Toolkit: A toolkit for performing web searches using various search engines like Google, DuckDuckGo, Wikipedia, and Wolfram Alpha.

These toolkits collectively empower Camel AI to perform a wide range of tasks, from data retrieval and processing to multimedia handling and creative image generation.

DALL-E

DALL-E is a series of advanced text-to-image models developed by OpenAI that generate digital images based on natural language descriptions, known as prompts. The initial version was released in January 2021, followed by DALL-E 2 in 2022, and the latest iteration, DALL-E 3, was integrated into ChatGPT and made available in late 2023.

DALL-E can create images in various styles, including photorealistic images and artistic renditions. It can manipulate and rearrange objects within images and infer details not explicitly mentioned in prompts.

Hands-On Implementation of a Multi-Modal Agentic System

In the following hands-on tutorial, we create a multi-modal agentic system using CAMEL AI that designs brochures for upcoming real estate projects in a city. This can help real estate businesses considerably: whenever a new project comes up, the brochures handed out to clients can be created automatically, with minimal human intervention.

Step 1. Installation of Necessary Libraries

!pip install 'camel-ai[all]'
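Toolkit names and signatures can differ between releases of a library, so it may help to record which version was actually installed before running the later steps. The following is a minimal sketch using only the Python standard library; the helper name installed_version is an assumption made for illustration and is not part of CAMEL.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package_name):
    """Return the installed version string of a package, or None if absent."""
    try:
        return version(package_name)
    except PackageNotFoundError:
        return None


# "camel-ai" is the distribution name used in the pip command above.
camel_version = installed_version("camel-ai")
if camel_version is None:
    print("camel-ai is not installed; run the pip command above first.")
else:
    print(f"camel-ai {camel_version} is installed.")
```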

Step 2. Defining Open AI API Keys

import os
os.environ['OPENAI_API_KEY'] = ''
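Step 2 leaves the key as an empty string to be filled in by hand. As an alternative that keeps the key out of the notebook, the sketch below reads it from the environment and only prompts when it is missing. The helper load_api_key and its parameters are hypothetical names introduced here for illustration.

```python
import os
from getpass import getpass


def load_api_key(env_var="OPENAI_API_KEY", prompt_if_missing=True):
    """Return the key from the environment, optionally prompting for it."""
    key = os.environ.get(env_var, "").strip()
    if not key and prompt_if_missing:
        # getpass hides the typed value so it does not appear in the output.
        key = getpass(f"Enter {env_var}: ").strip()
    if not key:
        raise RuntimeError(f"{env_var} is not set")
    # Store the cleaned value so libraries that read the variable can find it.
    os.environ[env_var] = key
    return key


# Usage (call once before creating any agent):
# load_api_key()
```

Failing early with a clear error is a design choice here: an empty key would otherwise only surface later as an authentication error from the API.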

Step 3. Importing Necessary Libraries

from camel.agents.chat_agent import ChatAgent
from camel.messages.base import BaseMessage
from camel.models import ModelFactory
from camel.societies.workforce import Workforce
from camel.tasks.task import Task
from camel.toolkits import (
    DalleToolkit,
    FunctionTool,
    SearchToolkit,
)

from camel.types import ModelPlatformType, ModelType

# Patch asyncio so event loops can be nested, which is needed when running inside a notebook
import nest_asyncio
nest_asyncio.apply()

Step 4. Defining the Agents

#Wrap the DuckDuckGo search method so that agents can call it as a tool
search_toolkit = SearchToolkit()
search_tools = [FunctionTool(search_toolkit.search_duckduckgo)]


#Define the Model for the Agent as well. Default model is "gpt-4o-mini" and model platform type is OpenAI
guide_agent_model = ModelFactory.create(
        model_platform=ModelPlatformType.DEFAULT,
        model_type=ModelType.DEFAULT,
    )  

#Defining the Real Estate Agent for crafting the brochures
real_estate_agent = ChatAgent(
        BaseMessage.make_assistant_message(
            role_name="Real Estate Specialist",
            content="You are a Real Estate Specialist who is an expert in creating Description of Upcoming Residential Projects",
        ),
        model=guide_agent_model,
    )

#Defining the Agent for Real Estate Property Names
property_title_agent = ChatAgent(
        BaseMessage.make_assistant_message(
            role_name="Real Estate Project Name Specialist",
            content="You are a Real Estate Project Name Specialist who is an expert in Generating Trendy Names for Residential Projects in India",
        ),
        model=guide_agent_model,
    )


#Defining the agent for generating all the amenities near a location
location_benefits_agent = ChatAgent(
        BaseMessage.make_assistant_message(
            role_name="Real Estate Location Specialist",
            content="You are a Real Estate Location Specialist who is an expert in Generating All the amenities like malls, airports, markets, metro stations, railway stations etc with distances from a location of the mentioned property",
        ),
        model=guide_agent_model,
        tools=search_tools,
    )


#Define the image generation tool for the Agent using the DALL-E toolkit
dalletool = DalleToolkit()
imagegen_tools = [
    FunctionTool(dalletool.get_dalle_img),
]

#Define the Image Generation Agent with the pre-defined model and tools and Prompt
image_generation_agent = ChatAgent(
        system_message=BaseMessage.make_assistant_message(
            role_name="Image Generation Specialist",
            content="You can Generate Images For Upcoming Real Estate Projects For Showing to Clients",
        ),
        model=guide_agent_model,
        tools=imagegen_tools,
    )

This code snippet defines several agents using a model factory and a chat agent framework.

Model Creation: It first creates a default model (guide_agent_model) that all the agents share. The default is the “gpt-4o-mini” model from OpenAI.
Real Estate Agents: Two agents are instantiated: one as a “Real Estate Specialist” focused on creating descriptions for upcoming residential projects, and another as a “Real Estate Project Name Specialist” tasked with generating trendy names for residential projects in India.
Real Estate Location Specialist: This agent lists nearby amenities such as malls, airports, markets, metro stations, and railway stations, with their distances from the location of the property. It is the only agent given the DuckDuckGo search tool.
Image Generation Tool: An image generation tool (dalletool) is defined, which allows agents to generate images related to real estate projects.
Image Generation Agent: Finally, an “Image Generation Specialist” agent is created, equipped with the previously defined model and image generation tools to create visuals for upcoming real estate projects to present to clients.
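For a model to call a tool, it needs a machine-readable description of the function: its name, what it does, and which parameters it takes. The sketch below illustrates that idea with the standard library only. It is a simplified illustration, not CAMEL's FunctionTool implementation, and both estimate_travel_minutes and describe_tool are hypothetical names introduced here.

```python
import inspect


# Hypothetical tool: a plain function with type hints and a docstring.
def estimate_travel_minutes(distance_km: float, speed_kmph: float = 30.0) -> float:
    """Estimate travel time in minutes for a given road distance."""
    return round(distance_km / speed_kmph * 60, 1)


# Map Python annotations to JSON-schema type names (unknown types fall back to string).
_JSON_TYPES = {int: "integer", float: "number", str: "string", bool: "boolean"}


def describe_tool(func):
    """Build a simplified, JSON-schema-like description of a function."""
    signature = inspect.signature(func)
    properties, required = {}, []
    for name, param in signature.parameters.items():
        properties[name] = {"type": _JSON_TYPES.get(param.annotation, "string")}
        # Parameters without a default value must be supplied by the caller.
        if param.default is inspect.Parameter.empty:
            required.append(name)
    return {
        "name": func.__name__,
        "description": inspect.getdoc(func) or "",
        "parameters": {
            "type": "object",
            "properties": properties,
            "required": required,
        },
    }


schema = describe_tool(estimate_travel_minutes)
print(schema["name"], schema["parameters"]["required"])
```

This is why clear docstrings and type hints matter for any custom function handed to an agent: the description is the only thing the model sees when deciding whether and how to call it.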

Step 5. Defining the WorkForce

#Define the workforce that can take care of multiple agents
workforce = Workforce('Real Estate Brochure Generator')
workforce.add_single_agent_worker(
        "Real Estate Specialist",
        worker=real_estate_agent).add_single_agent_worker(
        "Real Estate Project Name Specialist",
        worker=property_title_agent).add_single_agent_worker(
        "Location Amenity Specialist",worker=location_benefits_agent).add_single_agent_worker(
        "Image Generation Specialist",
        worker=image_generation_agent)

# Specify the exact task to be solved
human_task = Task(
        content=(
            """Craft a Brochure Content For an Upcoming Residential Real Estate Project in Sector 47, Gurgaon. The content should contain all the types of flats it has, all amenities in it and other such necessary details.
            Provide a Name for this Property as well.
            Generate all the amenities of the location (with respect to its proximity to all public places) to this brochure content.
            Generate an Image of this Upcoming Project as well."""
        ),
        id='0',
    )
task = workforce.process_task(human_task)
# Print the combined result returned by the workforce
print(task.result)

This code defines a workforce that manages multiple agents for generating a real estate brochure. It adds four agents: a Real Estate Specialist, a Real Estate Project Name Specialist, a Location Amenity Specialist, and an Image Generation Specialist. It then specifies a task for the workforce to complete: creating brochure content, providing a project name, listing the amenities near the location, and generating an image for a new real estate project in Gurgaon. The workforce processes the task by coordinating the agents to execute their respective roles.
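The coordination step can be pictured as splitting the task into subtasks and matching each one to the best-suited worker. The toy sketch below makes that concrete without any LLM: it routes subtasks by simple keyword matching. It is only an illustration of the routing idea under that assumption, not how CAMEL's Workforce decides, and the Worker class, the keywords, and the assign function are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Worker:
    description: str
    keywords: List[str]
    handle: Callable[[str], str]


def assign(subtask: str, workers: List[Worker]) -> Worker:
    """Pick the worker whose keywords appear most often in the subtask."""
    text = subtask.lower()
    return max(workers, key=lambda w: sum(k in text for k in w.keywords))


# Stand-in handlers return fixed strings instead of calling a model.
workers = [
    Worker("Real Estate Specialist", ["brochure", "flats"], lambda t: "brochure text"),
    Worker("Real Estate Project Name Specialist", ["name"], lambda t: "project name"),
    Worker("Location Amenity Specialist", ["amenities", "proximity"], lambda t: "amenity list"),
    Worker("Image Generation Specialist", ["image"], lambda t: "image path"),
]

subtasks = [
    "Craft brochure content listing all types of flats",
    "Provide a name for this property",
    "Generate the amenities of the location by proximity",
    "Generate an image of the upcoming project",
]

results: Dict[str, str] = {}
for subtask in subtasks:
    worker = assign(subtask, workers)
    results[worker.description] = worker.handle(subtask)

for description, output in results.items():
    print(f"{description}: {output}")
```

The sketch also shows why the description given to each worker should be distinct and specific: routing of any kind, keyword-based or model-based, has only those descriptions to go on.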

Outputs

1. Output from Brochure Content Agent

Upcoming Residential Project in Sector 47, Gurgaon

Welcome to Your New Home
Discover the perfect blend of luxury and comfort in our upcoming residential
 project located in the heart of Sector 47, Gurgaon. Designed to cater to
 diverse lifestyles, our project offers a variety of flats that promise to
 meet your needs and exceed your expectations.

---

Flat Types Available:

1. **1 BHK Flats**
- **Size:** 600 sq. ft.
- **Description:** Ideal for young professionals or couples, these cozy 1 BHK
 flats feature an open living area, a modern kitchen, and a comfortable
 bedroom. Enjoy a well-designed space that maximizes functionality without
 compromising on style.

2. **2 BHK Flats**
- **Size:** 1,200 sq. ft.
- **Description:** Perfect for small families, our 2 BHK flats offer spacious
 living areas, two well-appointed bedrooms, and ample storage. Experience a
 harmonious blend of elegance and practicality, with large windows that
 invite natural light into your home.

3. **3 BHK Flats**
- **Size:** 1,800 sq. ft.
- **Description:** Designed for larger families, these expansive 3 BHK flats
 provide generous living spaces, three bedrooms, and a modern kitchen. Enjoy
 the luxury of space and comfort, with thoughtfully designed layouts that
 cater to your family’s needs.

4. **Penthouse Suites**
- **Size:** 2,500 sq. ft.
- **Description:** Elevate your living experience with our exclusive
 penthouse suites. Featuring stunning views, expansive terraces, and high-end
 finishes, these luxurious homes are perfect for those who appreciate the
 finer things in life. Enjoy private outdoor spaces and a lifestyle of
 sophistication.

---

 Amenities:
- **Clubhouse:** A state-of-the-art clubhouse with recreational facilities.
- **Swimming Pool:** Relax and unwind in our beautifully designed pool.
- **Gymnasium:** Stay fit with our fully equipped gym.
- **Landscaped Gardens:** Enjoy serene green spaces for relaxation and
 leisure.
- **24/7 Security:** Ensuring your safety and peace of mind.

---

Location Benefits:
- Proximity to major schools, hospitals, and shopping centers.
- Excellent connectivity to Delhi and other parts of Gurgaon.
- A vibrant neighborhood with parks, restaurants, and entertainment options.

---

Conclusion:
Don’t miss the opportunity to be a part of this exceptional residential
 community in Sector 47, Gurgaon. Whether you are looking for a cozy 1 BHK or
 a luxurious penthouse, we have the perfect home waiting for you. For more
 information and to schedule a visit, contact us today!

2. Output from Real Estate Project Name Specialist Agent

**Gurgaon Heights**

3. Output from Location Amenity Specialist Agent

Amenities and Proximity to Public Places near Gurgaon Heights, Sector 47,
 Gurgaon

1. **Shopping Malls:**
- **Ambience Mall** - 5 km
- **DLF Mega Mall** - 4.5 km
- **Sahara Mall** - 6 km

2. **Metro Stations:**
- **Huda City Centre Metro Station** - 4 km
- **Sikandarpur Metro Station** - 7 km

3. **Railway Stations:**
- **Gurgaon Railway Station** - 8 km
- **New Delhi Railway Station** - 30 km

4. **Airports:**
- **Indira Gandhi International Airport** - 15 km

5. **Schools:**
- **The Shri Ram School** - 2 km
- **G.D. Goenka Public School** - 3 km
- **Delhi Public School, Sector 45** - 3.5 km

6. **Hospitals:**
- **Medanta - The Medicity** - 6 km
- **Fortis Memorial Research Institute** - 5 km
- **Max Hospital, Gurgaon** - 7 km

7. **Parks and Recreation:**
- **Aravali Golf Course** - 3 km
- **Leisure Valley Park** - 4 km
- **Sukhna Lake Park** - 5 km

8. **Restaurants and Cafes:**
- **Cyber Hub** - 6 km
- **Sector 29 Food Street** - 5 km
- **The Great India Place** - 7 km

9. **Entertainment:**
- **PVR Cinemas, Ambience Mall** - 5 km
- **Kingdom of Dreams** - 8 km

4. Output from Image Generation Specialist
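Once the workforce has returned the project name, the description, the amenity list, and the image, they could be stitched into a single brochure file. The sketch below does this as Markdown using plain string handling. The function build_brochure, the shortened sample strings, and the image path img/project.png are placeholders assumed for illustration, not outputs of the run above.

```python
from typing import Optional


def build_brochure(name: str, description: str, amenities: str,
                   image_path: Optional[str] = None) -> str:
    """Combine the agents' text outputs into one Markdown brochure."""
    title = name.strip()
    parts = [
        f"# {title}",
        description.strip(),
        "## Location Benefits",
        amenities.strip(),
    ]
    if image_path:
        # Place the generated image directly under the title.
        parts.insert(1, f"![{title}]({image_path})")
    return "\n\n".join(parts) + "\n"


brochure = build_brochure(
    name="Gurgaon Heights",
    description="An upcoming residential project in Sector 47, Gurgaon.",
    amenities="- Huda City Centre Metro Station - 4 km",
    image_path="img/project.png",  # hypothetical path to the generated image
)
print(brochure)
```

Writing the returned string to a file, for example with pathlib.Path("brochure.md").write_text(brochure), gives a document that can then be converted to PDF or HTML with whichever tool the team already uses.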

Conclusion

In conclusion, integrating agentic AI systems with image generation capabilities, as the Camel AI framework does, is a significant step forward for both creativity and automation. By combining autonomous decision-making with image generation tools, such multimodal agentic systems offer considerable potential for rapid prototyping, personalized experiences, and broader access to high-quality visual content. As Camel AI continues to evolve, it can drive innovation across industries, reducing human involvement in routine tasks while freeing people for more strategic and creative endeavours.

Key Takeaways

Autonomous Creativity: Agentic AI systems with image generation capabilities enhance creative processes, allowing artists and designers to quickly generate unique and innovative visual content.
Personalized Experiences: These systems can tailor images based on user preferences, enabling customized marketing, advertising, and entertainment experiences.
Efficient Prototyping: Agentic AI accelerates the prototyping process by generating visual prototypes rapidly, fostering quicker iterations and feedback in design workflows.
Data Visualization: Agentic AI systems can convert complex data into clear, visually intuitive representations, aiding in better understanding and communication across diverse fields.
Multi-Agent Collaboration: Camel AI’s framework promotes collaboration among autonomous agents, enhancing task execution and facilitating the development of advanced, multi-agent systems for a wide range of applications.

The media shown in this article is not owned by Analytics Vidhya and is used at the Author’s discretion.

Frequently Asked Questions

Q1. What are Agentic AI systems and how do they work with image generation?

Ans. Agentic AI systems are autonomous AI frameworks with advanced decision-making capabilities. When integrated with image generation capabilities, they can create unique visual content, enhance creativity, and automate tasks, making processes like design, marketing, and prototyping more efficient.

Q2. How can Agentic AI benefit creative professionals?

Ans. Agentic AI helps creative professionals like artists, designers, and marketers by generating tailored and unique visual content. This assists in exploring new ideas, improving creativity, and speeding up design iterations and prototyping.

Q3. What is Camel AI and how does it support multi-agent collaboration?

Ans. Camel AI is an open-source framework for developing autonomous, communicative agents. It promotes collaboration among agents through its modules and toolkits, enabling dynamic, multi-agent systems that can interact, share data, and perform complex tasks without human intervention.

Q4. What types of tasks can Camel AI’s toolkits help with?

Ans. Camel AI’s toolkits support a variety of tasks, including information retrieval, sentiment analysis, image processing, document handling, and web interactions. Additionally, it integrates with models like DALL-E to generate images based on textual input, expanding its creative capabilities.

Q5. How does Camel AI enable automation and reduce human involvement?

Ans. By using its multi-agent system and specialized toolkits, Camel AI automates repetitive and complex tasks such as data processing, image generation, and workflow management. This reduces the need for human input, allowing users to focus on strategic and creative endeavours.

Nibedita Dutta

Nibedita completed her master’s in Chemical Engineering from IIT Kharagpur in 2014 and is currently working as a Senior Data Scientist. In her current capacity, she works on building intelligent ML-based solutions to improve business processes.
