Skip to content

Commit

Permalink
docs: update multimodal agents guide and mint.json configuration
Browse files Browse the repository at this point in the history
  • Loading branch information
tonykipkemboi committed Jan 15, 2025
1 parent 835557e commit c12343a
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 6 deletions.
10 changes: 5 additions & 5 deletions docs/how-to/multimodal-agents.mdx
Original file line number Diff line number Diff line change
@@ -1,14 +1,14 @@
---
title: Using Multimodal Agents
description: Learn how to enable and use multimodal capabilities in your agents for processing images and other non-text content within the CrewAI framework.
icon: image
icon: video
---

# Using Multimodal Agents
## Using Multimodal Agents

CrewAI supports multimodal agents that can process both text and non-text content like images. This guide will show you how to enable and use multimodal capabilities in your agents.

## Enabling Multimodal Capabilities
### Enabling Multimodal Capabilities

To create a multimodal agent, simply set the `multimodal` parameter to `True` when initializing your agent:

Expand All @@ -25,7 +25,7 @@ agent = Agent(

When you set `multimodal=True`, the agent is automatically configured with the necessary tools for handling non-text content, including the `AddImageTool`.

## Working with Images
### Working with Images

The multimodal agent comes pre-configured with the `AddImageTool`, which allows it to process images. You don't need to manually add this tool - it's automatically included when you enable multimodal capabilities.

Expand Down Expand Up @@ -108,7 +108,7 @@ The multimodal agent will automatically handle the image processing through its
- Process image content with optional context or specific questions
- Provide analysis and insights based on the visual information and task requirements

## Best Practices
### Best Practices

When working with multimodal agents, keep these best practices in mind:

Expand Down
2 changes: 1 addition & 1 deletion docs/mint.json
Original file line number Diff line number Diff line change
Expand Up @@ -91,7 +91,7 @@
"how-to/custom-manager-agent",
"how-to/llm-connections",
"how-to/customizing-agents",
"how-to/multimodal-agents.mdx",
"how-to/multimodal-agents",
"how-to/coding-agents",
"how-to/force-tool-output-as-result",
"how-to/human-input-on-execution",
Expand Down

0 comments on commit c12343a

Please sign in to comment.