Skip to content

Releases: phidatahq/phidata

v2.7.7

09 Jan 09:29
04440aa
Compare
Choose a tag to compare

Changelog

New Features:

  • Image-to-Image Generation Support: Extended FalTools to support image-to-image generation.
  • MongoDB Integrations: Added MongoDB support across agent storage, agent knowledgebase, agent memory, and workflow storage.
  • Download Image Utility: Introduced a utility to download agent-generated images directly for enhanced workflows.

Improvements:

  • Audio Transcript Handling: Refined the handling of audio transcripts to ensure accurate message output when processing audio files
  • Optimized Chunking: Enhanced fixed and recursive chunking methods to improve document processing performance.
  • HuggingFace Embedder: Resolved compatibility issues with HuggingFace embedder for better reliability.
  • Ollama Embedder: Upgraded the Ollama embedder for more robust and efficient embedding capabilities.
  • OpenAI o1 Model Specification Support: Added support for OpenAI o1 model specifications, enabling compatibility with the latest OpenAI features.
  • Spider Tool Improvements: Addressed performance issues with the Spider tool to enhance reliability.
  • LanceDB Enhancements: Added support for accessing existing database tables in LanceDB
  • Github tool repo creation: Added support for repo creation via the Github tool.
  • ScrapeGraph tool addition: Adds integration with ScrapeGraph AI for smart web scraping and markdown conversion capabilities.

Bug Fixes:

  • OpenAI rejecting functions with too large description: This issue was discovered with Composio tools, but affected all tools. Resolved and made tool definitions better match JSON Schema.
  • Google Embedder Compatibility: Fixed an issue with Pydantic 2.10.x that caused the rejection of the Google embedder, restoring full functionality.
  • ChromaDB Upsert Issue: Resolved bugs that caused errors during upsert operations in ChromaDB.
  • Async Error Messaging: Improved error messages for unsupported async operations in models.
  • Gemini Tool Parameters: Fixed an issue with parameter handling in Gemini tools

What's Changed

New Contributors

Full Changelog: v2.7.6...v2.7.7

v2.7.6

30 Dec 16:45
be67e8d
Compare
Choose a tag to compare

Changelog

Extending Vector database support

This update adds new vector database integrations, improvements to DuckDuckGo tool and Gemini model and new cookbooks.

New Features

Cassandra Vector Database: Introduced support for Apache Cassandra as a Vector Database, leveraging CassIO for vector storage and retrieval.
ClickHouse as a Vector Database: Added support for ClickHouse as a Vector Database

Improvements

DuckDuckGo Modifier: Added a modifier parameter to the DuckDuckGo tool, allowing users to refine searches with site-specific queries, file type filters, and safe search toggles.
Enhanced Error Handling for python-docx: Improved error messaging for scenarios where the python-docx library is not installed, providing clearer guidance and smoother debugging.

Bug Fixes

Qdrant Default Embedder : Removed OpenAI as the default embedder for Qdrant, enabling user-configurable embeddings.
Gemini Import Fix : Fixed an issue where missing OpenAI library dependencies caused errors when using Gemini.

What's Changed

New Contributors

Full Changelog: v2.7.5...v2.7.6

v2.7.5

20 Dec 14:51
b65c7a7
Compare
Choose a tag to compare

This update introduces the Confluence tool for collaboration, improves tool compatibility, and resolves deep copy issues for Ollama chat agents.

New Features:

  • Confluence Tool: Added a new tool using the Atlassian Confluence SDK, enabling operations such as listing pages in a space, creating and updating pages, retrieving page content by title, and getting space details.

Improvements:

  • Tool Compatibility: Enhanced older custom functions with manually specified descriptions and parameters to align with the updated tool-building system.

Bug Fixes:

  • Deep Copy for Ollama Chat Agents: Addressed an issue where manually set clients caused errors during agent model copying, ensuring all properties are properly handled.

What's Changed

New Contributors

Full Changelog: v2.7.4...v2.7.5

v2.7.4

18 Dec 15:21
1688f6d
Compare
Choose a tag to compare

This release enhances multimodal capabilities with audio support, improves session page performance, and fixes various bugs for better stability and usability.

New Features

  • Audio Response Support: Added support for audio responses, enhancing multimodal interaction capabilities.
  • Audio Generation Tools: Integrated Eleven Labs for Audio Generation.
  • Cohere Embedder: Introduced a new Cohere Embedder class with a corresponding cookbook example to demonstrate its usage.
  • JSON and YAML Agent Storage: Now you can persist your data locally in JSON and YAML extension files

Improvements

  • Version Checker for OpenAI: Added a warning for users with OpenAI versions below 1.52.0 to ensure compatibility with features like audio in ChatCompletionMessage.
  • Agent Response Handling: Enhanced processing of agent responses to support lists, improving handling of multi-item outputs.

Bug Fixes

  • AWS Bedrock Tool Descriptions: Fixed an issue where the transfer tool description was missing, causing incompatibility with AWS Bedrock Claude.
  • Response Content Handling: Resolved crashes on the session page caused by non-string response content.
  • Deep Copy Agent Memory: Addressed deep copy errors when using agent memory on the playground.
  • Session Page Enhancements: Fixed the refresh button
  • Fix Tool Parsing for Ollama: Fixed JSON schema tool parsing by transforming ['string', 'null'] parameters to 'string' for compatibility.
  • Response Parsing for Gemini Tool: Improved response parsing to handle unserializable objects in tool_calls for Gemini on the playground.
  • Memory Handling for Google Provider: Fixed an issue in monitoring_data where memory was removed for all providers, causing blank titles on Phidata.app; now only modifies memory for Google provider.
  • RecursiveChunking ID Conflict: Resolved an issue in RecursiveChunking where processing large files with multiple chunks caused duplicate chunk record IDs, leading to psycopg.errors.UniqueViolation.

What's Changed

New Contributors

Full Changelog: v2.7.3...v2.7.4

v2.7.3

16 Dec 14:19
Compare
Choose a tag to compare

Changelog

Multi-Modal Tools

This update introduces new multimodal tools, enhances the multimodal capabilities of existing models, and includes several quality-of-life improvements.

New Features

Giphy Tool: Added Giphy integration to enhance creative collaboration.
Native image upload support for Claude: Added native support for uploading images natively to the Anthropic API.
Youtube Knowledge base: Added support for new YouTube knowledge base, allowing it to be loaded directly using YouTube video links.

Improvements

API key validation: Added API key validation for model classes.
Gemini audio: Improved native audio upload support for Gemini Model.
YoutubeTools: A new Youtube tool which allows generation of timestamps
Workspace Configuration Flexibility: Refactored type-checking logic using isinstance() to enhance flexibility and maintainability.
Web Crawler Stability: Switched to crawl4ai async to resolve issues and improve performance.
Memory Optimization: Improved memory usage in large-scale workflows for better efficiency.
User Interface: Refined the UI for better tracking of team activities.

Bug Fixes

Role-Based Access Control Bugs: Resolved issues affecting access permissions.
Gemini Functions: Fixed errors when functions had no specified parameters.

What's Changed

New Contributors

Full Changelog: v2.7.2...v2.7.3

v2.7.2

12 Dec 19:16
Compare
Choose a tag to compare

Changelog

Improvements

Improvements to Gemini Multimodality

Full Changelog: v2.7.1...v2.7.2

v2.7.1

12 Dec 17:22
Compare
Choose a tag to compare

Changelog

New Features

Gemini Multimodal Support: Added support for multimodal (image, video, text) input processing with Gemini.
Mem0 Integration Example: Introduced a cookbook example demonstrating Mem0 integration for Agent memory.
CSV URL Knowledgebase: Added functionality to create and manage knowledgebases from CSV URLs.

Improvements

Vertex AI Gemini 2 Update: The Vertex AI class has been updated to support the Gemini 2 model

Bug Fixes

Structured Output Fix: Resolved issues with Ollama structured output to ensure consistent data formatting.

What's Changed

Full Changelog: v2.7.0...v2.7.1

v2.7.0

12 Dec 10:55
7871732
Compare
Choose a tag to compare

This update introduces image and video multi-modal support for the agent playground and adds image and video generation tools like FAL, replicate, and ModelLabs.

Highlights

  • Support for video: Agents now natively support video
  • Multi-Modal rendering on Agent Playground: Agent Playground can now render images and videos

New Feature

  • New Tools: Added Replicate, FAL, and ModelLabs tools to generate video and images

Improvements

  • Various cookbook examples were added to cover real-world agent use cases

New Contributors

Full Changelog: v2.6.7...v2.7.0

v2.6.7

11 Dec 23:05
Compare
Choose a tag to compare

What's Changed

Full Changelog: v2.6.6...v2.6.7

v2.6.6

10 Dec 15:07
Compare
Choose a tag to compare

Full Changelog: v2.6.5...v2.6.6