Enhance ApifyTool with additional methods #79

TerminalGravity · 2024-07-23T18:36:14Z

This commit enhances the ApifyTool by adding the following methods:

get_dataset: Get the dataset associated with a given actor run
get_dataset_items: Get all items from a given dataset
get_key_value_store: Get the key-value store associated with a given actor run
download_file: Download a file from the key-value store of a given actor run
store_dataset: Store data in an Apify dataset

These additional methods provide more functionality for interacting with Apify services, such as fetching datasets, accessing key-value stores, downloading files, and storing data in datasets.

The README.md file has also been updated to reflect these changes and provide usage examples for the new methods.

This commit enhances the ApifyTool by adding the following methods: - get_dataset: Get the dataset associated with a given actor run - get_dataset_items: Get all items from a given dataset - get_key_value_store: Get the key-value store associated with a given actor run - download_file: Download a file from the key-value store of a given actor run - store_dataset: Store data in an Apify dataset These additional methods provide more functionality for interacting with Apify services, such as fetching datasets, accessing key-value stores, downloading files, and storing data in datasets. The README.md file has also been updated to reflect these changes and provide usage examples for the new methods.

TerminalGravity · 2024-07-31T01:03:43Z

Hi, can I get some feedback on this pull request?

joaomdmoura · 2024-08-11T02:49:42Z

Pretty interesting, but I think we might want to break some of those function into individual tools so we can be specific around attributes as well

joaomdmoura

thanks for the initial push!! It's looking good

joaomdmoura · 2024-08-11T02:50:28Z

crewai_tools/tools/apify_tool/README.md

+- `store_dataset`: Store data in an Apify dataset.
+
+You can use these methods to interact with various Apify services, such as running actors, waiting for actor runs to complete, fetching datasets, accessing the key-value store, downloading files, and storing data in datasets.
+


If you don't mind taking some inspiration from existing READMEs for other tools, it would be nice to have most following a standard

joaomdmoura · 2024-08-11T02:50:53Z

crewai_tools/tools/__init__.py

@@ -37,3 +37,4 @@
 from .youtube_channel_search_tool.youtube_channel_search_tool import YoutubeChannelSearchTool
 from .youtube_video_search_tool.youtube_video_search_tool import YoutubeVideoSearchTool
 from .spider_tool.spider_tool import SpiderTool
+from .apify_tool.apify_tool import ApifyTool


Nice, I think we just miss one extra import on the other init.py file 💪🏼

joaomdmoura · 2024-08-11T02:51:42Z

crewai_tools/tools/apify_tool/apify_tool.py

+
+        # Execute the Apify action using the client
+        # For example, to run an actor:
+        actor_run = self.run_actor(actor_id="your_actor_id", run_input={"query": query})


I think we want the agent to pass the actor id in this case together with the query?
Also might be worth having another tool to lsit the actor ids so it can decide / learn what to call?

joaomdmoura · 2024-08-11T02:52:36Z

crewai_tools/tools/apify_tool/apify_tool.py

+    def get_dataset(self, actor_run: ActorRun, dataset_id: str = None) -> Dataset:
+        """
+        Get the dataset associated with the given actor run.
+        """
+        if dataset_id:
+            return self.client.dataset.get(dataset_id)
+        else:
+            return actor_run.dataset
+
+    def get_dataset_items(self, dataset: Dataset) -> List[Dict[str, Any]]:
+        """
+        Get all items from the given dataset.
+        """
+        return list(dataset.iterate_items())
+
+    def get_key_value_store(self, actor_run: ActorRun) -> Dict[str, Any]:
+        """
+        Get the key-value store associated with the given actor run.
+        """
+        return actor_run.key_value_store
+
+    def download_file(self, actor_run: ActorRun, key: str, file_path: str):
+        """
+        Download a file from the key-value store of the given actor run.
+        """
+        with open(file_path, "wb") as file:
+            file.write(actor_run.key_value_store.get_file(key).read())
+
+    async def _arun(self, query: str) -> str:
+        raise NotImplementedError("This tool does not support async mode.")


I think these specifically cpould be broken into other tools as well so it's pretty specific, hte way it's setup now the agent won't be able to use them

TerminalGravity marked this pull request as draft July 23, 2024 18:36

TerminalGravity marked this pull request as ready for review July 23, 2024 18:36

joaomdmoura requested changes Aug 11, 2024

View reviewed changes

theCyberTech added the under review label Sep 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance ApifyTool with additional methods #79

Enhance ApifyTool with additional methods #79

TerminalGravity commented Jul 23, 2024

TerminalGravity commented Jul 31, 2024

joaomdmoura commented Aug 11, 2024

joaomdmoura left a comment

joaomdmoura Aug 11, 2024

joaomdmoura Aug 11, 2024

joaomdmoura Aug 11, 2024

joaomdmoura Aug 11, 2024

		- `store_dataset`: Store data in an Apify dataset.

		You can use these methods to interact with various Apify services, such as running actors, waiting for actor runs to complete, fetching datasets, accessing the key-value store, downloading files, and storing data in datasets.

Enhance ApifyTool with additional methods #79

Are you sure you want to change the base?

Enhance ApifyTool with additional methods #79

Conversation

TerminalGravity commented Jul 23, 2024

TerminalGravity commented Jul 31, 2024

joaomdmoura commented Aug 11, 2024

joaomdmoura left a comment

Choose a reason for hiding this comment

joaomdmoura Aug 11, 2024

Choose a reason for hiding this comment

joaomdmoura Aug 11, 2024

Choose a reason for hiding this comment

joaomdmoura Aug 11, 2024

Choose a reason for hiding this comment

joaomdmoura Aug 11, 2024

Choose a reason for hiding this comment