Use keyword matching for CodeAct microagents #4568
base: main
Conversation
user_turns_processed = 0
for message in reversed(messages):
    if message.role == 'user' and user_turns_processed < 2:
        message.content[-1].cache_prompt = True
It looks like the re-ordering here would make the reminder TextContent get the "cache_prompt"? That would break prompt caching, since the reminder text changes with each request so the cache is never hit.
I'm pretty sure the logic is unchanged here, but could be wrong. messages should be exactly the same here (system, user example, rest) as it was previously when we did this check. I just had to move the logic around, because now the example_user_message function needs the most recent user message in order to do the keyword-matching.
LMK if there's a good way to test this
Let's say the last user message is simple, no images, just a string. Previously (on main):
- it will be in the Message, represented as a list with a single TextContent object
- we add the cache marker on it
- then we add the reminder as a second TextContent in the list
Result: we ask the Anthropic API to cache the actual content only, not the reminder.
Currently on this branch, if the GitHub diff doesn't fool me 😅:
- the last Message would have in its list the TextContent with the actual content
- we add the reminder in a second TextContent
- we add the cache marker to the last TextContent => that's the reminder now.
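A minimal sketch of the ordering difference, with stand-in Message/TextContent classes and a made-up reminder string (not the exact OpenHands types or text):

from dataclasses import dataclass, field

@dataclass
class TextContent:
    text: str
    cache_prompt: bool = False

@dataclass
class Message:
    role: str
    content: list = field(default_factory=list)

# Order on main: mark the cache point first, then append the reminder.
msg = Message('user', [TextContent('actual user request')])
msg.content[-1].cache_prompt = True                      # marks the stable content
msg.content.append(TextContent('ENVIRONMENT REMINDER: 14 turns left'))

# Order on this branch: append the reminder first, then mark content[-1].
msg2 = Message('user', [TextContent('actual user request')])
msg2.content.append(TextContent('ENVIRONMENT REMINDER: 14 turns left'))
msg2.content[-1].cache_prompt = True                      # marks the reminder, which
                                                          # changes every turn, so the
                                                          # cache never hits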
Ohhhh, I see what you're saying :/
Let me bring this up in Slack.
Re: testing. It's Anthropic-only, and we have to send a few more messages than the first two. The console log should show both cache writes and cache hits.
We might also end up seeing whether, beyond the first ~4k tokens (system message, user message), it does or doesn't hit any more tokens.
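For reference, a standalone way to see those counters with the Anthropic SDK directly (this is just a sketch, not how OpenHands invokes the API; the model name and prompt are placeholders):

import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# Prompts below the minimum cacheable length (~1024 tokens for Sonnet) are not cached.
big_system_prompt = "very long, stable system instructions... " * 200

# On older SDK versions this lives under client.beta.prompt_caching.messages.create.
response = client.messages.create(
    model="claude-3-5-sonnet-20241022",
    max_tokens=256,
    system=[{
        "type": "text",
        "text": big_system_prompt,
        "cache_control": {"type": "ephemeral"},
    }],
    messages=[{"role": "user", "content": "hello"}],
)

# A cache write shows up on the first call, a cache hit on repeated calls.
print(response.usage.cache_creation_input_tokens)
print(response.usage.cache_read_input_tokens)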
openhands/utils/prompt.py
Outdated
)
return rendered.strip()
if len(micro_agent_prompts) > 0:
    micro_text = "EXTRA INFO: the following information has been included based on a keyword match. It may or may not be relevant to the user's request.\n\n"
This may sound like magical thinking, but since we're at it on other branches: the Anthropic prompts everywhere seem to show heavy use of XML tags. While this isn't exactly news, the latest batch of reveals seems worth trying more, and hey, they probably don't hurt anyway. We could wrap this like <EXTRA_INFO> </EXTRA_INFO>?
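A sketch of what that could look like in the snippet above (assuming micro_agent_prompts is a list of matched microagent texts; the exact wording is illustrative):

if len(micro_agent_prompts) > 0:
    micro_text = (
        '<EXTRA_INFO>\n'
        'The following information has been included based on a keyword match. '
        "It may or may not be relevant to the user's request.\n\n"
    )
    micro_text += '\n\n'.join(micro_agent_prompts)
    micro_text += '\n</EXTRA_INFO>'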
On a side note, I feel like it's becoming more important to figure out what to do with other LLMs... when we optimize for one, it perhaps shouldn't come as a surprise that it seems to win "big". I'm not talking about tool use; that one, I think, may yet prove very good for all LLMs that support it. But apart from tool use itself, we also do stuff like this. ^ 🤷
End-user friendly description of the problem this fixes or functionality that this introduces
Better support for pushing changes to GitHub
Give a summary of what the PR does, explaining any non-trivial design decisions
This redesigns the way that microagents plug into CodeAct: microagent prompts are now selected by keyword matching against the most recent user message.
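As a rough illustration of the selection step (names like MicroAgent and select_micro_agents are placeholders, not the actual OpenHands API):

from dataclasses import dataclass

@dataclass
class MicroAgent:
    name: str
    triggers: list[str]   # keywords that activate this microagent
    prompt: str           # text added to the EXTRA INFO block when triggered

def select_micro_agents(user_message: str, agents: list[MicroAgent]) -> list[str]:
    """Return the prompts of all microagents whose trigger keywords
    appear in the most recent user message (case-insensitive)."""
    text = user_message.lower()
    return [a.prompt for a in agents
            if any(t.lower() in text for t in a.triggers)]

# Example: a hypothetical GitHub microagent triggered by push-related keywords.
github_agent = MicroAgent('github', ['github', 'push', 'pull request'],
                          'Use the GITHUB_TOKEN environment variable to push...')
print(select_micro_agents('Please push my changes to GitHub', [github_agent]))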
Link of any specific issues this addresses