Delegation fixes #6165

enyst · 2025-01-09T07:30:37Z

End-user friendly description of the problem this fixes or functionality that this introduces

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below
Fix agent delegation; use events for communication between parent and delegates.
Fix the lockup when the model returned a message.

Give a summary of what the PR does, explaining any non-trivial design decisions

Delegation broke after we made the agent loop rely exclusively on controller-as-observer logic. This PR proposes a simple fix: the parent forwards events to the delegate.

Refactor the current logic (unsubscribing the parent when a delegate starts, and vice versa): now ONLY the parent is subscribed and stays subscribed, and it forwards events to the delegate when it has one.
Step (should_step) on MessageActions from both 'user' and 'agent', except when waiting for user input is explicitly set.
Step on DelegateAction too; this creates a MessageAction to kickstart the delegate.
Refactor the ending conditions.
Add integration tests for DelegatorAgent.
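
The should_step rules listed above can be sketched roughly as follows. This is a minimal illustration, not the code in agent_controller.py: the classes are simplified stand-ins, the `wait_for_response` field name is an assumption, and every other event type is treated as "do not step" purely to keep the sketch short.

```python
from dataclasses import dataclass, field


@dataclass
class MessageAction:
    """Stand-in for a message event; not the real OpenHands class."""
    content: str
    source: str = "user"             # 'user' or 'agent'
    wait_for_response: bool = False  # assumed name for "waiting for user input"


@dataclass
class DelegateAction:
    """Stand-in for the action that asks the parent to start a delegate."""
    agent: str
    inputs: dict = field(default_factory=dict)


def should_step(event: object) -> bool:
    """Decide whether the controller should take a step for this event."""
    if isinstance(event, MessageAction):
        # Step on messages from both 'user' and 'agent', unless the message
        # explicitly says the agent is waiting for user input.
        return not event.wait_for_response
    if isinstance(event, DelegateAction):
        # Step so the parent can create the message that kickstarts the delegate.
        return True
    # Simplification: all other events are ignored in this sketch.
    return False
```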

Also:

the delegate starts with a MessageAction
and ends with a FinishAction

The code is ready for review, or at least the delegation logic is.
(please ignore the print() stuff, will clean up later)
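
To make the forwarding idea in this description concrete, here is a small sketch under simplifying assumptions. `Event`, `Controller`, `on_event`, and the string event kinds are hypothetical names for illustration only; in the real code the events go through the event stream, which this sketch replaces with direct method calls.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Event:
    kind: str          # "message", "delegate", or "finish" (stand-ins for the action types)
    content: str = ""


@dataclass
class Controller:
    name: str
    is_delegate: bool = False
    delegate: Optional["Controller"] = None
    seen: List[str] = field(default_factory=list)

    def on_event(self, event: Event) -> None:
        # Only the root parent receives events from the stream. While it has
        # a delegate, it forwards every event instead of handling it itself.
        if self.delegate is not None:
            self.delegate.on_event(event)
            if event.kind == "finish":
                # The delegate ends with a finish event; the parent resumes.
                self.delegate = None
            return
        self.seen.append(event.kind)
        if event.kind == "delegate":
            # The delegate starts with a message that carries its task.
            self.delegate = Controller(name=event.content, is_delegate=True)
            self.delegate.on_event(Event("message", "task from parent"))


parent = Controller("parent")
parent.on_event(Event("message", "fix the typo"))
parent.on_event(Event("delegate", "worker"))
worker = parent.delegate
parent.on_event(Event("finish"))
print(parent.seen, worker.seen)
```

The parent never unsubscribes in this sketch; it only decides, per event, whether to handle the event or pass it on.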

Link of any specific issues this addresses
Fix #6162

To run this PR locally, use the following command:

docker run -it --rm   -p 3000:3000   -v /var/run/docker.sock:/var/run/docker.sock   --add-host host.docker.internal:host-gateway   -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:0f00ea6-nikolaik   --name openhands-app-0f00ea6   docker.all-hands.dev/all-hands-ai/openhands:0f00ea6

enyst · 2025-01-09T08:30:22Z

@openhands-agent Read the diff of this PR carefully. Understand what it tries to achieve. Then, we have two things to do:

The diff has added debug prints. We need them! And we need to enhance them:

all events have an event.id, add it to the print() statements after the class name, like '({event.id})'

The unit tests in test_agent_controller.py are outdated and failing. Update them to the new behavior. Understand the difference in the context of the changes of this PR and fix them.

Important:
You don't need to test the rest. Just this test file.
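
For illustration, the requested print format (class name followed by the event id in parentheses) might look like the sketch below. The `Event` and `MessageAction` classes are simplified stand-ins, and `debug_label` is a hypothetical helper that does not come from the PR.

```python
from dataclasses import dataclass


@dataclass
class Event:
    id: int


@dataclass
class MessageAction(Event):
    content: str = ""


def debug_label(event: Event) -> str:
    """Format an event as 'ClassName (id)' for debug prints."""
    return f"{type(event).__name__} ({event.id})"


print(f"[controller] received {debug_label(MessageAction(id=7, content='hi'))}")
```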

github-actions · 2025-01-09T16:47:31Z

Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly.

github-actions · 2025-01-10T02:58:13Z

Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly.

github-actions · 2025-01-10T03:02:21Z

Trigger by: Pull Request (integration-test label on PR #6165)
Commit: 01c459a
Integration Tests Report (Haiku)
Haiku LLM Test Results:

uncomment me

Integration Tests Report (DeepSeek)
DeepSeek LLM Test Results:

uncomment me

Integration Tests Report Delegator (Haiku)
Success rate: 50.00% (1/2)

Total cost: USD 0.00

instance_id	success	reason	cost	error_message
t02_add_bash_hello	True		0	nan
t01_fix_simple_typo	False	File not fixed: This is a silly text. Really? No more text! Enjoy!	0	RuntimeError: Agent reached maximum iteration in headless mode. Current iteration: 30, max iteration: 30

Integration Tests Report Delegator (DeepSeek)
Success rate: 100.00% (2/2)

Total cost: USD 0.00

instance_id	success	reason	cost	error_message
t01_fix_simple_typo	True		0
t02_add_bash_hello	True		0

Download testing outputs (includes both Haiku and DeepSeek results): Download

li-boxuan

Thanks for working on this fix!

li-boxuan · 2025-01-10T05:31:05Z

openhands/controller/agent_controller.py

@@ -165,7 +169,11 @@ async def close(self) -> None:
        )

        # unsubscribe from the event stream
-        self.event_stream.unsubscribe(EventStreamSubscriber.AGENT_CONTROLLER, self.id)
+        # only the root parent controller subscribes to the event stream
+        if not self.is_delegate:


li-boxuan · 2025-01-10

Question: does this design work for multi-layer delegation?

enyst · 2025-01-10

No. I thought about keeping that, but I don't know how useful it really is in practice, so I'd leave it for a follow-up. I think it did work before, but I don't think we ever tested for it? Do you find it useful?

A use case that keeps coming up is planner/executor workflows (#3770 is one example among several). Those don't need multi-layer delegation, but I would love for us to support them well.

diwu-sf · 2025-01-10

It would be nice to have a more general solution that can handle a tree of multi-level delegation.
We don't use multi-level delegation right now, but it would be annoying if we later had to hack up the framework to make it work and ended up diverging the agent_controller implementation.

li-boxuan · 2025-01-10

> I think it did work before, but I don't think we ever tested for it?

Yeah, it did work before, and I think we had a test for it: the dummy agent used to delegate to itself, then to itself, and then to itself again...

li-boxuan · 2025-01-10

> Do you find it useful?

Not really 😁 In my imaginary setting it wasn't useful, because LLMs were not powerful enough for super long-horizon tasks (e.g. found a company and release a product). Could it be? Maybe, maybe not.

I am just babbling: maybe a multi-layer delegation setting would be more useful in robotics or industrial engineering, where "agents" really don't care about the past and future, and there are very narrow but different action spaces that they must follow.

Or maybe this: a very intelligent yet expensive agent that makes decisions and hands over to good, not-too-expensive agents, which sometimes need some work done by mediocre, cheap agents.

diwu-sf · 2025-01-10

If CodeAct ever decides to delegate to BrowsingAgent again, or to a micro agent, then we will need multi-layer delegation for anything that uses planner -> CodeAct -> Browsing/Micro.

It doesn't need to be crazy deep, but needing to support 2 to 3 layers doesn't seem that unlikely.

enyst · 2025-01-10

Those are good points! I think it will work fine. This PR changes the behavior at the edges, so that the delegated agent is exactly like a non-delegated agent: no special handling is necessary to become a delegate or to delegate, because both use the event stream.

For example, if you want to use CodeAct, you send a delegate action asking for CodeAct, and you don't have to modify the agent for it to work. If an agent knows about the delegate tool, it can use it and become a parent; if not, it's just a kid. 😅

I think that makes it easier to add depth, not harder. I kept it simple to see and test the flow, but it should just be a matter of making use of the level counter. (Famous last words? 🤣) Will look!
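
As a rough illustration of how the forwarding idea discussed in this thread could extend to a chain such as planner -> CodeAct -> Browsing, here is a hypothetical sketch. `DelegationNode`, `start_delegate`, `route`, and `finish_deepest` are invented for this example and are not part of the PR; the only ideas taken from the thread are that each controller forwards to its delegate and that a level counter tracks depth.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DelegationNode:
    agent: str
    level: int = 0  # 0 is the root parent, the only stream subscriber
    delegate: Optional["DelegationNode"] = None

    def start_delegate(self, agent: str) -> "DelegationNode":
        """Attach a delegate one level below this node."""
        self.delegate = DelegationNode(agent=agent, level=self.level + 1)
        return self.delegate

    def route(self) -> str:
        """Name of the agent that handles the next event: the deepest active delegate."""
        if self.delegate is not None:
            return self.delegate.route()
        return self.agent

    def finish_deepest(self) -> None:
        """Detach the deepest delegate, returning control to its parent."""
        if self.delegate is None:
            return
        if self.delegate.delegate is None:
            self.delegate = None
        else:
            self.delegate.finish_deepest()


root = DelegationNode("planner")
code = root.start_delegate("CodeAct")
browse = code.start_delegate("Browsing")
print(root.route(), browse.level)
```

Because every node applies the same forward-or-handle rule, adding depth in this sketch requires no special cases beyond the recursion.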

enyst added 6 commits January 9, 2025 08:27

refactor delegation

1c5b982

handle terminal states

588b25d

fix close

f1a4ac4

refactor delegate end

382f94c

debug

a6501b7

workflows

a1331b1

enyst marked this pull request as draft January 9, 2025 07:30

enyst added the integration-test label Jan 9, 2025

enyst commented on .github/workflows/integration-runner.yml Jan 9, 2025 (resolved)

enyst added the fix-me-experimental label Jan 9, 2025

enyst added 2 commits January 9, 2025 16:42

add missing haiku section

ea3ab1d

more clear trace

fca199a

All-Hands-AI deleted a comment from openhands-agent Jan 9, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 9, 2025

fix delegate end

0393f9b

All-Hands-AI deleted a comment from openhands-agent Jan 9, 2025

enyst added integration-test and removed integration-test labels Jan 9, 2025

enyst marked this pull request as ready for review January 9, 2025 16:47

enyst commented on openhands/controller/agent_controller.py Jan 9, 2025 (outdated, resolved)

enyst requested a review from rbren January 9, 2025 17:01

enyst commented on openhands/controller/agent_controller.py Jan 9, 2025 (resolved)

enyst requested a review from li-boxuan January 9, 2025 17:10

enyst added 2 commits January 9, 2025 18:35

Update openhands/controller/agent_controller.py

84fb5c5

fix delegate too early

f62b249

Merge branch 'main' of github.com:All-Hands-AI/OpenHands into enyst/delegation

24f15cd

enyst added integration-test and removed integration-test fix-me-experimental labels Jan 9, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 9, 2025

temporary run faster

d07f225

enyst added integration-test and removed integration-test labels Jan 9, 2025

fix results report

1fb90ec

enyst added integration-test and removed integration-test labels Jan 10, 2025

enyst added integration-test and removed integration-test labels Jan 10, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 10, 2025

enyst force-pushed the enyst/delegation branch from 0c734f3 to 870dd39 Compare January 10, 2025 02:47

All-Hands-AI deleted a comment from github-actions bot Jan 10, 2025

enyst added integration-test and removed integration-test labels Jan 10, 2025

fix iterations

0f00ea6

enyst force-pushed the enyst/delegation branch from 870dd39 to 0f00ea6 Compare January 10, 2025 02:57

All-Hands-AI deleted a comment from github-actions bot Jan 10, 2025

enyst added integration-test and removed integration-test labels Jan 10, 2025

li-boxuan approved these changes Jan 10, 2025
