fix: handle multiple task outputs correctly in conditional tasks #1937

devin-ai-integration · 2025-01-21T17:23:32Z

Fix Multiple Task Outputs in Conditional Tasks

Description

This PR fixes an issue where using multiple conditional tasks, or having two tasks before a conditional task, causes an IndexError in _handle_conditional_task. The root cause was that task outputs were being reset instead of accumulated, and the task output indexing was incorrect.

Changes

Fix IndexError in _handle_conditional_task by using first output instead of task_index - 1
Modify _execute_tasks to accumulate task outputs instead of resetting them
Update _create_crew_output to handle multiple outputs correctly
Add comprehensive tests for multiple tasks with conditional and multiple conditional tasks in sequence

Tests Added

test_multiple_tasks_with_conditional: Verifies that having multiple tasks before a conditional task works correctly
test_multiple_conditional_tasks: Verifies that having multiple conditional tasks in sequence works correctly

Link to Devin run: https://app.devin.ai/sessions/551953893ee74e29ad18839ccc39bb87

Closes #1928

- Fix IndexError in _handle_conditional_task by using first output - Modify _execute_tasks to accumulate task outputs instead of resetting - Update _create_crew_output to handle multiple outputs correctly - Add tests for multiple tasks with conditional and multiple conditional tasks Co-Authored-By: [email protected] <[email protected]>

devin-ai-integration · 2025-01-21T17:23:36Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add "(aside)" to your comment to have me ignore it.
Look at CI failures and help fix them

⚙️ Control Options:

Disable automatic comment and CI monitoring

joaomdmoura · 2025-01-21T17:26:03Z

Disclaimer: This review was made by a crew of AI Agents.

Code Review Comment for PR #1937

Overview

This pull request addresses key issues related to handling multiple task outputs in conditional tasks. The modifications improve both the logic in the source code and the coverage of test cases. Below are detailed insights regarding the code changes, historical context from related PRs, implications for related files, and suggestions for improvement.

1. src/crewai/crew.py Changes

a) Task Output Accumulation

Previous Code:

task_outputs = [task_output]

This implementation overwrote previous task outputs, leading to potential data loss.

Improved Code:

task_outputs.append(task_output)

✅ This is a positive change that successfully accumulates outputs from multiple tasks, preserving critical data.

b) Conditional Task Handling

Previous Code:

previous_output = task_outputs[task_index - 1] if task_outputs else None

This could lead to an IndexError with insufficient outputs available.

Improved Code:

previous_output = task_outputs[0] if task_outputs else None

⚠️ While this fixes the IndexError, it lacks clarity regarding which output is relevant to the conditional evaluation.

Recommendation:
To enhance clarity, consider refactoring to utilize the final output instead:

relevant_output = task_outputs[-1] if task_outputs else None

This ensures we assess the most relevant task output and promote code readability.

c) Crew Output Creation

Previous Code:

if len(task_outputs) != 1:
    raise ValueError("Something went wrong. Kickoff should return only one task output.")

This assumes a singular output exists, which conflicts with the new logic accommodating multiple outputs.

Improved Code:

final_task_output = task_outputs[-1] if task_outputs else None
if not final_task_output:
    raise ValueError("No task outputs available to create crew output.")

This change efficiently addresses multiple outputs but could benefit from clearer error messaging to improve user experience.

2. tests/crew_test.py Additions

Positive Aspects

Enhanced test coverage addressing various task scenarios indicates the commit's reliability.
Effective use of mocking ensures that the simulation of task execution is well-controlled and can be leveraged in different contexts.

Suggested Improvements

Consider implementing edge cases such as:

@pytest.mark.vcr(filter_headers=["authorization"])
def test_conditional_task_with_empty_outputs():
    """Test handling of conditional tasks with empty previous outputs"""
    ...
    with pytest.raises(ValueError, match="No previous output available"):
        crew.kickoff()

This would further test the robustness of the new logic under varying conditions.

3. General Recommendations

Error Handling: Incorporate more specific exception handling for different failure scenarios to simplify debugging.
Documentation: Add detailed docstrings explaining conditional execution flows and expected formats for task outputs.
Logging: Increase log detail around task execution decisions to better understand execution paths.
Type Safety: Strengthen type hinting and consider runtime type checks to catch issues early.

4. Implementation Concerns

Memory Management: The task_outputs list expands continuously; a cleanup strategy should be evaluated for long-running processes.
Performance: The increased complexity of handling multiple outputs might impact performance, particularly with larger task sets.
Maintainability: Abstracting conditional logic into a separate class could improve the maintainability of the code.

Conclusion

The proposed changes effectively rectify the handling of multiple task outputs, enhancing the overall functionality of the crew module. While the code structure is sound and aligns well with the intended functionality, further refinements in error handling, logging, documentation, and edge case scenarios remain recommended. These modifications also align with insights from related pull requests where similar challenges have been addressed, fostering a more resilient and maintainable codebase.

Co-Authored-By: [email protected] <[email protected]>

…y task check Co-Authored-By: [email protected] <[email protected]>

bhancockio · 2025-01-22T20:06:45Z

src/crewai/crew.py

@@ -783,7 +797,7 @@ def _handle_conditional_task(
            task_outputs = self._process_async_tasks(futures, was_replayed)
            futures.clear()

-        previous_output = task_outputs[task_index - 1] if task_outputs else None
+        previous_output = task_outputs[0] if task_outputs else None


Devin this should be previous_output = task_outputs[-1] if task_outputs else None instead.

Grab the last element in the list. not the first.

bhancockio · 2025-01-22T20:08:05Z

src/crewai/crew.py

@@ -918,7 +936,7 @@ def _create_crew_output(self, task_outputs: List[TaskOutput]) -> CrewOutput:
            raw=final_task_output.raw,
            pydantic=final_task_output.pydantic,
            json_dict=final_task_output.json_dict,
-            tasks_output=[task.output for task in self.tasks if task.output],
+            tasks_output=task_outputs,  # Keep all task outputs


Drop comment.

bhancockio · 2025-01-22T20:20:34Z

Devin, we need to add additional tests:

We should create a crew that has 3 conditional tasks and verify that the validation throws an error
We should verify that we are grabbing the proper previous_output in the _handle_conditional_task function.

…tion test Co-Authored-By: [email protected] <[email protected]>

…ub.com/crewAIInc/crewAI into devin/1737479945-fix-conditional-index

bhancockio · 2025-01-22T20:37:47Z

tests/crew_test.py

+
+    # Set up mocks to return different outputs for each task
+    with patch.object(Task, "execute_sync") as mock_execute:
+        mock_execute.side_effect = [mock_first, mock_second, mock_first]  # Third value is for conditional task if it runs


Why are you doing [mock_first, mock_second, mock_first] instead of `[mock_first, mock_second, mock_conditional_task]

You aren't even really testing the conditional task at all. You also aren't checking that the conditional task is properly using the previous output.

The way you could update this task is to create 3 task:

Task 1: regular task

Task 2: Conditional Task where the condition is never met

Task 3: Conditional task where the condition is met.

In this test, you want to make sure that task 3 is getting the previous output from task 1 because that was the last successfully executed task.

bhancockio · 2025-01-22T20:40:46Z

tests/crew_test.py

+        assert len(result.tasks_output) == 3
+
+@pytest.mark.vcr(filter_headers=["authorization"])
+def test_multiple_conditional_tasks():


This test is basically the exact same as the one above it so we don't really need it.

Instead, let's change this test to verify that the result we get from crew.kickoff() properly contains all the task outputs. For example, if we have 3 tasks where 1 is normal and 2 are conditional tasks and 1 of those conditioanl tasks is never triggered, the task_outputs from crew.kickoff should only have 2 task outputs.

Co-Authored-By: [email protected] <[email protected]>

devin-ai-integration bot and others added 3 commits January 21, 2025 17:39

feat: validate at least one non-conditional task and refine task outputs

4b767b2

Co-Authored-By: [email protected] <[email protected]>

Revert to single output in _create_crew_output; remove redundant empt…

200ecff

…y task check Co-Authored-By: [email protected] <[email protected]>

Merge branch 'main' into devin/1737479945-fix-conditional-index

83ceacb

bhancockio requested changes Jan 22, 2025

View reviewed changes

devin-ai-integration bot and others added 2 commits January 22, 2025 20:28

Address PR feedback: use last output in conditional tasks, add valida…

9148e99

…tion test Co-Authored-By: [email protected] <[email protected]>

Merge branch 'devin/1737479945-fix-conditional-index' of https://gith…

f641668

…ub.com/crewAIInc/crewAI into devin/1737479945-fix-conditional-index

bhancockio requested changes Jan 22, 2025

View reviewed changes

Address PR feedback: updated conditional tasks tests and indexing

b21e492

Co-Authored-By: [email protected] <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: handle multiple task outputs correctly in conditional tasks #1937

fix: handle multiple task outputs correctly in conditional tasks #1937

devin-ai-integration bot commented Jan 21, 2025 •

edited by bhancockio

Loading

devin-ai-integration bot commented Jan 21, 2025

joaomdmoura commented Jan 21, 2025

bhancockio Jan 22, 2025

bhancockio Jan 22, 2025

bhancockio commented Jan 22, 2025

bhancockio Jan 22, 2025

bhancockio Jan 22, 2025

fix: handle multiple task outputs correctly in conditional tasks #1937

Are you sure you want to change the base?

fix: handle multiple task outputs correctly in conditional tasks #1937

Conversation

devin-ai-integration bot commented Jan 21, 2025 • edited by bhancockio Loading

Fix Multiple Task Outputs in Conditional Tasks

Description

Changes

Tests Added

devin-ai-integration bot commented Jan 21, 2025

🤖 Devin AI Engineer

joaomdmoura commented Jan 21, 2025

Code Review Comment for PR #1937

Overview

1. src/crewai/crew.py Changes

a) Task Output Accumulation

b) Conditional Task Handling

c) Crew Output Creation

2. tests/crew_test.py Additions

Positive Aspects

Suggested Improvements

3. General Recommendations

4. Implementation Concerns

Conclusion

bhancockio Jan 22, 2025

Choose a reason for hiding this comment

bhancockio Jan 22, 2025

Choose a reason for hiding this comment

bhancockio commented Jan 22, 2025

bhancockio Jan 22, 2025

Choose a reason for hiding this comment

bhancockio Jan 22, 2025

Choose a reason for hiding this comment

devin-ai-integration bot commented Jan 21, 2025 •

edited by bhancockio

Loading