Feature/kickoff consistent output (#847)

* Cleaned up task execution to now have separate paths for async and sync execution. Updating all kickoff functions to return CrewOutput. WIP. Waiting for Joao feedback on async task execution with task_output * Consistently storing async and sync output for context * outline tests I need to create going forward * Major rehaul of TaskOutput and CrewOutput. Updated all tests to work with new change. Need to add in a few final tricky async tests and add a few more to verify output types on TaskOutput and CrewOutput. * Encountering issues with callback. Need to test on main. WIP * working on tests. WIP * WIP. Figuring out disconnect issue. * Cleaned up logs now that I've isolated the issue to the LLM * more wip. * WIP. It looks like usage metrics has always been broken for async * Update parent crew who is managing for_each loop * Merge in main to bugfix/kickoff-for-each-usage-metrics * Clean up code for review * Add new tests * Final cleanup. Ready for review. * Moving copy functionality from Agent to BaseAgent * Fix renaming issue * Fix linting errors * use BaseAgent instead of Agent where applicable * Fixing missing function. Working on tests. * WIP. Needing team to review change * Fixing issues brought about by merge * WIP * Implement major fixes from yesterdays group conversation. Now working on tests. * The majority of tasks are working now. Need to fix converter class * Fix final failing test * Fix linting and type-checker issues * Add more tests to fully test CrewOutput and TaskOutput changes * Add in validation for async cannot depend on other async tasks. * Update validators and tests
2026-01-10 16:48:30 +00:00 · 2024-07-10 23:35:02 -04:00
parent 691b094a40
commit 7b53457ef3
30 changed files with 260509 additions and 697 deletions
--- a/tests/agent_test.py
+++ b/tests/agent_test.py
@@ -631,8 +631,9 @@ def test_agent_use_specific_tasks_output_as_context(capsys):

    crew = Crew(agents=[agent1, agent2], tasks=tasks)
    result = crew.kickoff()
-    assert "bye" not in result.lower()
-    assert "hi" in result.lower() or "hello" in result.lower()
+    print("LOWER RESULT", result.raw)
+    assert "bye" not in result.raw.lower()
+    assert "hi" in result.raw.lower() or "hello" in result.raw.lower()


@pytest.mark.vcr(filter_headers=["authorization"])
@@ -644,7 +645,7 @@ def test_agent_step_callback():
    with patch.object(StepCallback, "callback") as callback:

        @tool
-        def learn_about_AI(topic) -> float:
+        def learn_about_AI(topic) -> str:
            """Useful for when you need to learn about AI to write an paragraph about it."""
            return "AI is a very broad field."

@@ -678,7 +679,7 @@ def test_agent_function_calling_llm():
    with patch.object(llm.client, "create", wraps=llm.client.create) as private_mock:

        @tool
-        def learn_about_AI(topic) -> float:
+        def learn_about_AI(topic) -> str:
            """Useful for when you need to learn about AI to write an paragraph about it."""
            return "AI is a very broad field."

@@ -750,7 +751,8 @@ def test_tool_result_as_answer_is_the_final_answer_for_the_agent():
    crew = Crew(agents=[agent1], tasks=tasks)

    result = crew.kickoff()
-    assert result == "Howdy!"
+    print("RESULT: ", result.raw)
+    assert result.raw == "Howdy!"


@pytest.mark.vcr(filter_headers=["authorization"])