Lorenze/enh decouple executor from crew (#4209)

* wip restructuring agent executor and liteagent

* fix: handle None task in AgentExecutor to prevent errors

  Added a check to ensure that if the task is None, the method returns early without attempting to access task properties. This change improves the robustness of the AgentExecutor by preventing potential errors when the task is not set.

* refactor: streamline AgentExecutor initialization by removing redundant parameters

  Updated the Agent class to simplify the initialization of the AgentExecutor by removing unnecessary task and crew parameters in standalone mode. This change enhances code clarity and maintains backward compatibility by ensuring that the executor is correctly configured without redundant assignments.

* ensure executors work inside a flow due to flow in flow async structure

* refactor: enhance agent kickoff preparation by separating common logic

  Updated the Agent class to introduce a new private method that consolidates the common setup logic for both synchronous and asynchronous kickoff executions. This change improves code clarity and maintainability by reducing redundancy in the kickoff process, while ensuring that the agent can still execute effectively within both standalone and flow contexts.

* linting and tests

* fix test

* refactor: improve test for Agent kickoff parameters

  Updated the test for the Agent class to ensure that the kickoff method correctly preserves parameters. The test now verifies the configuration of the agent after kickoff, enhancing clarity and maintainability. Additionally, the test for asynchronous kickoff within a flow context has been updated to reflect the Agent class instead of LiteAgent.

* refactor: update test task guardrail process output for improved validation

  Refactored the test for task guardrail process output to enhance the validation of the output against the OpenAPI schema. The changes include a more structured request body and updated response handling to ensure compliance with the guardrail requirements. This update aims to improve the clarity and reliability of the test cases, ensuring that task outputs are correctly validated and feedback is appropriately provided.

* test fix cassette

* test fix cassette

* working

* working cassette

* refactor: streamline agent execution and enhance flow compatibility

  Refactored the Agent class to simplify the execution method by removing the event loop check and clarifying the behavior when called from synchronous and asynchronous contexts. The changes ensure that the method operates seamlessly within flow methods, improving clarity in the documentation. Additionally, updated the AgentExecutor to set the response model to None, enhancing flexibility. New test cassettes were added to validate the functionality of agents within flow contexts, ensuring robust testing for both synchronous and asynchronous operations.

* fixed cassette

* Enhance Flow Execution Logic

  - Introduced conditional execution for start methods in the Flow class.
  - Unconditional start methods are prioritized during kickoff, while conditional starts are executed only if no unconditional starts are present.
  - Improved handling of cyclic flows by allowing re-execution of conditional start methods triggered by routers.
  - Added checks to continue execution chains for completed conditional starts.

  These changes improve the flexibility and control of flow execution, ensuring that the correct methods are triggered based on the defined conditions.

* Enhance Agent and Flow Execution Logic

  - Updated the Agent class to automatically detect the event loop and return a coroutine when called within a Flow, simplifying async handling for users.
  - Modified Flow class to execute listeners sequentially, preventing race conditions on shared state during listener execution.
  - Improved handling of coroutine results from synchronous methods, ensuring proper execution flow and state management.

  These changes enhance the overall execution logic and user experience when working with agents and flows in CrewAI.

* Enhance Flow Listener Logic and Agent Imports

  - Updated the Flow class to track fired OR listeners, ensuring that multi-source OR listeners only trigger once during execution. This prevents redundant executions and improves flow efficiency.
  - Cleared fired OR listeners during cyclic flow resets to allow re-execution in new cycles.
  - Modified the Agent class imports to include Coroutine from collections.abc, enhancing type handling for asynchronous operations.

  These changes improve the control and performance of flow execution in CrewAI, ensuring more predictable behavior in complex scenarios.

* adjusted test due to new cassette

* ensure we don't finalize batch on just a liteagent finishing

* feat: cancellable parallelized flow methods

* feat: allow methods to be cancelled & run parallelized

* feat: ensure state is thread safe through proxy

* fix: check for proxy state

* fix: mimic BaseModel method

* chore: update final attr checks; test

* better description

* fix test

* chore: update test assumptions

* extra

---------

Co-authored-by: Greyson LaLonde <greyson.r.lalonde@gmail.com>
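
The commit message describes a kickoff that detects a running event loop and returns a coroutine when called inside a Flow. The snippet below is only a minimal sketch of that general pattern using the standard library; `kickoff`, `_run_async`, and `flow_like_caller` are hypothetical stand-ins, not the actual CrewAI implementation.

```python
import asyncio
from collections.abc import Coroutine
from typing import Any, Union


async def _run_async(prompt: str) -> str:
    # Hypothetical stand-in for the real asynchronous agent execution.
    await asyncio.sleep(0)
    return f"answer to: {prompt}"


def kickoff(prompt: str) -> Union[str, Coroutine[Any, Any, str]]:
    """Return a result directly, or a coroutine if a loop is already running."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        # No running loop (standalone use): drive the coroutine to completion.
        return asyncio.run(_run_async(prompt))
    # A loop is running (for example inside a flow method): hand back the
    # coroutine so the surrounding framework can await it.
    return _run_async(prompt)


async def flow_like_caller() -> str:
    # Plays the role of a framework that awaits coroutine results
    # returned by otherwise synchronous-looking calls.
    result = kickoff("2+2")
    if asyncio.iscoroutine(result):
        result = await result
    return result
```

Called from plain synchronous code, `kickoff("hi")` returns the string itself; called while a loop is running, it returns a coroutine that the caller is expected to await. The trade-off of this design is a return type that depends on the calling context.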
2026-05-03 00:02:36 +00:00 · 2026-01-20 21:44:45 -08:00
parent b267bb4054
commit 741bf12bf4
21 changed files with 3145 additions and 1376 deletions
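
The commit message notes that multi-source OR listeners should trigger only once per execution and that the fired set is cleared on cyclic resets. As a rough illustration of that bookkeeping, assuming a hypothetical `OrListenerRegistry` class that is not part of CrewAI:

```python
from collections import defaultdict


class OrListenerRegistry:
    """Hypothetical registry: each OR listener runs at most once per cycle."""

    def __init__(self):
        self._sources = {}             # listener name -> set of trigger names
        self._fired = set()            # OR listeners already run in this cycle
        self.calls = defaultdict(int)  # how many times each listener ran

    def register(self, listener, *sources):
        self._sources[listener] = set(sources)

    def trigger(self, source):
        """Run every listener subscribed to `source` that has not fired yet."""
        ran = []
        for listener, sources in self._sources.items():
            if source in sources and listener not in self._fired:
                self._fired.add(listener)
                self.calls[listener] += 1
                ran.append(listener)
        return ran

    def reset_cycle(self):
        # Clearing the fired set lets listeners run again in a new cycle.
        self._fired.clear()


registry = OrListenerRegistry()
registry.register("summarize", "fetch_a", "fetch_b")
first = registry.trigger("fetch_a")   # listener fires on its first source
second = registry.trigger("fetch_b")  # second source is ignored this cycle
registry.reset_cycle()
third = registry.trigger("fetch_b")   # fires again after the reset
```

Here `summarize` runs once even though both of its sources complete, and it becomes eligible again only after `reset_cycle()`.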
--- a/lib/crewai/tests/agents/test_crew_agent_executor_flow.py
+++ b/lib/crewai/tests/agents/test_crew_agent_executor_flow.py
@@ -1,4 +1,4 @@
-"""Unit tests for CrewAgentExecutorFlow.
+"""Unit tests for AgentExecutor.

 Tests the Flow-based agent executor implementation including state management,
 flow methods, routing logic, and error handling.
@@ -8,9 +8,9 @@ from unittest.mock import Mock, patch

 import pytest

-from crewai.experimental.crew_agent_executor_flow import (
+from crewai.experimental.agent_executor import (
    AgentReActState,
-    CrewAgentExecutorFlow,
+    AgentExecutor,
 )
 from crewai.agents.parser import AgentAction, AgentFinish

@@ -43,8 +43,8 @@ class TestAgentReActState:
        assert state.ask_for_human_input is True


-class TestCrewAgentExecutorFlow:
-    """Test CrewAgentExecutorFlow class."""
+class TestAgentExecutor:
+    """Test AgentExecutor class."""

    @pytest.fixture
    def mock_dependencies(self):
@@ -87,8 +87,8 @@ class TestCrewAgentExecutorFlow:
        }

    def test_executor_initialization(self, mock_dependencies):
-        """Test CrewAgentExecutorFlow initialization."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        """Test AgentExecutor initialization."""
+        executor = AgentExecutor(**mock_dependencies)

        assert executor.llm == mock_dependencies["llm"]
        assert executor.task == mock_dependencies["task"]
@@ -100,9 +100,9 @@ class TestCrewAgentExecutorFlow:
    def test_initialize_reasoning(self, mock_dependencies):
        """Test flow entry point."""
        with patch.object(
-            CrewAgentExecutorFlow, "_show_start_logs"
+            AgentExecutor, "_show_start_logs"
        ) as mock_show_start:
-            executor = CrewAgentExecutorFlow(**mock_dependencies)
+            executor = AgentExecutor(**mock_dependencies)
            result = executor.initialize_reasoning()

            assert result == "initialized"
@@ -110,7 +110,7 @@ class TestCrewAgentExecutorFlow:

    def test_check_max_iterations_not_reached(self, mock_dependencies):
        """Test routing when iterations < max."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.iterations = 5

        result = executor.check_max_iterations()
@@ -118,7 +118,7 @@ class TestCrewAgentExecutorFlow:

    def test_check_max_iterations_reached(self, mock_dependencies):
        """Test routing when iterations >= max."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.iterations = 10

        result = executor.check_max_iterations()
@@ -126,7 +126,7 @@ class TestCrewAgentExecutorFlow:

    def test_route_by_answer_type_action(self, mock_dependencies):
        """Test routing for AgentAction."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.current_answer = AgentAction(
            thought="thinking", tool="search", tool_input="query", text="action text"
        )
@@ -136,7 +136,7 @@ class TestCrewAgentExecutorFlow:

    def test_route_by_answer_type_finish(self, mock_dependencies):
        """Test routing for AgentFinish."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.current_answer = AgentFinish(
            thought="final thoughts", output="Final answer", text="complete"
        )
@@ -146,7 +146,7 @@ class TestCrewAgentExecutorFlow:

    def test_continue_iteration(self, mock_dependencies):
        """Test iteration continuation."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)

        result = executor.continue_iteration()

@@ -154,8 +154,8 @@ class TestCrewAgentExecutorFlow:

    def test_finalize_success(self, mock_dependencies):
        """Test finalize with valid AgentFinish."""
-        with patch.object(CrewAgentExecutorFlow, "_show_logs") as mock_show_logs:
-            executor = CrewAgentExecutorFlow(**mock_dependencies)
+        with patch.object(AgentExecutor, "_show_logs") as mock_show_logs:
+            executor = AgentExecutor(**mock_dependencies)
            executor.state.current_answer = AgentFinish(
                thought="final thinking", output="Done", text="complete"
            )
@@ -168,7 +168,7 @@ class TestCrewAgentExecutorFlow:

    def test_finalize_failure(self, mock_dependencies):
        """Test finalize skips when given AgentAction instead of AgentFinish."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.current_answer = AgentAction(
            thought="thinking", tool="search", tool_input="query", text="action text"
        )
@@ -181,7 +181,7 @@ class TestCrewAgentExecutorFlow:

    def test_format_prompt(self, mock_dependencies):
        """Test prompt formatting."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        inputs = {"input": "test input", "tool_names": "tool1, tool2", "tools": "desc"}

        result = executor._format_prompt("Prompt {input} {tool_names} {tools}", inputs)
@@ -192,18 +192,18 @@ class TestCrewAgentExecutorFlow:

    def test_is_training_mode_false(self, mock_dependencies):
        """Test training mode detection when not in training."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        assert executor._is_training_mode() is False

    def test_is_training_mode_true(self, mock_dependencies):
        """Test training mode detection when in training."""
        mock_dependencies["crew"]._train = True
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        assert executor._is_training_mode() is True

    def test_append_message_to_state(self, mock_dependencies):
        """Test message appending to state."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        initial_count = len(executor.state.messages)

        executor._append_message_to_state("test message")
@@ -216,7 +216,7 @@ class TestCrewAgentExecutorFlow:
        callback = Mock()
        mock_dependencies["step_callback"] = callback

-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        answer = AgentFinish(thought="thinking", output="test", text="final")

        executor._invoke_step_callback(answer)
@@ -226,14 +226,14 @@ class TestCrewAgentExecutorFlow:
    def test_invoke_step_callback_none(self, mock_dependencies):
        """Test step callback when none provided."""
        mock_dependencies["step_callback"] = None
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)

        # Should not raise error
        executor._invoke_step_callback(
            AgentFinish(thought="thinking", output="test", text="final")
        )

-    @patch("crewai.experimental.crew_agent_executor_flow.handle_output_parser_exception")
+    @patch("crewai.experimental.agent_executor.handle_output_parser_exception")
    def test_recover_from_parser_error(
        self, mock_handle_exception, mock_dependencies
    ):
@@ -242,7 +242,7 @@ class TestCrewAgentExecutorFlow:

        mock_handle_exception.return_value = None

-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor._last_parser_error = OutputParserError("test error")
        initial_iterations = executor.state.iterations

@@ -252,12 +252,12 @@ class TestCrewAgentExecutorFlow:
        assert executor.state.iterations == initial_iterations + 1
        mock_handle_exception.assert_called_once()

-    @patch("crewai.experimental.crew_agent_executor_flow.handle_context_length")
+    @patch("crewai.experimental.agent_executor.handle_context_length")
    def test_recover_from_context_length(
        self, mock_handle_context, mock_dependencies
    ):
        """Test recovery from context length error."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor._last_context_error = Exception("context too long")
        initial_iterations = executor.state.iterations

@@ -270,16 +270,16 @@ class TestCrewAgentExecutorFlow:
    def test_use_stop_words_property(self, mock_dependencies):
        """Test use_stop_words property."""
        mock_dependencies["llm"].supports_stop_words.return_value = True
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        assert executor.use_stop_words is True

        mock_dependencies["llm"].supports_stop_words.return_value = False
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        assert executor.use_stop_words is False

    def test_compatibility_properties(self, mock_dependencies):
        """Test compatibility properties for mixin."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.messages = [{"role": "user", "content": "test"}]
        executor.state.iterations = 5

@@ -321,8 +321,8 @@ class TestFlowErrorHandling:
            "tools_handler": Mock(),
        }

-    @patch("crewai.experimental.crew_agent_executor_flow.get_llm_response")
-    @patch("crewai.experimental.crew_agent_executor_flow.enforce_rpm_limit")
+    @patch("crewai.experimental.agent_executor.get_llm_response")
+    @patch("crewai.experimental.agent_executor.enforce_rpm_limit")
    def test_call_llm_parser_error(
        self, mock_enforce_rpm, mock_get_llm, mock_dependencies
    ):
@@ -332,15 +332,15 @@ class TestFlowErrorHandling:
        mock_enforce_rpm.return_value = None
        mock_get_llm.side_effect = OutputParserError("parse failed")

-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        result = executor.call_llm_and_parse()

        assert result == "parser_error"
        assert executor._last_parser_error is not None

-    @patch("crewai.experimental.crew_agent_executor_flow.get_llm_response")
-    @patch("crewai.experimental.crew_agent_executor_flow.enforce_rpm_limit")
-    @patch("crewai.experimental.crew_agent_executor_flow.is_context_length_exceeded")
+    @patch("crewai.experimental.agent_executor.get_llm_response")
+    @patch("crewai.experimental.agent_executor.enforce_rpm_limit")
+    @patch("crewai.experimental.agent_executor.is_context_length_exceeded")
    def test_call_llm_context_error(
        self,
        mock_is_context_exceeded,
@@ -353,7 +353,7 @@ class TestFlowErrorHandling:
        mock_get_llm.side_effect = Exception("context length")
        mock_is_context_exceeded.return_value = True

-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        result = executor.call_llm_and_parse()

        assert result == "context_error"
@@ -397,10 +397,10 @@ class TestFlowInvoke:
            "tools_handler": Mock(),
        }

-    @patch.object(CrewAgentExecutorFlow, "kickoff")
-    @patch.object(CrewAgentExecutorFlow, "_create_short_term_memory")
-    @patch.object(CrewAgentExecutorFlow, "_create_long_term_memory")
-    @patch.object(CrewAgentExecutorFlow, "_create_external_memory")
+    @patch.object(AgentExecutor, "kickoff")
+    @patch.object(AgentExecutor, "_create_short_term_memory")
+    @patch.object(AgentExecutor, "_create_long_term_memory")
+    @patch.object(AgentExecutor, "_create_external_memory")
    def test_invoke_success(
        self,
        mock_external_memory,
@@ -410,7 +410,7 @@ class TestFlowInvoke:
        mock_dependencies,
    ):
        """Test successful invoke without human feedback."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)

        # Mock kickoff to set the final answer in state
        def mock_kickoff_side_effect():
@@ -429,10 +429,10 @@ class TestFlowInvoke:
        mock_long_term_memory.assert_called_once()
        mock_external_memory.assert_called_once()

-    @patch.object(CrewAgentExecutorFlow, "kickoff")
+    @patch.object(AgentExecutor, "kickoff")
    def test_invoke_failure_no_agent_finish(self, mock_kickoff, mock_dependencies):
        """Test invoke fails without AgentFinish."""
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)
        executor.state.current_answer = AgentAction(
            thought="thinking", tool="test", tool_input="test", text="action text"
        )
@@ -442,10 +442,10 @@ class TestFlowInvoke:
        with pytest.raises(RuntimeError, match="without reaching a final answer"):
            executor.invoke(inputs)

-    @patch.object(CrewAgentExecutorFlow, "kickoff")
-    @patch.object(CrewAgentExecutorFlow, "_create_short_term_memory")
-    @patch.object(CrewAgentExecutorFlow, "_create_long_term_memory")
-    @patch.object(CrewAgentExecutorFlow, "_create_external_memory")
+    @patch.object(AgentExecutor, "kickoff")
+    @patch.object(AgentExecutor, "_create_short_term_memory")
+    @patch.object(AgentExecutor, "_create_long_term_memory")
+    @patch.object(AgentExecutor, "_create_external_memory")
    def test_invoke_with_system_prompt(
        self,
        mock_external_memory,
@@ -459,7 +459,7 @@ class TestFlowInvoke:
            "system": "System: {input}",
            "user": "User: {input} {tool_names} {tools}",
        }
-        executor = CrewAgentExecutorFlow(**mock_dependencies)
+        executor = AgentExecutor(**mock_dependencies)

        def mock_kickoff_side_effect():
            executor.state.current_answer = AgentFinish(
--- a/lib/crewai/tests/agents/test_lite_agent.py
+++ b/lib/crewai/tests/agents/test_lite_agent.py
@@ -72,62 +72,53 @@ class ResearchResult(BaseModel):

@pytest.mark.vcr()
@pytest.mark.parametrize("verbose", [True, False])
-def test_lite_agent_created_with_correct_parameters(monkeypatch, verbose):
-    """Test that LiteAgent is created with the correct parameters when Agent.kickoff() is called."""
+def test_agent_kickoff_preserves_parameters(verbose):
+    """Test that Agent.kickoff() uses the correct parameters from the Agent."""
    # Create a test agent with specific parameters
-    llm = LLM(model="gpt-4o-mini")
+    mock_llm = Mock(spec=LLM)
+    mock_llm.call.return_value = "Final Answer: Test response"
+    mock_llm.stop = []
+
+    from crewai.types.usage_metrics import UsageMetrics
+
+    mock_usage_metrics = UsageMetrics(
+        total_tokens=100,
+        prompt_tokens=50,
+        completion_tokens=50,
+        cached_prompt_tokens=0,
+        successful_requests=1,
+    )
+    mock_llm.get_token_usage_summary.return_value = mock_usage_metrics
+
    custom_tools = [WebSearchTool(), CalculatorTool()]
    max_iter = 10
-    max_execution_time = 300

    agent = Agent(
        role="Test Agent",
        goal="Test Goal",
        backstory="Test Backstory",
-        llm=llm,
+        llm=mock_llm,
        tools=custom_tools,
        max_iter=max_iter,
-        max_execution_time=max_execution_time,
        verbose=verbose,
    )

-    # Create a mock to capture the created LiteAgent
-    created_lite_agent = None
-    original_lite_agent = LiteAgent
+    # Call kickoff and verify it works
+    result = agent.kickoff("Test query")

-    # Define a mock LiteAgent class that captures its arguments
-    class MockLiteAgent(original_lite_agent):
-        def __init__(self, **kwargs):
-            nonlocal created_lite_agent
-            created_lite_agent = kwargs
-            super().__init__(**kwargs)
+    # Verify the agent was configured correctly
+    assert agent.role == "Test Agent"
+    assert agent.goal == "Test Goal"
+    assert agent.backstory == "Test Backstory"
+    assert len(agent.tools) == 2
+    assert isinstance(agent.tools[0], WebSearchTool)
+    assert isinstance(agent.tools[1], CalculatorTool)
+    assert agent.max_iter == max_iter
+    assert agent.verbose == verbose

-    # Patch the LiteAgent class
-    monkeypatch.setattr("crewai.agent.core.LiteAgent", MockLiteAgent)
-
-    # Call kickoff to create the LiteAgent
-    agent.kickoff("Test query")
-
-    # Verify all parameters were passed correctly
-    assert created_lite_agent is not None
-    assert created_lite_agent["role"] == "Test Agent"
-    assert created_lite_agent["goal"] == "Test Goal"
-    assert created_lite_agent["backstory"] == "Test Backstory"
-    assert created_lite_agent["llm"] == llm
-    assert len(created_lite_agent["tools"]) == 2
-    assert isinstance(created_lite_agent["tools"][0], WebSearchTool)
-    assert isinstance(created_lite_agent["tools"][1], CalculatorTool)
-    assert created_lite_agent["max_iterations"] == max_iter
-    assert created_lite_agent["max_execution_time"] == max_execution_time
-    assert created_lite_agent["verbose"] == verbose
-    assert created_lite_agent["response_format"] is None
-
-    # Test with a response_format
-    class TestResponse(BaseModel):
-        test_field: str
-
-    agent.kickoff("Test query", response_format=TestResponse)
-    assert created_lite_agent["response_format"] == TestResponse
+    # Verify kickoff returned a result
+    assert result is not None
+    assert result.raw is not None


@pytest.mark.vcr()
@@ -310,7 +301,8 @@ def verify_agent_parent_flow(result, agent, flow):


 def test_sets_parent_flow_when_inside_flow():
-    captured_agent = None
+    """Test that an Agent can be created and executed inside a Flow context."""
+    captured_event = None

    mock_llm = Mock(spec=LLM)
    mock_llm.call.return_value = "Test response"
@@ -343,15 +335,17 @@ def test_sets_parent_flow_when_inside_flow():
    event_received = threading.Event()

    @crewai_event_bus.on(LiteAgentExecutionStartedEvent)
-    def capture_agent(source, event):
-        nonlocal captured_agent
-        captured_agent = source
+    def capture_event(source, event):
+        nonlocal captured_event
+        captured_event = event
        event_received.set()

-    flow.kickoff()
+    result = flow.kickoff()

    assert event_received.wait(timeout=5), "Timeout waiting for agent execution event"
-    assert captured_agent.parent_flow is flow
+    assert captured_event is not None
+    assert captured_event.agent_info["role"] == "Test Agent"
+    assert result is not None


@pytest.mark.vcr()
@@ -373,16 +367,14 @@ def test_guardrail_is_called_using_string():

    @crewai_event_bus.on(LLMGuardrailStartedEvent)
    def capture_guardrail_started(source, event):
-        assert isinstance(source, LiteAgent)
-        assert source.original_agent == agent
+        assert isinstance(source, Agent)
        with condition:
            guardrail_events["started"].append(event)
            condition.notify()

    @crewai_event_bus.on(LLMGuardrailCompletedEvent)
    def capture_guardrail_completed(source, event):
-        assert isinstance(source, LiteAgent)
-        assert source.original_agent == agent
+        assert isinstance(source, Agent)
        with condition:
            guardrail_events["completed"].append(event)
            condition.notify()
@@ -683,3 +675,151 @@ def test_agent_kickoff_with_mcp_tools(mock_get_mcp_tools):

    # Verify MCP tools were retrieved
    mock_get_mcp_tools.assert_called_once_with("https://mcp.exa.ai/mcp?api_key=test_exa_key&profile=research")
+
+
+# ============================================================================
+# Tests for LiteAgent inside Flow (magic auto-async pattern)
+# ============================================================================
+
+from crewai.flow.flow import listen
+
+
+@pytest.mark.vcr()
+def test_lite_agent_inside_flow_sync():
+    """Test that LiteAgent.kickoff() works magically inside a Flow.
+
+    This tests the "magic auto-async" pattern where calling agent.kickoff()
+    from within a Flow automatically detects the event loop and returns a
+    coroutine that the Flow framework awaits. Users don't need to use async/await.
+    """
+    # Track execution
+    execution_log = []
+
+    class TestFlow(Flow):
+        @start()
+        def run_agent(self):
+            execution_log.append("flow_started")
+            agent = Agent(
+                role="Test Agent",
+                goal="Answer questions",
+                backstory="A helpful test assistant",
+                llm=LLM(model="gpt-4o-mini"),
+                verbose=False,
+            )
+            # Magic: just call kickoff() normally - it auto-detects Flow context
+            result = agent.kickoff(messages="What is 2+2? Reply with just the number.")
+            execution_log.append("agent_completed")
+            return result
+
+    flow = TestFlow()
+    result = flow.kickoff()
+
+    # Verify the flow executed successfully
+    assert "flow_started" in execution_log
+    assert "agent_completed" in execution_log
+    assert result is not None
+    assert isinstance(result, LiteAgentOutput)
+
+
+@pytest.mark.vcr()
+def test_lite_agent_inside_flow_with_tools():
+    """Test that LiteAgent with tools works correctly inside a Flow."""
+    class TestFlow(Flow):
+        @start()
+        def run_agent_with_tools(self):
+            agent = Agent(
+                role="Calculator Agent",
+                goal="Perform calculations",
+                backstory="A math expert",
+                llm=LLM(model="gpt-4o-mini"),
+                tools=[CalculatorTool()],
+                verbose=False,
+            )
+            result = agent.kickoff(messages="Calculate 10 * 5")
+            return result
+
+    flow = TestFlow()
+    result = flow.kickoff()
+
+    assert result is not None
+    assert isinstance(result, LiteAgentOutput)
+    assert result.raw is not None
+
+
+@pytest.mark.vcr()
+def test_multiple_agents_in_same_flow():
+    """Test that multiple LiteAgents can run sequentially in the same Flow."""
+    class MultiAgentFlow(Flow):
+        @start()
+        def first_step(self):
+            agent1 = Agent(
+                role="First Agent",
+                goal="Greet users",
+                backstory="A friendly greeter",
+                llm=LLM(model="gpt-4o-mini"),
+                verbose=False,
+            )
+            return agent1.kickoff(messages="Say hello")
+
+        @listen(first_step)
+        def second_step(self, first_result):
+            agent2 = Agent(
+                role="Second Agent",
+                goal="Say goodbye",
+                backstory="A polite farewell agent",
+                llm=LLM(model="gpt-4o-mini"),
+                verbose=False,
+            )
+            return agent2.kickoff(messages="Say goodbye")
+
+    flow = MultiAgentFlow()
+    result = flow.kickoff()
+
+    assert result is not None
+    assert isinstance(result, LiteAgentOutput)
+
+
+@pytest.mark.vcr()
+def test_lite_agent_kickoff_async_inside_flow():
+    """Test that Agent.kickoff_async() works correctly from async Flow methods."""
+    class AsyncAgentFlow(Flow):
+        @start()
+        async def async_agent_step(self):
+            agent = Agent(
+                role="Async Test Agent",
+                goal="Answer questions asynchronously",
+                backstory="An async helper",
+                llm=LLM(model="gpt-4o-mini"),
+                verbose=False,
+            )
+            result = await agent.kickoff_async(messages="What is 3+3?")
+            return result
+
+    flow = AsyncAgentFlow()
+    result = flow.kickoff()
+
+    assert result is not None
+    assert isinstance(result, LiteAgentOutput)
+
+
+@pytest.mark.vcr()
+def test_lite_agent_standalone_still_works():
+    """Test that LiteAgent.kickoff() still works normally outside of a Flow.
+
+    This verifies that the magic auto-async pattern doesn't break standalone usage
+    where there's no event loop running.
+    """
+    agent = Agent(
+        role="Standalone Agent",
+        goal="Answer questions",
+        backstory="A helpful assistant",
+        llm=LLM(model="gpt-4o-mini"),
+        verbose=False,
+    )
+
+    # This should work normally - no Flow, no event loop
+    result = agent.kickoff(messages="What is 5+5? Reply with just the number.")
+
+    assert result is not None
+    assert isinstance(result, LiteAgentOutput)
+    assert result.raw is not None
--- a/lib/crewai/tests/cassettes/agents/test_lite_agent_inside_flow_sync.yaml
+++ b/lib/crewai/tests/cassettes/agents/test_lite_agent_inside_flow_sync.yaml
@@ -0,0 +1,119 @@
+interactions:
+- request:
+    body: '{"messages":[{"role":"system","content":"You are Test Agent. A helpful
+      test assistant\nYour personal goal is: Answer questions\nTo give my best complete
+      final answer to the task respond using the exact following format:\n\nThought:
+      I now can give a great answer\nFinal Answer: Your final answer must be the great
+      and the most complete as possible, it must be outcome described.\n\nI MUST use
+      these formats, my job depends on it!"},{"role":"user","content":"\nCurrent Task:
+      What is 2+2? Reply with just the number.\n\nBegin! This is VERY important to
+      you, use the tools available and give your best Final Answer, your job depends
+      on it!\n\nThought:"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '673'
+      content-type:
+      - application/json
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-Cy7b0HjL79y39EkUcMLrRhPFe3XGj\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768444914,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"I now can give a great answer  \\nFinal
+        Answer: 4\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n
+        \     \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n
+        \ \"usage\": {\n    \"prompt_tokens\": 136,\n    \"completion_tokens\": 13,\n
+        \   \"total_tokens\": 149,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\":
+        0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_8bbc38b4db\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 02:41:55 GMT
+      Server:
+      - cloudflare
+      Set-Cookie:
+      - SET-COOKIE-XXX
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '857'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '341'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '358'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+version: 1
--- a/lib/crewai/tests/cassettes/agents/test_lite_agent_inside_flow_with_tools.yaml
+++ b/lib/crewai/tests/cassettes/agents/test_lite_agent_inside_flow_with_tools.yaml
@@ -0,0 +1,255 @@
+interactions:
+- request:
+    body: '{"messages":[{"role":"system","content":"You are Calculator Agent. A math
+      expert\nYour personal goal is: Perform calculations\nYou ONLY have access to
+      the following tools, and should NEVER make up tools that are not listed here:\n\nTool
+      Name: calculate\nTool Arguments: {\n  \"properties\": {\n    \"expression\":
+      {\n      \"title\": \"Expression\",\n      \"type\": \"string\"\n    }\n  },\n  \"required\":
+      [\n    \"expression\"\n  ],\n  \"title\": \"CalculatorToolSchema\",\n  \"type\":
+      \"object\",\n  \"additionalProperties\": false\n}\nTool Description: Calculate
+      the result of a mathematical expression.\n\nIMPORTANT: Use the following format
+      in your response:\n\n```\nThought: you should always think about what to do\nAction:
+      the action to take, only one name of [calculate], just the name, exactly as
+      it''s written.\nAction Input: the input to the action, just a simple JSON object,
+      enclosed in curly braces, using \" to wrap keys and values.\nObservation: the
+      result of the action\n```\n\nOnce all necessary information is gathered, return
+      the following format:\n\n```\nThought: I now know the final answer\nFinal Answer:
+      the final answer to the original input question\n```"},{"role":"user","content":"\nCurrent
+      Task: Calculate 10 * 5\n\nBegin! This is VERY important to you, use the tools
+      available and give your best Final Answer, your job depends on it!\n\nThought:"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '1403'
+      content-type:
+      - application/json
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-Cy7avghVPSpszLmlbHpwDQlWDoD6O\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768444909,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"Thought: I need to calculate the expression
+        10 * 5.\\nAction: calculate\\nAction Input: {\\\"expression\\\":\\\"10 * 5\\\"}\\nObservation:
+        50\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n
+        \     \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n
+        \ \"usage\": {\n    \"prompt_tokens\": 291,\n    \"completion_tokens\": 33,\n
+        \   \"total_tokens\": 324,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\":
+        0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_c4585b5b9c\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 02:41:49 GMT
+      Server:
+      - cloudflare
+      Set-Cookie:
+      - SET-COOKIE-XXX
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '939'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '579'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '598'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+- request:
+    body: '{"messages":[{"role":"system","content":"You are Calculator Agent. A math
+      expert\nYour personal goal is: Perform calculations\nYou ONLY have access to
+      the following tools, and should NEVER make up tools that are not listed here:\n\nTool
+      Name: calculate\nTool Arguments: {\n  \"properties\": {\n    \"expression\":
+      {\n      \"title\": \"Expression\",\n      \"type\": \"string\"\n    }\n  },\n  \"required\":
+      [\n    \"expression\"\n  ],\n  \"title\": \"CalculatorToolSchema\",\n  \"type\":
+      \"object\",\n  \"additionalProperties\": false\n}\nTool Description: Calculate
+      the result of a mathematical expression.\n\nIMPORTANT: Use the following format
+      in your response:\n\n```\nThought: you should always think about what to do\nAction:
+      the action to take, only one name of [calculate], just the name, exactly as
+      it''s written.\nAction Input: the input to the action, just a simple JSON object,
+      enclosed in curly braces, using \" to wrap keys and values.\nObservation: the
+      result of the action\n```\n\nOnce all necessary information is gathered, return
+      the following format:\n\n```\nThought: I now know the final answer\nFinal Answer:
+      the final answer to the original input question\n```"},{"role":"user","content":"\nCurrent
+      Task: Calculate 10 * 5\n\nBegin! This is VERY important to you, use the tools
+      available and give your best Final Answer, your job depends on it!\n\nThought:"},{"role":"assistant","content":"Thought:
+      I need to calculate the expression 10 * 5.\nAction: calculate\nAction Input:
+      {\"expression\":\"10 * 5\"}\nObservation: The result of 10 * 5 is 50"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '1591'
+      content-type:
+      - application/json
+      cookie:
+      - COOKIE-XXX
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-Cy7avDhDZCLvv8v2dh8ZQRrLdci6A\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768444909,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"Thought: I now know the final answer.\\nFinal
+        Answer: 50\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n
+        \     \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n
+        \ \"usage\": {\n    \"prompt_tokens\": 337,\n    \"completion_tokens\": 14,\n
+        \   \"total_tokens\": 351,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\":
+        0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_c4585b5b9c\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 02:41:50 GMT
+      Server:
+      - cloudflare
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '864'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '429'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '457'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+version: 1
--- a/lib/crewai/tests/cassettes/agents/test_lite_agent_kickoff_async_inside_flow.yaml
+++ b/lib/crewai/tests/cassettes/agents/test_lite_agent_kickoff_async_inside_flow.yaml
@@ -0,0 +1,119 @@
+interactions:
+- request:
+    body: '{"messages":[{"role":"system","content":"You are Async Test Agent. An async
+      helper\nYour personal goal is: Answer questions asynchronously\nTo give my best
+      complete final answer to the task respond using the exact following format:\n\nThought:
+      I now can give a great answer\nFinal Answer: Your final answer must be the great
+      and the most complete as possible, it must be outcome described.\n\nI MUST use
+      these formats, my job depends on it!"},{"role":"user","content":"\nCurrent Task:
+      What is 3+3?\n\nBegin! This is VERY important to you, use the tools available
+      and give your best Final Answer, your job depends on it!\n\nThought:"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '657'
+      content-type:
+      - application/json
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-Cy7atOGxtc4y3oYNI62WiQ0Vogsdv\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768444907,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"I now can give a great answer  \\nFinal
+        Answer: The sum of 3 + 3 is 6. Therefore, the outcome is that if you add three
+        and three together, you will arrive at the total of six.\",\n        \"refusal\":
+        null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n
+        \     \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\":
+        131,\n    \"completion_tokens\": 46,\n    \"total_tokens\": 177,\n    \"prompt_tokens_details\":
+        {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_29330a9688\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 02:41:48 GMT
+      Server:
+      - cloudflare
+      Set-Cookie:
+      - SET-COOKIE-XXX
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '983'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '944'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '1192'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+version: 1
--- a/lib/crewai/tests/cassettes/agents/test_lite_agent_standalone_still_works.yaml
+++ b/lib/crewai/tests/cassettes/agents/test_lite_agent_standalone_still_works.yaml
@@ -0,0 +1,119 @@
+interactions:
+- request:
+    body: '{"messages":[{"role":"system","content":"You are Standalone Agent. A helpful
+      assistant\nYour personal goal is: Answer questions\nTo give my best complete
+      final answer to the task respond using the exact following format:\n\nThought:
+      I now can give a great answer\nFinal Answer: Your final answer must be the great
+      and the most complete as possible, it must be outcome described.\n\nI MUST use
+      these formats, my job depends on it!"},{"role":"user","content":"\nCurrent Task:
+      What is 5+5? Reply with just the number.\n\nBegin! This is VERY important to
+      you, use the tools available and give your best Final Answer, your job depends
+      on it!\n\nThought:"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '674'
+      content-type:
+      - application/json
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-Cy7azhPwUHQ0p5tdhxSAmLPoE8UgC\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768444913,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"I now can give a great answer  \\nFinal
+        Answer: 10\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n
+        \     \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n
+        \ \"usage\": {\n    \"prompt_tokens\": 136,\n    \"completion_tokens\": 13,\n
+        \   \"total_tokens\": 149,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\":
+        0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_29330a9688\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 02:41:54 GMT
+      Server:
+      - cloudflare
+      Set-Cookie:
+      - SET-COOKIE-XXX
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '858'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '455'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '583'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+version: 1
--- a/lib/crewai/tests/cassettes/agents/test_multiple_agents_in_same_flow.yaml
+++ b/lib/crewai/tests/cassettes/agents/test_multiple_agents_in_same_flow.yaml
@@ -0,0 +1,239 @@
+interactions:
+- request:
+    body: '{"messages":[{"role":"system","content":"You are First Agent. A friendly
+      greeter\nYour personal goal is: Greet users\nTo give my best complete final
+      answer to the task respond using the exact following format:\n\nThought: I now
+      can give a great answer\nFinal Answer: Your final answer must be the great and
+      the most complete as possible, it must be outcome described.\n\nI MUST use these
+      formats, my job depends on it!"},{"role":"user","content":"\nCurrent Task: Say
+      hello\n\nBegin! This is VERY important to you, use the tools available and give
+      your best Final Answer, your job depends on it!\n\nThought:"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '632'
+      content-type:
+      - application/json
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-CyRKzgODZ9yn3F9OkaXsscLk2Ln3N\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768520801,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"I now can give a great answer  \\nFinal
+        Answer: Hello! Welcome! I'm so glad to see you here. If you need any assistance
+        or have any questions, feel free to ask. Have a wonderful day!\",\n        \"refusal\":
+        null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n
+        \     \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\":
+        127,\n    \"completion_tokens\": 43,\n    \"total_tokens\": 170,\n    \"prompt_tokens_details\":
+        {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_c4585b5b9c\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 23:46:42 GMT
+      Server:
+      - cloudflare
+      Set-Cookie:
+      - SET-COOKIE-XXX
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '990'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '880'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '1160'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+- request:
+    body: '{"messages":[{"role":"system","content":"You are Second Agent. A polite
+      farewell agent\nYour personal goal is: Say goodbye\nTo give my best complete
+      final answer to the task respond using the exact following format:\n\nThought:
+      I now can give a great answer\nFinal Answer: Your final answer must be the great
+      and the most complete as possible, it must be outcome described.\n\nI MUST use
+      these formats, my job depends on it!"},{"role":"user","content":"\nCurrent Task:
+      Say goodbye\n\nBegin! This is VERY important to you, use the tools available
+      and give your best Final Answer, your job depends on it!\n\nThought:"}],"model":"gpt-4o-mini"}'
+    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
+      accept:
+      - application/json
+      accept-encoding:
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
+      connection:
+      - keep-alive
+      content-length:
+      - '640'
+      content-type:
+      - application/json
+      host:
+      - api.openai.com
+      x-stainless-arch:
+      - X-STAINLESS-ARCH-XXX
+      x-stainless-async:
+      - 'false'
+      x-stainless-lang:
+      - python
+      x-stainless-os:
+      - X-STAINLESS-OS-XXX
+      x-stainless-package-version:
+      - 1.83.0
+      x-stainless-read-timeout:
+      - X-STAINLESS-READ-TIMEOUT-XXX
+      x-stainless-retry-count:
+      - '0'
+      x-stainless-runtime:
+      - CPython
+      x-stainless-runtime-version:
+      - 3.13.3
+    method: POST
+    uri: https://api.openai.com/v1/chat/completions
+  response:
+    body:
+      string: "{\n  \"id\": \"chatcmpl-CyRL1Ua2PkK5xXPp3KeF0AnGAk3JP\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768520803,\n  \"model\": \"gpt-4o-mini-2024-07-18\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"I now can give a great answer  \\nFinal
+        Answer: As we reach the end of our conversation, I want to express my gratitude
+        for the time we've shared. It's been a pleasure assisting you, and I hope
+        you found our interaction helpful and enjoyable. Remember, whenever you need
+        assistance, I'm just a message away. Wishing you all the best in your future
+        endeavors. Goodbye and take care!\",\n        \"refusal\": null,\n        \"annotations\":
+        []\n      },\n      \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n
+        \   }\n  ],\n  \"usage\": {\n    \"prompt_tokens\": 126,\n    \"completion_tokens\":
+        79,\n    \"total_tokens\": 205,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\":
+        0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_29330a9688\"\n}\n"
+    headers:
+      CF-RAY:
+      - CF-RAY-XXX
+      Connection:
+      - keep-alive
+      Content-Type:
+      - application/json
+      Date:
+      - Thu, 15 Jan 2026 23:46:44 GMT
+      Server:
+      - cloudflare
+      Set-Cookie:
+      - SET-COOKIE-XXX
+      Strict-Transport-Security:
+      - STS-XXX
+      Transfer-Encoding:
+      - chunked
+      X-Content-Type-Options:
+      - X-CONTENT-TYPE-XXX
+      access-control-expose-headers:
+      - ACCESS-CONTROL-XXX
+      alt-svc:
+      - h3=":443"; ma=86400
+      cf-cache-status:
+      - DYNAMIC
+      content-length:
+      - '1189'
+      openai-organization:
+      - OPENAI-ORG-XXX
+      openai-processing-ms:
+      - '1363'
+      openai-project:
+      - OPENAI-PROJECT-XXX
+      openai-version:
+      - '2020-10-01'
+      x-envoy-upstream-service-time:
+      - '1605'
+      x-openai-proxy-wasm:
+      - v0.1
+      x-ratelimit-limit-requests:
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
+      x-ratelimit-limit-tokens:
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
+      x-ratelimit-remaining-requests:
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
+      x-ratelimit-remaining-tokens:
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
+      x-ratelimit-reset-requests:
+      - X-RATELIMIT-RESET-REQUESTS-XXX
+      x-ratelimit-reset-tokens:
+      - X-RATELIMIT-RESET-TOKENS-XXX
+      x-request-id:
+      - X-REQUEST-ID-XXX
+    status:
+      code: 200
+      message: OK
+version: 1
--- a/lib/crewai/tests/cassettes/test_multiple_before_after_kickoff.yaml
+++ b/lib/crewai/tests/cassettes/test_multiple_before_after_kickoff.yaml
--- a/lib/crewai/tests/cassettes/test_task_guardrail_process_output.yaml
+++ b/lib/crewai/tests/cassettes/test_task_guardrail_process_output.yaml
@@ -1,456 +1,528 @@
 interactions:
 - request:
-    body: '{"trace_id": "00000000-0000-0000-0000-000000000000", "execution_type": "crew", "user_identifier": null, "execution_context": {"crew_fingerprint": null, "crew_name": "Unknown Crew", "flow_name": null, "crewai_version": "1.3.0", "privacy_level": "standard"}, "execution_metadata": {"expected_duration_estimate": 300, "agent_count": 0, "task_count": 0, "flow_method_count": 0, "execution_started_at": "2025-11-05T22:19:56.074812+00:00"}}'
+    body: "{\"messages\":[{\"role\":\"system\",\"content\":\"You are Guardrail Agent.
+      You are a expert at validating the output of a task. By providing effective
+      feedback if the output is not valid.\\nYour personal goal is: Validate the output
+      of the task\\nTo give my best complete final answer to the task respond using
+      the exact following format:\\n\\nThought: I now can give a great answer\\nFinal
+      Answer: Your final answer must be the great and the most complete as possible,
+      it must be outcome described.\\n\\nI MUST use these formats, my job depends
+      on it!\"},{\"role\":\"user\",\"content\":\"\\nCurrent Task: \\n        Ensure
+      the following task result complies with the given guardrail.\\n\\n        Task
+      result:\\n        \\n        Lorem Ipsum is simply dummy text of the printing
+      and typesetting industry. Lorem Ipsum has been the industry's standard dummy
+      text ever\\n        \\n\\n        Guardrail:\\n        Ensure the result has
+      less than 10 words\\n\\n        Your task:\\n        - Confirm if the Task result
+      complies with the guardrail.\\n        - If not, provide clear feedback explaining
+      what is wrong (e.g., by how much it violates the rule, or what specific part
+      fails).\\n        - Focus only on identifying issues \u2014 do not propose corrections.\\n
+      \       - If the Task result complies with the guardrail, saying that is valid\\n
+      \       \\n\\nBegin! This is VERY important to you, use the tools available
+      and give your best Final Answer, your job depends on it!\\n\\nThought:\"}],\"model\":\"gpt-4o\"}"
    headers:
-      Accept:
-      - '*/*'
-      Accept-Encoding:
-      - gzip, deflate, zstd
-      Connection:
-      - keep-alive
-      Content-Length:
-      - '434'
-      Content-Type:
-      - application/json
      User-Agent:
-      - CrewAI-CLI/1.3.0
-      X-Crewai-Version:
-      - 1.3.0
-    method: POST
-    uri: https://app.crewai.com/crewai_plus/api/v1/tracing/batches
-  response:
-    body:
-      string: '{"error":"bad_credentials","message":"Bad credentials"}'
-    headers:
-      Connection:
-      - keep-alive
-      Content-Length:
-      - '55'
-      Content-Type:
-      - application/json; charset=utf-8
-      Date:
-      - Wed, 05 Nov 2025 22:19:56 GMT
-      cache-control:
-      - no-store
-      content-security-policy:
-      - 'default-src ''self'' *.app.crewai.com app.crewai.com; script-src ''self'' ''unsafe-inline'' *.app.crewai.com app.crewai.com https://cdn.jsdelivr.net/npm/apexcharts https://www.gstatic.com https://run.pstmn.io https://apis.google.com https://apis.google.com/js/api.js https://accounts.google.com https://accounts.google.com/gsi/client https://cdnjs.cloudflare.com/ajax/libs/normalize/8.0.1/normalize.min.css.map https://*.google.com https://docs.google.com https://slides.google.com https://js.hs-scripts.com https://js.sentry-cdn.com https://browser.sentry-cdn.com https://www.googletagmanager.com https://js-na1.hs-scripts.com https://js.hubspot.com http://js-na1.hs-scripts.com https://bat.bing.com https://cdn.amplitude.com https://cdn.segment.com https://d1d3n03t5zntha.cloudfront.net/ https://descriptusercontent.com https://edge.fullstory.com https://googleads.g.doubleclick.net https://js.hs-analytics.net https://js.hs-banner.com https://js.hsadspixel.net https://js.hscollectedforms.net
-        https://js.usemessages.com https://snap.licdn.com https://static.cloudflareinsights.com https://static.reo.dev https://www.google-analytics.com https://share.descript.com/; style-src ''self'' ''unsafe-inline'' *.app.crewai.com app.crewai.com https://cdn.jsdelivr.net/npm/apexcharts; img-src ''self'' data: *.app.crewai.com app.crewai.com https://zeus.tools.crewai.com https://dashboard.tools.crewai.com https://cdn.jsdelivr.net https://forms.hsforms.com https://track.hubspot.com https://px.ads.linkedin.com https://px4.ads.linkedin.com https://www.google.com https://www.google.com.br; font-src ''self'' data: *.app.crewai.com app.crewai.com; connect-src ''self'' *.app.crewai.com app.crewai.com https://zeus.tools.crewai.com https://connect.useparagon.com/ https://zeus.useparagon.com/* https://*.useparagon.com/* https://run.pstmn.io https://connect.tools.crewai.com/ https://*.sentry.io https://www.google-analytics.com https://edge.fullstory.com https://rs.fullstory.com https://api.hubspot.com
-        https://forms.hscollectedforms.net https://api.hubapi.com https://px.ads.linkedin.com https://px4.ads.linkedin.com https://google.com/pagead/form-data/16713662509 https://google.com/ccm/form-data/16713662509 https://www.google.com/ccm/collect https://worker-actionkit.tools.crewai.com https://api.reo.dev; frame-src ''self'' *.app.crewai.com app.crewai.com https://connect.useparagon.com/ https://zeus.tools.crewai.com https://zeus.useparagon.com/* https://connect.tools.crewai.com/ https://docs.google.com https://drive.google.com https://slides.google.com https://accounts.google.com https://*.google.com https://app.hubspot.com/ https://td.doubleclick.net https://www.googletagmanager.com/ https://www.youtube.com https://share.descript.com'
-      expires:
-      - '0'
-      permissions-policy:
-      - camera=(), microphone=(self), geolocation=()
-      pragma:
-      - no-cache
-      referrer-policy:
-      - strict-origin-when-cross-origin
-      strict-transport-security:
-      - max-age=63072000; includeSubDomains
-      vary:
-      - Accept
-      x-content-type-options:
-      - nosniff
-      x-frame-options:
-      - SAMEORIGIN
-      x-permitted-cross-domain-policies:
-      - none
-      x-request-id:
-      - 230c6cb5-92c7-448d-8c94-e5548a9f4259
-      x-runtime:
-      - '0.073220'
-      x-xss-protection:
-      - 1; mode=block
-    status:
-      code: 401
-      message: Unauthorized
- request:
-    body: '{"messages":[{"role":"system","content":"You are Guardrail Agent. You are a expert at validating the output of a task. By providing effective feedback if the output is not valid.\nYour personal goal is: Validate the output of the task\n\nTo give my best complete final answer to the task respond using the exact following format:\n\nThought: I now can give a great answer\nFinal Answer: Your final answer must be the great and the most complete as possible, it must be outcome described.\n\nI MUST use these formats, my job depends on it!Ensure your final answer strictly adheres to the following OpenAPI schema: {\n  \"type\": \"json_schema\",\n  \"json_schema\": {\n    \"name\": \"LLMGuardrailResult\",\n    \"strict\": true,\n    \"schema\": {\n      \"properties\": {\n        \"valid\": {\n          \"description\": \"Whether the task output complies with the guardrail\",\n          \"title\": \"Valid\",\n          \"type\": \"boolean\"\n        },\n        \"feedback\": {\n          \"anyOf\":
-      [\n            {\n              \"type\": \"string\"\n            },\n            {\n              \"type\": \"null\"\n            }\n          ],\n          \"default\": null,\n          \"description\": \"A feedback about the task output if it is not valid\",\n          \"title\": \"Feedback\"\n        }\n      },\n      \"required\": [\n        \"valid\",\n        \"feedback\"\n      ],\n      \"title\": \"LLMGuardrailResult\",\n      \"type\": \"object\",\n      \"additionalProperties\": false\n    }\n  }\n}\n\nDo not include the OpenAPI schema in the final output. Ensure the final output does not include any code block markers like ```json or ```python."},{"role":"user","content":"\n        Ensure the following task result complies with the given guardrail.\n\n        Task result:\n        \n        Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry''s standard dummy text ever\n        \n\n        Guardrail:\n        Ensure
-      the result has less than 10 words\n\n        Your task:\n        - Confirm if the Task result complies with the guardrail.\n        - If not, provide clear feedback explaining what is wrong (e.g., by how much it violates the rule, or what specific part fails).\n        - Focus only on identifying issues — do not propose corrections.\n        - If the Task result complies with the guardrail, saying that is valid\n        "}],"model":"gpt-4o"}'
-    headers:
+      - X-USER-AGENT-XXX
      accept:
      - application/json
      accept-encoding:
-      - gzip, deflate, zstd
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
      connection:
      - keep-alive
      content-length:
-      - '2452'
+      - '1467'
      content-type:
      - application/json
      host:
      - api.openai.com
-      user-agent:
-      - OpenAI/Python 1.109.1
      x-stainless-arch:
-      - arm64
+      - X-STAINLESS-ARCH-XXX
      x-stainless-async:
      - 'false'
      x-stainless-lang:
      - python
      x-stainless-os:
-      - MacOS
+      - X-STAINLESS-OS-XXX
      x-stainless-package-version:
-      - 1.109.1
+      - 1.83.0
      x-stainless-read-timeout:
-      - '600'
+      - X-STAINLESS-READ-TIMEOUT-XXX
      x-stainless-retry-count:
      - '0'
      x-stainless-runtime:
      - CPython
      x-stainless-runtime-version:
-      - 3.12.9
+      - 3.13.3
    method: POST
    uri: https://api.openai.com/v1/chat/completions
  response:
    body:
-      string: "{\n  \"id\": \"chatcmpl-CYg96Riy2RJRxnBHvoROukymP9wvs\",\n  \"object\": \"chat.completion\",\n  \"created\": 1762381196,\n  \"model\": \"gpt-4o-2024-08-06\",\n  \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\": \"assistant\",\n        \"content\": \"Thought: I need to check if the task result meets the requirement of having less than 10 words.\\n\\nFinal Answer: {\\n  \\\"valid\\\": false,\\n  \\\"feedback\\\": \\\"The task result contains more than 10 words, violating the guardrail. The text provided contains about 21 words.\\\"\\n}\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\": 489,\n    \"completion_tokens\": 61,\n    \"total_tokens\": 550,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\": {\n      \"reasoning_tokens\"\
-        : 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\": 0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\": \"default\",\n  \"system_fingerprint\": \"fp_cbf1785567\"\n}\n"
+      string: "{\n  \"id\": \"chatcmpl-Cy7yHRYTZi8yzRbcODnKr92keLKCb\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768446357,\n  \"model\": \"gpt-4o-2024-08-06\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"The task result provided has more than
+        10 words. I will count the words to verify this.\\n\\nThe task result is the
+        following text:\\n\\\"Lorem Ipsum is simply dummy text of the printing and
+        typesetting industry. Lorem Ipsum has been the industry's standard dummy text
+        ever\\\"\\n\\nCounting the words:\\n\\n1. Lorem \\n2. Ipsum \\n3. is \\n4.
+        simply \\n5. dummy \\n6. text \\n7. of \\n8. the \\n9. printing \\n10. and
+        \\n11. typesetting \\n12. industry. \\n13. Lorem \\n14. Ipsum \\n15. has \\n16.
+        been \\n17. the \\n18. industry's \\n19. standard \\n20. dummy \\n21. text
+        \\n22. ever\\n\\nThe total word count is 22.\\n\\nThought: I now can give
+        a great answer\\nFinal Answer: The task result does not comply with the guardrail.
+        It contains 22 words, which exceeds the limit of 10 words.\",\n        \"refusal\":
+        null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n
+        \     \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\":
+        285,\n    \"completion_tokens\": 195,\n    \"total_tokens\": 480,\n    \"prompt_tokens_details\":
+        {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_deacdd5f6f\"\n}\n"
    headers:
      CF-RAY:
-      - REDACTED-RAY
+      - CF-RAY-XXX
      Connection:
      - keep-alive
      Content-Type:
      - application/json
      Date:
-      - Wed, 05 Nov 2025 22:19:58 GMT
+      - Thu, 15 Jan 2026 03:05:59 GMT
      Server:
      - cloudflare
      Set-Cookie:
-      - __cf_bm=REDACTED; path=/; expires=Wed, 05-Nov-25 22:49:58 GMT; domain=.api.openai.com; HttpOnly; Secure; SameSite=None
-      - _cfuvid=REDACTED; path=/; domain=.api.openai.com; HttpOnly; Secure; SameSite=None
+      - SET-COOKIE-XXX
      Strict-Transport-Security:
-      - max-age=31536000; includeSubDomains; preload
+      - STS-XXX
      Transfer-Encoding:
      - chunked
      X-Content-Type-Options:
-      - nosniff
+      - X-CONTENT-TYPE-XXX
      access-control-expose-headers:
-      - X-Request-ID
+      - ACCESS-CONTROL-XXX
      alt-svc:
      - h3=":443"; ma=86400
      cf-cache-status:
      - DYNAMIC
+      content-length:
+      - '1557'
      openai-organization:
-      - user-hortuttj2f3qtmxyik2zxf4q
+      - OPENAI-ORG-XXX
      openai-processing-ms:
-      - '2201'
+      - '2130'
      openai-project:
-      - proj_fL4UBWR1CMpAAdgzaSKqsVvA
+      - OPENAI-PROJECT-XXX
      openai-version:
      - '2020-10-01'
      x-envoy-upstream-service-time:
-      - '2401'
+      - '2147'
      x-openai-proxy-wasm:
      - v0.1
      x-ratelimit-limit-requests:
-      - '500'
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
      x-ratelimit-limit-tokens:
-      - '30000'
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
      x-ratelimit-remaining-requests:
-      - '499'
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
      x-ratelimit-remaining-tokens:
-      - '29439'
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
      x-ratelimit-reset-requests:
-      - 120ms
+      - X-RATELIMIT-RESET-REQUESTS-XXX
      x-ratelimit-reset-tokens:
-      - 1.122s
+      - X-RATELIMIT-RESET-TOKENS-XXX
      x-request-id:
-      - req_REDACTED
+      - X-REQUEST-ID-XXX
    status:
      code: 200
      message: OK
 - request:
-    body: '{"messages":[{"role":"system","content":"Ensure your final answer strictly adheres to the following OpenAPI schema: {\n  \"type\": \"json_schema\",\n  \"json_schema\": {\n    \"name\": \"LLMGuardrailResult\",\n    \"strict\": true,\n    \"schema\": {\n      \"properties\": {\n        \"valid\": {\n          \"description\": \"Whether the task output complies with the guardrail\",\n          \"title\": \"Valid\",\n          \"type\": \"boolean\"\n        },\n        \"feedback\": {\n          \"anyOf\": [\n            {\n              \"type\": \"string\"\n            },\n            {\n              \"type\": \"null\"\n            }\n          ],\n          \"default\": null,\n          \"description\": \"A feedback about the task output if it is not valid\",\n          \"title\": \"Feedback\"\n        }\n      },\n      \"required\": [\n        \"valid\",\n        \"feedback\"\n      ],\n      \"title\": \"LLMGuardrailResult\",\n      \"type\": \"object\",\n      \"additionalProperties\":
-      false\n    }\n  }\n}\n\nDo not include the OpenAPI schema in the final output. Ensure the final output does not include any code block markers like ```json or ```python."},{"role":"user","content":"{\n  \"valid\": false,\n  \"feedback\": \"The task result contains more than 10 words, violating the guardrail. The text provided contains about 21 words.\"\n}"}],"model":"gpt-4o","response_format":{"type":"json_schema","json_schema":{"schema":{"properties":{"valid":{"description":"Whether the task output complies with the guardrail","title":"Valid","type":"boolean"},"feedback":{"anyOf":[{"type":"string"},{"type":"null"}],"description":"A feedback about the task output if it is not valid","title":"Feedback"}},"required":["valid","feedback"],"title":"LLMGuardrailResult","type":"object","additionalProperties":false},"name":"LLMGuardrailResult","strict":true}},"stream":false}'
+    body: '{"messages":[{"role":"system","content":"Ensure your final answer strictly
+      adheres to the following OpenAPI schema: {\n  \"type\": \"json_schema\",\n  \"json_schema\":
+      {\n    \"name\": \"LLMGuardrailResult\",\n    \"strict\": true,\n    \"schema\":
+      {\n      \"properties\": {\n        \"valid\": {\n          \"description\":
+      \"Whether the task output complies with the guardrail\",\n          \"title\":
+      \"Valid\",\n          \"type\": \"boolean\"\n        },\n        \"feedback\":
+      {\n          \"anyOf\": [\n            {\n              \"type\": \"string\"\n            },\n            {\n              \"type\":
+      \"null\"\n            }\n          ],\n          \"default\": null,\n          \"description\":
+      \"A feedback about the task output if it is not valid\",\n          \"title\":
+      \"Feedback\"\n        }\n      },\n      \"required\": [\n        \"valid\",\n        \"feedback\"\n      ],\n      \"title\":
+      \"LLMGuardrailResult\",\n      \"type\": \"object\",\n      \"additionalProperties\":
+      false\n    }\n  }\n}\n\nDo not include the OpenAPI schema in the final output.
+      Ensure the final output does not include any code block markers like ```json
+      or ```python."},{"role":"user","content":"The task result does not comply with
+      the guardrail. It contains 22 words, which exceeds the limit of 10 words."}],"model":"gpt-4o","response_format":{"type":"json_schema","json_schema":{"schema":{"properties":{"valid":{"description":"Whether
+      the task output complies with the guardrail","title":"Valid","type":"boolean"},"feedback":{"anyOf":[{"type":"string"},{"type":"null"}],"description":"A
+      feedback about the task output if it is not valid","title":"Feedback"}},"required":["valid","feedback"],"title":"LLMGuardrailResult","type":"object","additionalProperties":false},"name":"LLMGuardrailResult","strict":true}},"stream":false}'
    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
      accept:
      - application/json
      accept-encoding:
-      - gzip, deflate, zstd
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
      connection:
      - keep-alive
      content-length:
-      - '1884'
+      - '1835'
      content-type:
      - application/json
      cookie:
-      - __cf_bm=REDACTED; _cfuvid=REDACTED
+      - COOKIE-XXX
      host:
      - api.openai.com
-      user-agent:
-      - OpenAI/Python 1.109.1
      x-stainless-arch:
-      - arm64
+      - X-STAINLESS-ARCH-XXX
      x-stainless-async:
      - 'false'
      x-stainless-helper-method:
-      - chat.completions.parse
+      - beta.chat.completions.parse
      x-stainless-lang:
      - python
      x-stainless-os:
-      - MacOS
+      - X-STAINLESS-OS-XXX
      x-stainless-package-version:
-      - 1.109.1
+      - 1.83.0
      x-stainless-read-timeout:
-      - '600'
+      - X-STAINLESS-READ-TIMEOUT-XXX
      x-stainless-retry-count:
      - '0'
      x-stainless-runtime:
      - CPython
      x-stainless-runtime-version:
-      - 3.12.9
+      - 3.13.3
    method: POST
    uri: https://api.openai.com/v1/chat/completions
  response:
    body:
-      string: "{\n  \"id\": \"chatcmpl-CYg98QlZ8NTrQ69676MpXXyCoZJT8\",\n  \"object\": \"chat.completion\",\n  \"created\": 1762381198,\n  \"model\": \"gpt-4o-2024-08-06\",\n  \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\": \"assistant\",\n        \"content\": \"{\\\"valid\\\":false,\\\"feedback\\\":\\\"The task result contains more than 10 words, violating the guardrail. The text provided contains about 21 words.\\\"}\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\": 374,\n    \"completion_tokens\": 32,\n    \"total_tokens\": 406,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\": {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\": 0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n\
-        \  \"service_tier\": \"default\",\n  \"system_fingerprint\": \"fp_cbf1785567\"\n}\n"
+      string: "{\n  \"id\": \"chatcmpl-Cy7yJiPCk4fXuogyT5e8XeGRLCSf8\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768446359,\n  \"model\": \"gpt-4o-2024-08-06\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"{\\\"valid\\\":false,\\\"feedback\\\":\\\"The
+        task output exceeds the word limit of 10 words by containing 22 words.\\\"}\",\n
+        \       \"refusal\": null,\n        \"annotations\": []\n      },\n      \"logprobs\":
+        null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\":
+        363,\n    \"completion_tokens\": 25,\n    \"total_tokens\": 388,\n    \"prompt_tokens_details\":
+        {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_a0e9480a2f\"\n}\n"
    headers:
      CF-RAY:
-      - REDACTED-RAY
+      - CF-RAY-XXX
      Connection:
      - keep-alive
      Content-Type:
      - application/json
      Date:
-      - Wed, 05 Nov 2025 22:19:59 GMT
+      - Thu, 15 Jan 2026 03:05:59 GMT
      Server:
      - cloudflare
      Strict-Transport-Security:
-      - max-age=31536000; includeSubDomains; preload
+      - STS-XXX
      Transfer-Encoding:
      - chunked
      X-Content-Type-Options:
-      - nosniff
+      - X-CONTENT-TYPE-XXX
      access-control-expose-headers:
-      - X-Request-ID
+      - ACCESS-CONTROL-XXX
      alt-svc:
      - h3=":443"; ma=86400
      cf-cache-status:
      - DYNAMIC
+      content-length:
+      - '913'
      openai-organization:
-      - user-hortuttj2f3qtmxyik2zxf4q
+      - OPENAI-ORG-XXX
      openai-processing-ms:
-      - '419'
+      - '488'
      openai-project:
-      - proj_fL4UBWR1CMpAAdgzaSKqsVvA
+      - OPENAI-PROJECT-XXX
      openai-version:
      - '2020-10-01'
      x-envoy-upstream-service-time:
-      - '432'
+      - '507'
      x-openai-proxy-wasm:
      - v0.1
      x-ratelimit-limit-requests:
-      - '500'
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
      x-ratelimit-limit-tokens:
-      - '30000'
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
      x-ratelimit-remaining-requests:
-      - '499'
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
      x-ratelimit-remaining-tokens:
-      - '29702'
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
      x-ratelimit-reset-requests:
-      - 120ms
+      - X-RATELIMIT-RESET-REQUESTS-XXX
      x-ratelimit-reset-tokens:
-      - 596ms
+      - X-RATELIMIT-RESET-TOKENS-XXX
      x-request-id:
-      - req_REDACTED
+      - X-REQUEST-ID-XXX
    status:
      code: 200
      message: OK
 - request:
-    body: '{"messages":[{"role":"system","content":"You are Guardrail Agent. You are a expert at validating the output of a task. By providing effective feedback if the output is not valid.\nYour personal goal is: Validate the output of the task\n\nTo give my best complete final answer to the task respond using the exact following format:\n\nThought: I now can give a great answer\nFinal Answer: Your final answer must be the great and the most complete as possible, it must be outcome described.\n\nI MUST use these formats, my job depends on it!Ensure your final answer strictly adheres to the following OpenAPI schema: {\n  \"type\": \"json_schema\",\n  \"json_schema\": {\n    \"name\": \"LLMGuardrailResult\",\n    \"strict\": true,\n    \"schema\": {\n      \"properties\": {\n        \"valid\": {\n          \"description\": \"Whether the task output complies with the guardrail\",\n          \"title\": \"Valid\",\n          \"type\": \"boolean\"\n        },\n        \"feedback\": {\n          \"anyOf\":
-      [\n            {\n              \"type\": \"string\"\n            },\n            {\n              \"type\": \"null\"\n            }\n          ],\n          \"default\": null,\n          \"description\": \"A feedback about the task output if it is not valid\",\n          \"title\": \"Feedback\"\n        }\n      },\n      \"required\": [\n        \"valid\",\n        \"feedback\"\n      ],\n      \"title\": \"LLMGuardrailResult\",\n      \"type\": \"object\",\n      \"additionalProperties\": false\n    }\n  }\n}\n\nDo not include the OpenAPI schema in the final output. Ensure the final output does not include any code block markers like ```json or ```python."},{"role":"user","content":"\n        Ensure the following task result complies with the given guardrail.\n\n        Task result:\n        \n        Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry''s standard dummy text ever\n        \n\n        Guardrail:\n        Ensure
-      the result has less than 500 words\n\n        Your task:\n        - Confirm if the Task result complies with the guardrail.\n        - If not, provide clear feedback explaining what is wrong (e.g., by how much it violates the rule, or what specific part fails).\n        - Focus only on identifying issues — do not propose corrections.\n        - If the Task result complies with the guardrail, saying that is valid\n        "}],"model":"gpt-4o"}'
+    body: "{\"messages\":[{\"role\":\"system\",\"content\":\"You are Guardrail Agent.
+      You are a expert at validating the output of a task. By providing effective
+      feedback if the output is not valid.\\nYour personal goal is: Validate the output
+      of the task\\nTo give my best complete final answer to the task respond using
+      the exact following format:\\n\\nThought: I now can give a great answer\\nFinal
+      Answer: Your final answer must be the great and the most complete as possible,
+      it must be outcome described.\\n\\nI MUST use these formats, my job depends
+      on it!\"},{\"role\":\"user\",\"content\":\"\\nCurrent Task: \\n        Ensure
+      the following task result complies with the given guardrail.\\n\\n        Task
+      result:\\n        \\n        Lorem Ipsum is simply dummy text of the printing
+      and typesetting industry. Lorem Ipsum has been the industry's standard dummy
+      text ever\\n        \\n\\n        Guardrail:\\n        Ensure the result has
+      less than 500 words\\n\\n        Your task:\\n        - Confirm if the Task
+      result complies with the guardrail.\\n        - If not, provide clear feedback
+      explaining what is wrong (e.g., by how much it violates the rule, or what specific
+      part fails).\\n        - Focus only on identifying issues \u2014 do not propose
+      corrections.\\n        - If the Task result complies with the guardrail, saying
+      that is valid\\n        \\n\\nBegin! This is VERY important to you, use the
+      tools available and give your best Final Answer, your job depends on it!\\n\\nThought:\"}],\"model\":\"gpt-4o\"}"
    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
      accept:
      - application/json
      accept-encoding:
-      - gzip, deflate, zstd
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
      connection:
      - keep-alive
      content-length:
-      - '2453'
+      - '1468'
      content-type:
      - application/json
      host:
      - api.openai.com
-      user-agent:
-      - OpenAI/Python 1.109.1
      x-stainless-arch:
-      - arm64
+      - X-STAINLESS-ARCH-XXX
      x-stainless-async:
      - 'false'
      x-stainless-lang:
      - python
      x-stainless-os:
-      - MacOS
+      - X-STAINLESS-OS-XXX
      x-stainless-package-version:
-      - 1.109.1
+      - 1.83.0
      x-stainless-read-timeout:
-      - '600'
+      - X-STAINLESS-READ-TIMEOUT-XXX
      x-stainless-retry-count:
      - '0'
      x-stainless-runtime:
      - CPython
      x-stainless-runtime-version:
-      - 3.12.9
+      - 3.13.3
    method: POST
    uri: https://api.openai.com/v1/chat/completions
  response:
    body:
-      string: "{\n  \"id\": \"chatcmpl-CYgBMV6fu7EvV2BqzMdJaKyLAg1WW\",\n  \"object\": \"chat.completion\",\n  \"created\": 1762381336,\n  \"model\": \"gpt-4o-2024-08-06\",\n  \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\": \"assistant\",\n        \"content\": \"Thought: I now can give a great answer\\nFinal Answer: {\\\"valid\\\": true, \\\"feedback\\\": null}\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\": 489,\n    \"completion_tokens\": 23,\n    \"total_tokens\": 512,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\": {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\": 0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\": \"default\",\n  \"system_fingerprint\"\
-        : \"fp_cbf1785567\"\n}\n"
+      string: "{\n  \"id\": \"chatcmpl-Cy7yKa0rmi2YoTLpyXt9hjeLt2rTI\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768446360,\n  \"model\": \"gpt-4o-2024-08-06\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"First, I'll count the number of words
+        in the Task result to ensure it complies with the guardrail. \\n\\nThe Task
+        result is: \\\"Lorem Ipsum is simply dummy text of the printing and typesetting
+        industry. Lorem Ipsum has been the industry's standard dummy text ever.\\\"\\n\\nBy
+        counting the words: \\n1. Lorem\\n2. Ipsum\\n3. is\\n4. simply\\n5. dummy\\n6.
+        text\\n7. of\\n8. the\\n9. printing\\n10. and\\n11. typesetting\\n12. industry\\n13.
+        Lorem\\n14. Ipsum\\n15. has\\n16. been\\n17. the\\n18. industry's\\n19. standard\\n20.
+        dummy\\n21. text\\n22. ever\\n\\nThere are 22 words total in the Task result.\\n\\nI
+        need to verify if the count of 22 words is less than the guardrail limit of
+        500 words.\\n\\nThought: I now can give a great answer\\nFinal Answer: The
+        Task result complies with the guardrail as it contains 22 words, which is
+        less than the 500-word limit. Therefore, the output is valid.\",\n        \"refusal\":
+        null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n
+        \     \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\":
+        285,\n    \"completion_tokens\": 227,\n    \"total_tokens\": 512,\n    \"prompt_tokens_details\":
+        {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_deacdd5f6f\"\n}\n"
    headers:
      CF-RAY:
-      - REDACTED-RAY
+      - CF-RAY-XXX
      Connection:
      - keep-alive
      Content-Type:
      - application/json
      Date:
-      - Wed, 05 Nov 2025 22:22:16 GMT
+      - Thu, 15 Jan 2026 03:06:02 GMT
      Server:
      - cloudflare
      Set-Cookie:
-      - __cf_bm=REDACTED; path=/; expires=Wed, 05-Nov-25 22:52:16 GMT; domain=.api.openai.com; HttpOnly; Secure; SameSite=None
-      - _cfuvid=REDACTED; path=/; domain=.api.openai.com; HttpOnly; Secure; SameSite=None
+      - SET-COOKIE-XXX
      Strict-Transport-Security:
-      - max-age=31536000; includeSubDomains; preload
+      - STS-XXX
      Transfer-Encoding:
      - chunked
      X-Content-Type-Options:
-      - nosniff
+      - X-CONTENT-TYPE-XXX
      access-control-expose-headers:
-      - X-Request-ID
+      - ACCESS-CONTROL-XXX
      alt-svc:
      - h3=":443"; ma=86400
      cf-cache-status:
      - DYNAMIC
+      content-length:
+      - '1668'
      openai-organization:
-      - user-hortuttj2f3qtmxyik2zxf4q
+      - OPENAI-ORG-XXX
      openai-processing-ms:
-      - '327'
+      - '2502'
      openai-project:
-      - proj_fL4UBWR1CMpAAdgzaSKqsVvA
+      - OPENAI-PROJECT-XXX
      openai-version:
      - '2020-10-01'
      x-envoy-upstream-service-time:
-      - '372'
+      - '2522'
      x-openai-proxy-wasm:
      - v0.1
      x-ratelimit-limit-requests:
-      - '500'
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
      x-ratelimit-limit-tokens:
-      - '30000'
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
      x-ratelimit-remaining-requests:
-      - '499'
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
      x-ratelimit-remaining-tokens:
-      - '29438'
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
      x-ratelimit-reset-requests:
-      - 120ms
+      - X-RATELIMIT-RESET-REQUESTS-XXX
      x-ratelimit-reset-tokens:
-      - 1.124s
+      - X-RATELIMIT-RESET-TOKENS-XXX
      x-request-id:
-      - req_REDACTED
+      - X-REQUEST-ID-XXX
    status:
      code: 200
      message: OK
 - request:
-    body: '{"messages":[{"role":"system","content":"Ensure your final answer strictly adheres to the following OpenAPI schema: {\n  \"type\": \"json_schema\",\n  \"json_schema\": {\n    \"name\": \"LLMGuardrailResult\",\n    \"strict\": true,\n    \"schema\": {\n      \"properties\": {\n        \"valid\": {\n          \"description\": \"Whether the task output complies with the guardrail\",\n          \"title\": \"Valid\",\n          \"type\": \"boolean\"\n        },\n        \"feedback\": {\n          \"anyOf\": [\n            {\n              \"type\": \"string\"\n            },\n            {\n              \"type\": \"null\"\n            }\n          ],\n          \"default\": null,\n          \"description\": \"A feedback about the task output if it is not valid\",\n          \"title\": \"Feedback\"\n        }\n      },\n      \"required\": [\n        \"valid\",\n        \"feedback\"\n      ],\n      \"title\": \"LLMGuardrailResult\",\n      \"type\": \"object\",\n      \"additionalProperties\":
-      false\n    }\n  }\n}\n\nDo not include the OpenAPI schema in the final output. Ensure the final output does not include any code block markers like ```json or ```python."},{"role":"user","content":"{\"valid\": true, \"feedback\": null}"}],"model":"gpt-4o","response_format":{"type":"json_schema","json_schema":{"schema":{"properties":{"valid":{"description":"Whether the task output complies with the guardrail","title":"Valid","type":"boolean"},"feedback":{"anyOf":[{"type":"string"},{"type":"null"}],"description":"A feedback about the task output if it is not valid","title":"Feedback"}},"required":["valid","feedback"],"title":"LLMGuardrailResult","type":"object","additionalProperties":false},"name":"LLMGuardrailResult","strict":true}},"stream":false}'
+    body: '{"messages":[{"role":"system","content":"Ensure your final answer strictly
+      adheres to the following OpenAPI schema: {\n  \"type\": \"json_schema\",\n  \"json_schema\":
+      {\n    \"name\": \"LLMGuardrailResult\",\n    \"strict\": true,\n    \"schema\":
+      {\n      \"properties\": {\n        \"valid\": {\n          \"description\":
+      \"Whether the task output complies with the guardrail\",\n          \"title\":
+      \"Valid\",\n          \"type\": \"boolean\"\n        },\n        \"feedback\":
+      {\n          \"anyOf\": [\n            {\n              \"type\": \"string\"\n            },\n            {\n              \"type\":
+      \"null\"\n            }\n          ],\n          \"default\": null,\n          \"description\":
+      \"A feedback about the task output if it is not valid\",\n          \"title\":
+      \"Feedback\"\n        }\n      },\n      \"required\": [\n        \"valid\",\n        \"feedback\"\n      ],\n      \"title\":
+      \"LLMGuardrailResult\",\n      \"type\": \"object\",\n      \"additionalProperties\":
+      false\n    }\n  }\n}\n\nDo not include the OpenAPI schema in the final output.
+      Ensure the final output does not include any code block markers like ```json
+      or ```python."},{"role":"user","content":"The Task result complies with the
+      guardrail as it contains 22 words, which is less than the 500-word limit. Therefore,
+      the output is valid."}],"model":"gpt-4o","response_format":{"type":"json_schema","json_schema":{"schema":{"properties":{"valid":{"description":"Whether
+      the task output complies with the guardrail","title":"Valid","type":"boolean"},"feedback":{"anyOf":[{"type":"string"},{"type":"null"}],"description":"A
+      feedback about the task output if it is not valid","title":"Feedback"}},"required":["valid","feedback"],"title":"LLMGuardrailResult","type":"object","additionalProperties":false},"name":"LLMGuardrailResult","strict":true}},"stream":false}'
    headers:
+      User-Agent:
+      - X-USER-AGENT-XXX
      accept:
      - application/json
      accept-encoding:
-      - gzip, deflate, zstd
+      - ACCEPT-ENCODING-XXX
+      authorization:
+      - AUTHORIZATION-XXX
      connection:
      - keep-alive
      content-length:
-      - '1762'
+      - '1864'
      content-type:
      - application/json
      cookie:
-      - __cf_bm=REDACTED; _cfuvid=REDACTED
+      - COOKIE-XXX
      host:
      - api.openai.com
-      user-agent:
-      - OpenAI/Python 1.109.1
      x-stainless-arch:
-      - arm64
+      - X-STAINLESS-ARCH-XXX
      x-stainless-async:
      - 'false'
      x-stainless-helper-method:
-      - chat.completions.parse
+      - beta.chat.completions.parse
      x-stainless-lang:
      - python
      x-stainless-os:
-      - MacOS
+      - X-STAINLESS-OS-XXX
      x-stainless-package-version:
-      - 1.109.1
+      - 1.83.0
      x-stainless-read-timeout:
-      - '600'
+      - X-STAINLESS-READ-TIMEOUT-XXX
      x-stainless-retry-count:
      - '0'
      x-stainless-runtime:
      - CPython
      x-stainless-runtime-version:
-      - 3.12.9
+      - 3.13.3
    method: POST
    uri: https://api.openai.com/v1/chat/completions
  response:
    body:
-      string: "{\n  \"id\": \"chatcmpl-CYgBMU20R45qGGaLN6vNAmW1NR4R6\",\n  \"object\": \"chat.completion\",\n  \"created\": 1762381336,\n  \"model\": \"gpt-4o-2024-08-06\",\n  \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\": \"assistant\",\n        \"content\": \"{\\\"valid\\\":true,\\\"feedback\\\":null}\",\n        \"refusal\": null,\n        \"annotations\": []\n      },\n      \"logprobs\": null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\": 347,\n    \"completion_tokens\": 9,\n    \"total_tokens\": 356,\n    \"prompt_tokens_details\": {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\": {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\": 0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\": \"default\",\n  \"system_fingerprint\": \"fp_cbf1785567\"\n}\n"
+      string: "{\n  \"id\": \"chatcmpl-Cy7yMAjNYSCz2foZPEcSVCuapzF8y\",\n  \"object\":
+        \"chat.completion\",\n  \"created\": 1768446362,\n  \"model\": \"gpt-4o-2024-08-06\",\n
+        \ \"choices\": [\n    {\n      \"index\": 0,\n      \"message\": {\n        \"role\":
+        \"assistant\",\n        \"content\": \"{\\\"valid\\\":true,\\\"feedback\\\":null}\",\n
+        \       \"refusal\": null,\n        \"annotations\": []\n      },\n      \"logprobs\":
+        null,\n      \"finish_reason\": \"stop\"\n    }\n  ],\n  \"usage\": {\n    \"prompt_tokens\":
+        369,\n    \"completion_tokens\": 9,\n    \"total_tokens\": 378,\n    \"prompt_tokens_details\":
+        {\n      \"cached_tokens\": 0,\n      \"audio_tokens\": 0\n    },\n    \"completion_tokens_details\":
+        {\n      \"reasoning_tokens\": 0,\n      \"audio_tokens\": 0,\n      \"accepted_prediction_tokens\":
+        0,\n      \"rejected_prediction_tokens\": 0\n    }\n  },\n  \"service_tier\":
+        \"default\",\n  \"system_fingerprint\": \"fp_a0e9480a2f\"\n}\n"
    headers:
      CF-RAY:
-      - REDACTED-RAY
+      - CF-RAY-XXX
      Connection:
      - keep-alive
      Content-Type:
      - application/json
      Date:
-      - Wed, 05 Nov 2025 22:22:17 GMT
+      - Thu, 15 Jan 2026 03:06:03 GMT
      Server:
      - cloudflare
      Strict-Transport-Security:
-      - max-age=31536000; includeSubDomains; preload
+      - STS-XXX
      Transfer-Encoding:
      - chunked
      X-Content-Type-Options:
-      - nosniff
+      - X-CONTENT-TYPE-XXX
      access-control-expose-headers:
-      - X-Request-ID
+      - ACCESS-CONTROL-XXX
      alt-svc:
      - h3=":443"; ma=86400
      cf-cache-status:
      - DYNAMIC
+      content-length:
+      - '837'
      openai-organization:
-      - user-hortuttj2f3qtmxyik2zxf4q
+      - OPENAI-ORG-XXX
      openai-processing-ms:
-      - '1081'
+      - '413'
      openai-project:
-      - proj_fL4UBWR1CMpAAdgzaSKqsVvA
+      - OPENAI-PROJECT-XXX
      openai-version:
      - '2020-10-01'
      x-envoy-upstream-service-time:
-      - '1241'
+      - '650'
      x-openai-proxy-wasm:
      - v0.1
      x-ratelimit-limit-requests:
-      - '500'
+      - X-RATELIMIT-LIMIT-REQUESTS-XXX
      x-ratelimit-limit-tokens:
-      - '30000'
+      - X-RATELIMIT-LIMIT-TOKENS-XXX
      x-ratelimit-remaining-requests:
-      - '499'
+      - X-RATELIMIT-REMAINING-REQUESTS-XXX
      x-ratelimit-remaining-tokens:
-      - '29478'
+      - X-RATELIMIT-REMAINING-TOKENS-XXX
      x-ratelimit-reset-requests:
-      - 120ms
+      - X-RATELIMIT-RESET-REQUESTS-XXX
      x-ratelimit-reset-tokens:
-      - 1.042s
+      - X-RATELIMIT-RESET-TOKENS-XXX
      x-request-id:
-      - req_REDACTED
+      - X-REQUEST-ID-XXX
    status:
      code: 200
      message: OK
--- a/lib/crewai/tests/test_flow.py
+++ b/lib/crewai/tests/test_flow.py
@@ -1202,7 +1202,8 @@ def test_complex_and_or_branching():
    )
    assert execution_order.index("branch_2b") > min_branch_1_index

-    # Final should be last and after both 2a and 2b
+
+    # Final should be last, after both 2a and 2b
    assert execution_order[-1] == "final"
    assert execution_order.index("final") > execution_order.index("branch_2a")
    assert execution_order.index("final") > execution_order.index("branch_2b")
@@ -1255,10 +1256,11 @@ def test_conditional_router_paths_exclusivity():


 def test_state_consistency_across_parallel_branches():
-    """Test that state remains consistent when branches execute sequentially.
+    """Test that state remains consistent when branches execute in parallel.

-    Note: Branches triggered by the same parent execute sequentially, not in parallel.
-    This ensures predictable state mutations and prevents race conditions.
+    Note: Branches triggered by the same parent execute in parallel for efficiency.
+    Thread-safe state access via StateProxy ensures no race conditions.
+    We check that both branches and verify_state ran, and that the counter reflects both increments.
    """
    execution_order = []

@@ -1295,12 +1297,14 @@ def test_state_consistency_across_parallel_branches():
    flow = StateConsistencyFlow()
    flow.kickoff()

-    # Branches execute sequentially, so branch_a runs first, then branch_b
-    assert flow.state["branch_a_value"] == 10  # Sees initial value
-    assert flow.state["branch_b_value"] == 11  # Sees value after branch_a increment
+    assert "branch_a" in execution_order
+    assert "branch_b" in execution_order
+    assert "verify_state" in execution_order

-    # Final counter should reflect both increments sequentially
-    assert flow.state["counter"] == 16  # 10 + 1 + 5
+    assert flow.state["branch_a_value"] is not None
+    assert flow.state["branch_b_value"] is not None
+
+    assert flow.state["counter"] == 16


 def test_deeply_nested_conditions():
--- a/lib/crewai/tests/test_flow_persistence.py
+++ b/lib/crewai/tests/test_flow_persistence.py
@@ -247,4 +247,4 @@ def test_persistence_with_base_model(tmp_path):
    assert message.role == "user"
    assert message.type == "text"
    assert message.content == "Hello, World!"
-    assert isinstance(flow.state, State)
+    assert isinstance(flow.state._unwrap(), State)
--- a/lib/crewai/tests/test_task_guardrails.py
+++ b/lib/crewai/tests/test_task_guardrails.py
@@ -185,8 +185,8 @@ def test_task_guardrail_process_output(task_output):

    result = guardrail(task_output)
    assert result[0] is False
-
-    assert result[1] == "The task result contains more than 10 words, violating the guardrail. The text provided contains about 21 words."
+    # Feedback wording varies by LLM; the exact string below matches the recorded cassette
+    assert result[1] == "The task output exceeds the word limit of 10 words by containing 22 words."

    guardrail = LLMGuardrail(
        description="Ensure the result has less than 500 words", llm=LLM(model="gpt-4o")
--- a/lib/crewai/tests/utilities/test_events.py
+++ b/lib/crewai/tests/utilities/test_events.py
@@ -348,11 +348,11 @@ def test_agent_emits_execution_error_event(base_agent, base_task):

    error_message = "Error happening while sending prompt to model."
    base_agent.max_retry_limit = 0
-    with patch.object(
-        CrewAgentExecutor, "invoke", wraps=base_agent.agent_executor.invoke
-    ) as invoke_mock:
-        invoke_mock.side_effect = Exception(error_message)

+    # Patch at the class level since agent_executor is created lazily
+    with patch.object(
+        CrewAgentExecutor, "invoke", side_effect=Exception(error_message)
+    ):
        with pytest.raises(Exception):  # noqa: B017
            base_agent.execute_task(
                task=base_task,