Merge branch 'main' into feat/per-user-token-tracing

Devasy Patel
2026-01-03 22:32:12 +05:30
committed by GitHub
33 changed files with 3757 additions and 2847 deletions

View File

@@ -57,7 +57,7 @@
> It empowers developers with both high-level simplicity and precise low-level control, ideal for creating autonomous AI agents tailored to any scenario.
- **CrewAI Crews**: Optimize for autonomy and collaborative intelligence.
- **CrewAI Flows**: Enable granular, event-driven control, single LLM calls for precise task orchestration and supports Crews natively
- **CrewAI Flows**: The **enterprise and production architecture** for building and deploying multi-agent systems. Enables granular, event-driven control and single LLM calls for precise task orchestration, and supports Crews natively.
With over 100,000 developers certified through our community courses at [learn.crewai.com](https://learn.crewai.com), CrewAI is rapidly becoming the
standard for enterprise-ready AI automation.
@@ -166,13 +166,13 @@ Ensure you have Python >=3.10 <3.14 installed on your system. CrewAI uses [UV](h
First, install CrewAI:
```shell
pip install crewai
uv pip install crewai
```
To install the 'crewai' package along with its optional features, which include additional tools for agents, use the following command:
```shell
pip install 'crewai[tools]'
uv pip install 'crewai[tools]'
```
The command above installs the basic package and also adds extra components that require more dependencies to function.
@@ -185,14 +185,14 @@ If you encounter issues during installation or usage, here are some common solut
1. **ModuleNotFoundError: No module named 'tiktoken'**
- Install tiktoken explicitly: `pip install 'crewai[embeddings]'`
- If using embedchain or other tools: `pip install 'crewai[tools]'`
- Install tiktoken explicitly: `uv pip install 'crewai[embeddings]'`
- If using embedchain or other tools: `uv pip install 'crewai[tools]'`
2. **Failed building wheel for tiktoken**
- Ensure Rust compiler is installed (see installation steps above)
- For Windows: Verify Visual C++ Build Tools are installed
- Try upgrading pip: `pip install --upgrade pip`
- If issues persist, use a pre-built wheel: `pip install tiktoken --prefer-binary`
- Try upgrading pip: `uv pip install --upgrade pip`
- If issues persist, use a pre-built wheel: `uv pip install tiktoken --prefer-binary`
### 2. Setting Up Your Crew with the YAML Configuration
@@ -611,7 +611,7 @@ uv build
### Installing Locally
```bash
pip install dist/*.tar.gz
uv pip install dist/*.tar.gz
```
## Telemetry
@@ -687,13 +687,13 @@ A: CrewAI is a standalone, lean, and fast Python framework built specifically fo
A: Install CrewAI using pip:
```shell
pip install crewai
uv pip install crewai
```
For additional tools, use:
```shell
pip install 'crewai[tools]'
uv pip install 'crewai[tools]'
```
### Q: Does CrewAI depend on LangChain?

View File

@@ -116,6 +116,7 @@
"en/concepts/tasks",
"en/concepts/crews",
"en/concepts/flows",
"en/concepts/production-architecture",
"en/concepts/knowledge",
"en/concepts/llms",
"en/concepts/processes",
@@ -559,6 +560,7 @@
"pt-BR/concepts/tasks",
"pt-BR/concepts/crews",
"pt-BR/concepts/flows",
"pt-BR/concepts/production-architecture",
"pt-BR/concepts/knowledge",
"pt-BR/concepts/llms",
"pt-BR/concepts/processes",
@@ -703,6 +705,7 @@
{
"group": "Observabilidade",
"pages": [
"pt-BR/observability/tracing",
"pt-BR/observability/overview",
"pt-BR/observability/arize-phoenix",
"pt-BR/observability/braintrust",
@@ -984,6 +987,7 @@
"ko/concepts/tasks",
"ko/concepts/crews",
"ko/concepts/flows",
"ko/concepts/production-architecture",
"ko/concepts/knowledge",
"ko/concepts/llms",
"ko/concepts/processes",
@@ -1140,6 +1144,7 @@
{
"group": "Observability",
"pages": [
"ko/observability/tracing",
"ko/observability/overview",
"ko/observability/arize-phoenix",
"ko/observability/braintrust",

View File

@@ -0,0 +1,154 @@
---
title: Production Architecture
description: Best practices for building production-ready AI applications with CrewAI
icon: server
mode: "wide"
---
# The Flow-First Mindset
When building production AI applications with CrewAI, **we recommend starting with a Flow**.
While it's possible to run individual Crews or Agents, wrapping them in a Flow provides the necessary structure for a robust, scalable application.
## Why Flows?
1. **State Management**: Flows provide a built-in way to manage state across different steps of your application. This is crucial for passing data between Crews, maintaining context, and handling user inputs.
2. **Control**: Flows allow you to define precise execution paths, including loops, conditionals, and branching logic. This is essential for handling edge cases and ensuring your application behaves predictably.
3. **Observability**: Flows provide a clear structure that makes it easier to trace execution, debug issues, and monitor performance. We recommend using [CrewAI Tracing](/en/observability/tracing) for detailed insights. Simply run `crewai login` to enable free observability features.
## The Architecture
A typical production CrewAI application looks like this:
```mermaid
graph TD
    Start((Start)) --> Flow[Flow Orchestrator]
    Flow --> State{State Management}
    State --> Step1[Step 1: Data Gathering]
    Step1 --> Crew1[Research Crew]
    Crew1 --> State
    State --> Step2{Condition Check}
    Step2 -- "Valid" --> Step3[Step 3: Execution]
    Step3 --> Crew2[Action Crew]
    Step2 -- "Invalid" --> End((End))
    Crew2 --> End
```
### 1. The Flow Class
Your `Flow` class is the entry point. It defines the state schema and the methods that execute your logic.
```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel


class AppState(BaseModel):
    user_input: str = ""
    research_results: str = ""
    final_report: str = ""


class ProductionFlow(Flow[AppState]):
    @start()
    def gather_input(self):
        # ... logic to get input ...
        pass

    @listen(gather_input)
    def run_research_crew(self):
        # ... trigger a Crew ...
        pass
```
### 2. State Management
Use Pydantic models to define your state. This ensures type safety and makes it clear what data is available at each step.
- **Keep it minimal**: Store only what you need to persist between steps.
- **Use structured data**: Avoid unstructured dictionaries when possible.
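As a quick illustration (the field names here are hypothetical), a typed state model rejects mistakes that a raw dictionary would accept silently:

```python
from pydantic import BaseModel, ValidationError

class ResearchState(BaseModel):
    topic: str = ""
    sources: list[str] = []  # validated list of source URLs
    summary: str = ""        # filled in by a later step

try:
    # A raw dict would accept this silently; the model rejects it.
    ResearchState(topic="AI agents", sources="not-a-list")
except ValidationError as err:
    print(err)
```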
### 3. Crews as Units of Work
Delegate complex tasks to Crews. A Crew should be focused on a specific goal (e.g., "Research a topic", "Write a blog post").
- **Don't over-engineer Crews**: Keep them focused.
- **Pass state explicitly**: Pass the necessary data from the Flow state to the Crew inputs.
```python
@listen(gather_input)
def run_research_crew(self):
    crew = ResearchCrew()
    result = crew.kickoff(inputs={"topic": self.state.user_input})
    self.state.research_results = result.raw
```
## Control Primitives
Leverage CrewAI's control primitives to add robustness and control to your Crews.
### 1. Task Guardrails
Use [Task Guardrails](/en/concepts/tasks#task-guardrails) to validate task outputs before they are accepted. This ensures that your agents produce high-quality results.
```python
from typing import Any, Tuple

from crewai import Task
from crewai.tasks.task_output import TaskOutput

def validate_content(result: TaskOutput) -> Tuple[bool, Any]:
    if len(result.raw) < 100:
        return (False, "Content is too short. Please expand.")
    return (True, result.raw)

task = Task(
    ...,
    guardrail=validate_content
)
```
### 2. Structured Outputs
Always use structured outputs (`output_pydantic` or `output_json`) when passing data between tasks or to your application. This prevents parsing errors and ensures type safety.
```python
from typing import List

from crewai import Task
from pydantic import BaseModel

class ResearchResult(BaseModel):
    summary: str
    sources: List[str]

task = Task(
    ...,
    output_pydantic=ResearchResult
)
```
### 3. LLM Hooks
Use [LLM Hooks](/en/learn/llm-hooks) to inspect or modify messages before they are sent to the LLM, or to sanitize responses.
```python
@before_llm_call
def log_request(context):
    print(f"Agent {context.agent.role} is calling the LLM...")
```
## Deployment Patterns
When deploying your Flow, consider the following:
### CrewAI Enterprise
The easiest way to deploy your Flow is using CrewAI Enterprise. It handles the infrastructure, authentication, and monitoring for you.
Check out the [Deployment Guide](/en/enterprise/guides/deploy-crew) to get started.
```bash
crewai deploy create
```
### Async Execution
For long-running tasks, use `kickoff_async` to avoid blocking your API.
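As an illustration, a minimal sketch that kicks off the `ProductionFlow` defined above without blocking the event loop; it assumes `kickoff_async` accepts the same `inputs` mapping as `kickoff`:

```python
import asyncio

async def main() -> None:
    flow = ProductionFlow()
    # Awaiting kickoff_async lets the event loop serve other
    # requests while the flow (and its crews) run.
    result = await flow.kickoff_async(inputs={"user_input": "CrewAI flows"})
    print(result)

asyncio.run(main())
```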
### Persistence
Use the `@persist` decorator to save the state of your Flow to a database. This allows you to resume execution if the process crashes or if you need to wait for human input.
```python
@persist
class ProductionFlow(Flow[AppState]):
    # ...
```
## Summary
- **Start with a Flow.**
- **Define a clear State.**
- **Use Crews for complex tasks.**
- **Deploy with an API and persistence.**

View File

@@ -7,110 +7,89 @@ mode: "wide"
# What is CrewAI?
**CrewAI is a lean, lightning-fast Python framework built entirely from scratch—completely independent of LangChain or other agent frameworks.**
**CrewAI is the leading open-source framework for orchestrating autonomous AI agents and building complex workflows.**
CrewAI empowers developers with both high-level simplicity and precise low-level control, ideal for creating autonomous AI agents tailored to any scenario:
It empowers developers to build production-ready multi-agent systems by combining the collaborative intelligence of **Crews** with the precise control of **Flows**.
- **[CrewAI Crews](/en/guides/crews/first-crew)**: Optimize for autonomy and collaborative intelligence, enabling you to create AI teams where each agent has specific roles, tools, and goals.
- **[CrewAI Flows](/en/guides/flows/first-flow)**: Enable granular, event-driven control, single LLM calls for precise task orchestration and supports Crews natively.
- **[CrewAI Flows](/en/guides/flows/first-flow)**: The backbone of your AI application. Flows allow you to create structured, event-driven workflows that manage state and control execution. They provide the scaffolding for your AI agents to work within.
- **[CrewAI Crews](/en/guides/crews/first-crew)**: The units of work within your Flow. Crews are teams of autonomous agents that collaborate to solve specific tasks delegated to them by the Flow.
With over 100,000 developers certified through our community courses, CrewAI is rapidly becoming the standard for enterprise-ready AI automation.
With over 100,000 developers certified through our community courses, CrewAI is the standard for enterprise-ready AI automation.
## The CrewAI Architecture
## How Crews Work
CrewAI's architecture is designed to balance autonomy with control.
### 1. Flows: The Backbone
<Note>
Just like a company has departments (Sales, Engineering, Marketing) working together under leadership to achieve business goals, CrewAI helps you create an organization of AI agents with specialized roles collaborating to accomplish complex tasks.
</Note>
<Frame caption="CrewAI Framework Overview">
<img src="/images/crews.png" alt="CrewAI Framework Overview" />
</Frame>
| Component | Description | Key Features |
|:----------|:-----------:|:------------|
| **Crew** | The top-level organization | • Manages AI agent teams<br/>• Oversees workflows<br/>• Ensures collaboration<br/>• Delivers outcomes |
| **AI Agents** | Specialized team members | • Have specific roles (researcher, writer)<br/>• Use designated tools<br/>• Can delegate tasks<br/>• Make autonomous decisions |
| **Process** | Workflow management system | • Defines collaboration patterns<br/>• Controls task assignments<br/>• Manages interactions<br/>• Ensures efficient execution |
| **Tasks** | Individual assignments | • Have clear objectives<br/>• Use specific tools<br/>• Feed into larger process<br/>• Produce actionable results |
### How It All Works Together
1. The **Crew** organizes the overall operation
2. **AI Agents** work on their specialized tasks
3. The **Process** ensures smooth collaboration
4. **Tasks** get completed to achieve the goal
## Key Features
<CardGroup cols={2}>
<Card title="Role-Based Agents" icon="users">
Create specialized agents with defined roles, expertise, and goals - from researchers to analysts to writers
</Card>
<Card title="Flexible Tools" icon="screwdriver-wrench">
Equip agents with custom tools and APIs to interact with external services and data sources
</Card>
<Card title="Intelligent Collaboration" icon="people-arrows">
Agents work together, sharing insights and coordinating tasks to achieve complex objectives
</Card>
<Card title="Task Management" icon="list-check">
Define sequential or parallel workflows, with agents automatically handling task dependencies
</Card>
</CardGroup>
## How Flows Work
<Note>
While Crews excel at autonomous collaboration, Flows provide structured automations, offering granular control over workflow execution. Flows ensure tasks are executed reliably, securely, and efficiently, handling conditional logic, loops, and dynamic state management with precision. Flows integrate seamlessly with Crews, enabling you to balance high autonomy with exacting control.
Think of a Flow as the "manager" or the "process definition" of your application. It defines the steps, the logic, and how data moves through your system.
</Note>
<Frame caption="CrewAI Framework Overview">
<img src="/images/flows.png" alt="CrewAI Framework Overview" />
</Frame>
| Component | Description | Key Features |
|:----------|:-----------:|:------------|
| **Flow** | Structured workflow orchestration | • Manages execution paths<br/>• Handles state transitions<br/>• Controls task sequencing<br/>• Ensures reliable execution |
| **Events** | Triggers for workflow actions | • Initiate specific processes<br/>• Enable dynamic responses<br/>• Support conditional branching<br/>• Allow for real-time adaptation |
| **States** | Workflow execution contexts | • Maintain execution data<br/>• Enable persistence<br/>• Support resumability<br/>• Ensure execution integrity |
| **Crew Support** | Enhances workflow automation | • Injects pockets of agency when needed<br/>• Complements structured workflows<br/>• Balances automation with intelligence<br/>• Enables adaptive decision-making |
Flows provide:
- **State Management**: Persist data across steps and executions.
- **Event-Driven Execution**: Trigger actions based on events or external inputs.
- **Control Flow**: Use conditional logic, loops, and branching.
### Key Capabilities
### 2. Crews: The Intelligence
<Note>
Crews are the "teams" that do the heavy lifting. Within a Flow, you can trigger a Crew to tackle a complex problem requiring creativity and collaboration.
</Note>
<Frame caption="CrewAI Framework Overview">
<img src="/images/crews.png" alt="CrewAI Framework Overview" />
</Frame>
Crews provide:
- **Role-Playing Agents**: Specialized agents with specific goals and tools.
- **Autonomous Collaboration**: Agents work together to solve tasks.
- **Task Delegation**: Tasks are assigned and executed based on agent capabilities.
## How It All Works Together
1. **The Flow** triggers an event or starts a process.
2. **The Flow** manages the state and decides what to do next.
3. **The Flow** delegates a complex task to a **Crew**.
4. **The Crew**'s agents collaborate to complete the task.
5. **The Crew** returns the result to the **Flow**.
6. **The Flow** continues execution based on the result.
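The same loop, as a minimal sketch; `ResearchCrew` here is a hypothetical Crew defined elsewhere, and the state fields are illustrative:

```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel

class State(BaseModel):
    topic: str = "AI agents"
    findings: str = ""

class AppFlow(Flow[State]):
    @start()
    def begin(self):
        # Steps 1-2: the Flow starts and manages state
        return self.state.topic

    @listen(begin)
    def research(self, topic):
        # Steps 3-6: delegate to a Crew, store its result, continue
        result = ResearchCrew().kickoff(inputs={"topic": topic})
        self.state.findings = result.raw
```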
## Key Features
<CardGroup cols={2}>
<Card title="Event-Driven Orchestration" icon="bolt">
Define precise execution paths responding dynamically to events
<Card title="Production-Grade Flows" icon="arrow-progress">
Build reliable, stateful workflows that can handle long-running processes and complex logic.
</Card>
<Card title="Fine-Grained Control" icon="sliders">
Manage workflow states and conditional execution securely and efficiently
<Card title="Autonomous Crews" icon="users">
Deploy teams of agents that can plan, execute, and collaborate to achieve high-level goals.
</Card>
<Card title="Native Crew Integration" icon="puzzle-piece">
Effortlessly combine with Crews for enhanced autonomy and intelligence
<Card title="Flexible Tools" icon="screwdriver-wrench">
Connect your agents to any API, database, or local tool.
</Card>
<Card title="Deterministic Execution" icon="route">
Ensure predictable outcomes with explicit control flow and error handling
<Card title="Enterprise Security" icon="lock">
Designed with security and compliance in mind for enterprise deployments.
</Card>
</CardGroup>
## When to Use Crews vs. Flows
<Note>
Understanding when to use [Crews](/en/guides/crews/first-crew) versus [Flows](/en/guides/flows/first-flow) is key to maximizing the potential of CrewAI in your applications.
</Note>
**The short answer: Use both.**
| Use Case | Recommended Approach | Why? |
|:---------|:---------------------|:-----|
| **Open-ended research** | [Crews](/en/guides/crews/first-crew) | When tasks require creative thinking, exploration, and adaptation |
| **Content generation** | [Crews](/en/guides/crews/first-crew) | For collaborative creation of articles, reports, or marketing materials |
| **Decision workflows** | [Flows](/en/guides/flows/first-flow) | When you need predictable, auditable decision paths with precise control |
| **API orchestration** | [Flows](/en/guides/flows/first-flow) | For reliable integration with multiple external services in a specific sequence |
| **Hybrid applications** | Combined approach | Use [Flows](/en/guides/flows/first-flow) to orchestrate overall process with [Crews](/en/guides/crews/first-crew) handling complex subtasks |
For any production-ready application, **start with a Flow**.
### Decision Framework
- **Use a Flow** to define the overall structure, state, and logic of your application.
- **Use a Crew** within a Flow step when you need a team of agents to perform a specific, complex task that requires autonomy.
- **Choose [Crews](/en/guides/crews/first-crew) when:** You need autonomous problem-solving, creative collaboration, or exploratory tasks
- **Choose [Flows](/en/guides/flows/first-flow) when:** You require deterministic outcomes, auditability, or precise control over execution
- **Combine both when:** Your application needs both structured processes and pockets of autonomous intelligence
| Use Case | Architecture |
| :--- | :--- |
| **Simple Automation** | Single Flow with Python tasks |
| **Complex Research** | Flow managing state -> Crew performing research |
| **Application Backend** | Flow handling API requests -> Crew generating content -> Flow saving to DB |
## Why Choose CrewAI?
@@ -124,13 +103,6 @@ With over 100,000 developers certified through our community courses, CrewAI is
## Ready to Start Building?
<CardGroup cols={2}>
<Card
title="Build Your First Crew"
icon="users-gear"
href="/en/guides/crews/first-crew"
>
Step-by-step tutorial to create a collaborative AI team that works together to solve complex problems.
</Card>
<Card
title="Build Your First Flow"
icon="diagram-project"
@@ -138,6 +110,13 @@ With over 100,000 developers certified through our community courses, CrewAI is
>
Learn how to create structured, event-driven workflows with precise control over execution.
</Card>
<Card
title="Build Your First Crew"
icon="users-gear"
href="/en/guides/crews/first-crew"
>
Step-by-step tutorial to create a collaborative AI team that works together to solve complex problems.
</Card>
</CardGroup>
<CardGroup cols={3}>

View File

@@ -0,0 +1,154 @@
---
title: 프로덕션 아키텍처
description: CrewAI로 프로덕션 수준의 AI 애플리케이션을 구축하기 위한 모범 사례
icon: server
mode: "wide"
---
# Flow 우선 사고방식 (Flow-First Mindset)
CrewAI로 프로덕션 AI 애플리케이션을 구축할 때는 **Flow로 시작하는 것을 권장합니다**.
개별 Crews나 Agents를 실행하는 것도 가능하지만, 이를 Flow로 감싸면 견고하고 확장 가능한 애플리케이션에 필요한 구조를 제공합니다.
## 왜 Flows인가?
1. **상태 관리 (State Management)**: Flows는 애플리케이션의 여러 단계에 걸쳐 상태를 관리하는 내장된 방법을 제공합니다. 이는 Crews 간에 데이터를 전달하고, 컨텍스트를 유지하며, 사용자 입력을 처리하는 데 중요합니다.
2. **제어 (Control)**: Flows를 사용하면 루프, 조건문, 분기 로직을 포함한 정확한 실행 경로를 정의할 수 있습니다. 이는 예외 상황을 처리하고 애플리케이션이 예측 가능하게 동작하도록 보장하는 데 필수적입니다.
3. **관측 가능성 (Observability)**: Flows는 실행을 추적하고, 문제를 디버깅하며, 성능을 모니터링하기 쉽게 만드는 명확한 구조를 제공합니다. 자세한 통찰력을 얻으려면 [CrewAI Tracing](/ko/observability/tracing)을 사용하는 것이 좋습니다. `crewai login`을 실행하여 무료 관측 가능성 기능을 활성화하세요.
## 아키텍처
일반적인 프로덕션 CrewAI 애플리케이션은 다음과 같습니다:
```mermaid
graph TD
    Start((시작)) --> Flow[Flow 오케스트레이터]
    Flow --> State{상태 관리}
    State --> Step1[1단계: 데이터 수집]
    Step1 --> Crew1[연구 Crew]
    Crew1 --> State
    State --> Step2{조건 확인}
    Step2 -- "유효함" --> Step3[3단계: 실행]
    Step3 --> Crew2[액션 Crew]
    Step2 -- "유효하지 않음" --> End((종료))
    Crew2 --> End
```
### 1. Flow 클래스
`Flow` 클래스는 진입점입니다. 상태 스키마와 로직을 실행하는 메서드를 정의합니다.
```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel


class AppState(BaseModel):
    user_input: str = ""
    research_results: str = ""
    final_report: str = ""


class ProductionFlow(Flow[AppState]):
    @start()
    def gather_input(self):
        # ... 입력 받는 로직 ...
        pass

    @listen(gather_input)
    def run_research_crew(self):
        # ... Crew 트리거 ...
        pass
```
### 2. 상태 관리 (State Management)
Pydantic 모델을 사용하여 상태를 정의하세요. 이는 타입 안전성을 보장하고 각 단계에서 어떤 데이터를 사용할 수 있는지 명확하게 합니다.
- **최소한으로 유지**: 단계 간에 유지해야 할 것만 저장하세요.
- **구조화된 데이터 사용**: 가능하면 비구조화된 딕셔너리는 피하세요.
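간단한 예시입니다(여기의 필드 이름은 가정입니다). 타입이 지정된 상태 모델은 일반 딕셔너리가 조용히 허용하는 실수를 거부합니다:

```python
from pydantic import BaseModel, ValidationError

class ResearchState(BaseModel):
    topic: str = ""
    sources: list[str] = []  # 검증되는 소스 URL 목록
    summary: str = ""        # 이후 단계에서 채워짐

try:
    # 일반 딕셔너리는 이를 조용히 허용하지만, 모델은 거부합니다.
    ResearchState(topic="AI agents", sources="not-a-list")
except ValidationError as err:
    print(err)
```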
### 3. 작업 단위로서의 Crews
복잡한 작업은 Crews에게 위임하세요. Crew는 특정 목표(예: "주제 연구", "블로그 게시물 작성")에 집중해야 합니다.
- **Crews를 과도하게 설계하지 마세요**: 집중력을 유지하세요.
- **상태를 명시적으로 전달하세요**: Flow 상태에서 필요한 데이터를 Crew 입력으로 전달하세요.
```python
@listen(gather_input)
def run_research_crew(self):
    crew = ResearchCrew()
    result = crew.kickoff(inputs={"topic": self.state.user_input})
    self.state.research_results = result.raw
```
## Control Primitives
CrewAI의 Control Primitives를 활용하여 Crew에 견고함과 제어력을 더하세요.
### 1. Task Guardrails
[Task Guardrails](/ko/concepts/tasks#task-guardrails)를 사용하여 작업 결과가 수락되기 전에 유효성을 검사하세요. 이를 통해 agent가 고품질 결과를 생성하도록 보장할 수 있습니다.
```python
from typing import Any, Tuple

from crewai import Task
from crewai.tasks.task_output import TaskOutput

def validate_content(result: TaskOutput) -> Tuple[bool, Any]:
    if len(result.raw) < 100:
        return (False, "Content is too short. Please expand.")
    return (True, result.raw)

task = Task(
    ...,
    guardrail=validate_content
)
```
### 2. 구조화된 출력 (Structured Outputs)
작업 간에 데이터를 전달하거나 애플리케이션으로 전달할 때는 항상 구조화된 출력(`output_pydantic` 또는 `output_json`)을 사용하세요. 이는 파싱 오류를 방지하고 타입 안전성을 보장합니다.
```python
from typing import List

from crewai import Task
from pydantic import BaseModel

class ResearchResult(BaseModel):
    summary: str
    sources: List[str]

task = Task(
    ...,
    output_pydantic=ResearchResult
)
```
### 3. LLM Hooks
[LLM Hooks](/ko/learn/llm-hooks)를 사용하여 LLM으로 전송되기 전에 메시지를 검사하거나 수정하고, 응답을 정리(sanitize)하세요.
```python
@before_llm_call
def log_request(context):
    print(f"Agent {context.agent.role} is calling the LLM...")
```
## 배포 패턴
Flow를 배포할 때 다음을 고려하세요:
### CrewAI Enterprise
Flow를 배포하는 가장 쉬운 방법은 CrewAI Enterprise를 사용하는 것입니다. 인프라, 인증 및 모니터링을 대신 처리합니다.
시작하려면 [배포 가이드](/ko/enterprise/guides/deploy-crew)를 확인하세요.
```bash
crewai deploy create
```
### 비동기 실행 (Async Execution)
장기 실행 작업의 경우 `kickoff_async`를 사용하여 API 차단을 방지하세요.
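예시로, 위의 `ProductionFlow`를 이벤트 루프를 차단하지 않고 실행하는 최소한의 스케치입니다. `kickoff_async`가 `kickoff`와 동일한 `inputs` 매핑을 받는다고 가정합니다:

```python
import asyncio

async def main() -> None:
    flow = ProductionFlow()
    # kickoff_async를 await하면 flow(및 해당 crew)가 실행되는 동안
    # 이벤트 루프가 다른 요청을 처리할 수 있습니다.
    result = await flow.kickoff_async(inputs={"user_input": "CrewAI flows"})
    print(result)

asyncio.run(main())
```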
### 지속성 (Persistence)
`@persist` 데코레이터를 사용하여 Flow의 상태를 데이터베이스에 저장하세요. 이를 통해 프로세스가 중단되거나 사람의 입력을 기다려야 할 때 실행을 재개할 수 있습니다.
```python
@persist
class ProductionFlow(Flow[AppState]):
    # ...
```
## 요약
- **Flow로 시작하세요.**
- **명확한 State를 정의하세요.**
- **복잡한 작업에는 Crews를 사용하세요.**
- **API와 지속성을 갖추어 배포하세요.**

View File

@@ -7,109 +7,89 @@ mode: "wide"
# CrewAI란 무엇인가?
**CrewAI는 LangChain이나 기타 agent 프레임워크에 의존하지 않고, 완전히 독립적으로 처음부터 스크래치로 개발된 가볍고 매우 빠른 Python 프레임워크입니다.**
**CrewAI는 자율 AI agent를 조직하고 복잡한 workflow를 구축하기 위한 최고의 오픈 소스 프레임워크입니다.**
CrewAI는 고수준의 간편함과 정밀한 저수준 제어를 모두 제공하여, 어떤 시나리오에도 맞춤화된 자율 AI agent를 만드는 데 이상적입니다:
**Crews**의 협업 지능과 **Flows**의 정밀한 제어를 결합하여 개발자가 프로덕션 수준의 멀티 에이전트 시스템을 구축할 수 있도록 지원합니다.
- **[CrewAI Crews](/ko/guides/crews/first-crew)**: 자율성과 협업 지능을 극대화하여, 각 agent가 특정 역할, 도구, 목표를 가진 AI 팀을 만들 수 있습니다.
- **[CrewAI Flows](/ko/guides/flows/first-flow)**: 이벤트 기반의 세밀한 제어와 단일 LLM 호출을 통한 정확한 작업 orchestration을 지원하며, Crews와 네이티브로 통합됩니다.
- **[CrewAI Flows](/ko/guides/flows/first-flow)**: AI 애플리케이션의 중추(Backbone)입니다. Flows를 사용하면 상태를 관리하고 실행을 제어하는 구조화된 이벤트 기반 workflow를 만들 수 있습니다. AI agent가 작업할 수 있는 기반을 제공합니다.
- **[CrewAI Crews](/ko/guides/crews/first-crew)**: Flow 내의 작업 단위입니다. Crews는 Flow가 위임한 특정 작업을 해결하기 위해 협력하는 자율 agent 팀입니다.
10만 명이 넘는 개발자가 커뮤니티 과정을 통해 인증을 받았으며, CrewAI는 기업용 AI 자동화의 표준으로 빠르게 자리잡고 있습니다.
10만 명이 넘는 개발자가 커뮤니티 과정을 통해 인증을 받았으며, CrewAI는 기업용 AI 자동화의 표준입니다.
## Crew의 작동 방식
## CrewAI 아키텍처
CrewAI의 아키텍처는 자율성과 제어의 균형을 맞추도록 설계되었습니다.
### 1. Flows: 중추 (Backbone)
<Note>
회사가 비즈니스 목표를 달성하기 위해 여러 부서(영업, 엔지니어링, 마케팅 등)가 리더십 아래에서 함께 일하는 것처럼, CrewAI는 복잡한 작업을 달성하기 위해 전문화된 역할의 AI agent들이 협력하는 조직을 만들 수 있도록 도와줍니다.
</Note>
<Frame caption="CrewAI Framework Overview">
<img src="/images/crews.png" alt="CrewAI Framework Overview" />
</Frame>
| 구성 요소 | 설명 | 주요 특징 |
|:----------|:----:|:----------|
| **Crew** | 최상위 조직 | • AI agent 팀 관리<br/>• workflow 감독<br/>• 협업 보장<br/>• 결과 전달 |
| **AI agents** | 전문 팀원 | • 특정 역할 보유(Researcher, Writer 등)<br/>• 지정된 도구 사용<br/>• 작업 위임 가능<br/>• 자율적 의사결정 가능 |
| **Process** | workflow 관리 시스템 | • 협업 패턴 정의<br/>• 작업 할당 제어<br/>• 상호작용 관리<br/>• 효율적 실행 보장 |
| **Task** | 개별 할당 | • 명확한 목표 보유<br/>• 특정 도구 사용<br/>• 더 큰 프로세스에 기여<br/>• 실행 가능한 결과 도출 |
### 전체 구조의 동작 방식
1. **Crew**가 전체 운영을 조직합니다
2. **AI agents**가 자신들의 전문 작업을 수행합니다
3. **Process**가 원활한 협업을 보장합니다
4. **Tasks**가 완료되어 목표를 달성합니다
## 주요 기능
<CardGroup cols={2}>
<Card title="역할 기반 agent" icon="users">
Researcher, Analyst, Writer 등 다양한 역할과 전문성, 목표를 가진 agent를 생성할 수 있습니다
</Card>
<Card title="유연한 도구" icon="screwdriver-wrench">
agent에게 외부 서비스 및 데이터 소스와 상호작용할 수 있는 맞춤형 도구와 API를 제공합니다
</Card>
<Card title="지능형 협업" icon="people-arrows">
agent들이 함께 작업하며, 인사이트를 공유하고 작업을 조율하여 복잡한 목표를 달성합니다
</Card>
<Card title="작업 관리" icon="list-check">
순차적 또는 병렬 workflow를 정의할 수 있으며, agent가 작업 의존성을 자동으로 처리합니다
</Card>
</CardGroup>
## Flow의 작동 원리
<Note>
Crew가 자율 협업에 탁월하다면, Flow는 구조화된 자동화를 제공하여 workflow 실행에 대한 세밀한 제어를 제공합니다. Flow는 조건부 로직, 반복문, 동적 상태 관리를 정확하게 처리하면서 작업이 신뢰성 있게, 안전하게, 효율적으로 실행되도록 보장합니다. Flow는 Crew와 원활하게 통합되어 높은 자율성과 엄격한 제어의 균형을 이룰 수 있게 해줍니다.
Flow를 애플리케이션의 "관리자" 또는 "프로세스 정의"라고 생각하세요. 단계, 로직, 그리고 시스템 내에서 데이터가 이동하는 방식을 정의합니다.
</Note>
<Frame caption="CrewAI Framework Overview">
<img src="/images/flows.png" alt="CrewAI Framework Overview" />
</Frame>
| 구성 요소 | 설명 | 주요 기능 |
|:----------|:-----------:|:------------|
| **Flow** | 구조화된 workflow orchestration | • 실행 경로 관리<br/>• 상태 전환 처리<br/>• 작업 순서 제어<br/>• 신뢰성 있는 실행 보장 |
| **Events** | workflow 액션 트리거 | • 특정 프로세스 시작<br/>• 동적 응답 가능<br/>• 조건부 분기 지원<br/>• 실시간 적응 허용 |
| **States** | workflow 실행 컨텍스트 | • 실행 데이터 유지<br/>• 데이터 영속성 지원<br/>• 재개 가능성 보장<br/>• 실행 무결성 확보 |
| **Crew Support** | workflow 자동화 강화 | • 필요할 때 agency 삽입<br/>• 구조화된 workflow 보완<br/>• 자동화와 인텔리전스의 균형<br/>• 적응적 의사결정 지원 |
Flows의 기능:
- **상태 관리**: 단계 및 실행 전반에 걸쳐 데이터를 유지합니다.
- **이벤트 기반 실행**: 이벤트 또는 외부 입력을 기반으로 작업을 트리거합니다.
- **제어 흐름**: 조건부 로직, 반복문, 분기를 사용합니다.
### 주요 기능
### 2. Crews: 지능 (Intelligence)
<Note>
Crews는 힘든 일을 처리하는 "팀"입니다. Flow 내에서 창의성과 협업이 필요한 복잡한 문제를 해결하기 위해 Crew를 트리거할 수 있습니다.
</Note>
<Frame caption="CrewAI Framework Overview">
<img src="/images/crews.png" alt="CrewAI Framework Overview" />
</Frame>
Crews의 기능:
- **역할 수행 Agent**: 특정 목표와 도구를 가진 전문 agent입니다.
- **자율 협업**: agent들이 협력하여 작업을 해결합니다.
- **작업 위임**: agent의 능력에 따라 작업이 할당되고 실행됩니다.
## 전체 작동 방식
1. **Flow**가 이벤트를 트리거하거나 프로세스를 시작합니다.
2. **Flow**가 상태를 관리하고 다음에 무엇을 할지 결정합니다.
3. **Flow**가 복잡한 작업을 **Crew**에게 위임합니다.
4. **Crew**의 agent들이 협력하여 작업을 완료합니다.
5. **Crew**가 결과를 **Flow**에 반환합니다.
6. **Flow**가 결과를 바탕으로 실행을 계속합니다.
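같은 흐름을 코드로 표현한 최소한의 스케치입니다. 여기서 `ResearchCrew`는 다른 곳에 정의된 가상의 Crew이며, 상태 필드는 예시입니다:

```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel

class State(BaseModel):
    topic: str = "AI agents"
    findings: str = ""

class AppFlow(Flow[State]):
    @start()
    def begin(self):
        # 1-2단계: Flow가 시작되고 상태를 관리합니다
        return self.state.topic

    @listen(begin)
    def research(self, topic):
        # 3-6단계: Crew에 위임하고 결과를 저장한 뒤 계속합니다
        result = ResearchCrew().kickoff(inputs={"topic": topic})
        self.state.findings = result.raw
```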
## 주요 기능
<CardGroup cols={2}>
<Card title="이벤트 기반 orchestration" icon="bolt">
이벤트에 동적으로 반응하여 정밀한 실행 경로를 정의합니다
<Card title="프로덕션 등급 Flows" icon="arrow-progress">
장기 실행 프로세스와 복잡한 로직을 처리할 수 있는 신뢰할 수 있고 상태를 유지하는 workflow를 구축합니다.
</Card>
<Card title="세밀한 제어" icon="sliders">
workflow 상태와 조건부 실행을 안전하고 효율적으로 관리합니다
<Card title="자율 Crews" icon="users">
높은 수준의 목표를 달성하기 위해 계획하고, 실행하고, 협력할 수 있는 agent 팀을 배포합니다.
</Card>
<Card title="네이티브 Crew 통합" icon="puzzle-piece">
Crews와 손쉽게 결합하여 자율성과 지능을 강화합니다
<Card title="유연한 도구" icon="screwdriver-wrench">
agent를 모든 API, 데이터베이스 또는 로컬 도구에 연결합니다.
</Card>
<Card title="결정론적 실행" icon="route">
명시적 제어 흐름과 오류 처리로 예측 가능한 결과를 보장합니다
<Card title="엔터프라이즈 보안" icon="lock">
엔터프라이즈 배포를 위한 보안 및 규정 준수를 고려하여 설계되었습니다.
</Card>
</CardGroup>
## Crew Flow를 언제 사용할까
## Crews vs Flows 사용 시기
<Note>
[Crew](/ko/guides/crews/first-crew)와 [Flow](/ko/guides/flows/first-flow)를 언제 사용할지 이해하는 것은 CrewAI의 잠재력을 애플리케이션에서 극대화하는 데 핵심적입니다.
</Note>
**짧은 답변: 둘 다 사용하세요.**
| 사용 사례 | 권장 접근 방식 | 이유 |
|:---------|:---------------------|:-----|
| **개방형 연구** | [Crew](/ko/guides/crews/first-crew) | 창의적 사고, 탐색, 적응이 필요한 작업에 적합 |
| **콘텐츠 생성** | [Crew](/ko/guides/crews/first-crew) | 기사, 보고서, 마케팅 자료 등 협업형 생성에 적합 |
| **의사결정 workflow** | [Flow](/ko/guides/flows/first-flow) | 예측 가능하고 감사 가능한 의사결정 경로 및 정밀 제어가 필요할 때 |
| **API orchestration** | [Flow](/ko/guides/flows/first-flow) | 특정 순서로 여러 외부 서비스에 신뢰성 있게 통합할 때 |
| **하이브리드 애플리케이션** | 혼합 접근 방식 | [Flow](/ko/guides/flows/first-flow)로 전체 프로세스를 orchestration하고, [Crew](/ko/guides/crews/first-crew)로 복잡한 하위 작업을 처리 |
모든 프로덕션 애플리케이션의 경우, **Flow로 시작하세요**.
### 의사결정 프레임워크
- 애플리케이션의 전체 구조, 상태, 로직을 정의하려면 **Flow를 사용하세요**.
- 자율성이 필요한 특정하고 복잡한 작업을 수행하기 위해 agent 팀이 필요할 때 Flow 단계 내에서 **Crew를 사용하세요**.
- **[Crews](/ko/guides/crews/first-crew)를 선택할 때:** 자율적인 문제 해결, 창의적 협업 또는 탐구적 작업이 필요할 때
- **[Flows](/ko/guides/flows/first-flow)를 선택할 때:** 결정론적 결과, 감사 가능성, 또는 실행에 대한 정밀한 제어가 필요할 때
- **둘 다 결합할 때:** 애플리케이션에 구조화된 프로세스와 자율적 지능이 모두 필요할 때
| 사용 사례 | 아키텍처 |
| :--- | :--- |
| **간단한 자동화** | Python 작업이 포함된 단일 Flow |
| **복잡한 연구** | 상태를 관리하는 Flow -> 연구를 수행하는 Crew |
| **애플리케이션 백엔드** | API 요청을 처리하는 Flow -> 콘텐츠를 생성하는 Crew -> DB에 저장하는 Flow |
## CrewAI를 선택해야 하는 이유?
@@ -123,13 +103,6 @@ CrewAI는 고수준의 간편함과 정밀한 저수준 제어를 모두 제공
## 지금 바로 빌드를 시작해보세요!
<CardGroup cols={2}>
<Card
title="첫 번째 Crew 만들기"
icon="users-gear"
href="/ko/guides/crews/first-crew"
>
복잡한 문제를 함께 해결하는 협업 AI 팀을 단계별로 만드는 튜토리얼입니다.
</Card>
<Card
title="첫 번째 Flow 만들기"
icon="diagram-project"
@@ -137,6 +110,13 @@ CrewAI는 고수준의 간편함과 정밀한 저수준 제어를 모두 제공
>
실행을 정밀하게 제어할 수 있는 구조화된, 이벤트 기반 workflow를 만드는 방법을 배워보세요.
</Card>
<Card
title="첫 번째 Crew 만들기"
icon="users-gear"
href="/ko/guides/crews/first-crew"
>
복잡한 문제를 함께 해결하는 협업 AI 팀을 단계별로 만드는 튜토리얼입니다.
</Card>
</CardGroup>
<CardGroup cols={3}>
@@ -161,4 +141,4 @@ CrewAI는 고수준의 간편함과 정밀한 저수준 제어를 모두 제공
>
다른 개발자와 소통하며, 도움을 받고 CrewAI 경험을 공유해보세요.
</Card>
</CardGroup>
</CardGroup>

View File

@@ -0,0 +1,213 @@
---
title: CrewAI Tracing
description: CrewAI AOP 플랫폼을 사용한 CrewAI Crews 및 Flows의 내장 추적
icon: magnifying-glass-chart
mode: "wide"
---
# CrewAI 내장 추적 (Built-in Tracing)
CrewAI는 Crews와 Flows를 실시간으로 모니터링하고 디버깅할 수 있는 내장 추적 기능을 제공합니다. 이 가이드는 CrewAI의 통합 관측 가능성 플랫폼을 사용하여 **Crews**와 **Flows** 모두에 대한 추적을 활성화하는 방법을 보여줍니다.
> **CrewAI Tracing이란?** CrewAI의 내장 추적은 agent 결정, 작업 실행 타임라인, 도구 사용, LLM 호출을 포함한 AI agent에 대한 포괄적인 관측 가능성을 제공하며, 모두 [CrewAI AOP 플랫폼](https://app.crewai.com)을 통해 액세스할 수 있습니다.
![CrewAI Tracing Interface](/images/crewai-tracing.png)
## 사전 요구 사항
CrewAI 추적을 사용하기 전에 다음이 필요합니다:
1. **CrewAI AOP 계정**: [app.crewai.com](https://app.crewai.com)에서 무료 계정에 가입하세요
2. **CLI 인증**: CrewAI CLI를 사용하여 로컬 환경을 인증하세요
```bash
crewai login
```
## 설정 지침
### 1단계: CrewAI AOP 계정 생성
[app.crewai.com](https://app.crewai.com)을 방문하여 무료 계정을 만드세요. 이를 통해 추적, 메트릭을 보고 crews를 관리할 수 있는 CrewAI AOP 플랫폼에 액세스할 수 있습니다.
### 2단계: CrewAI CLI 설치 및 인증
아직 설치하지 않았다면 CLI 도구와 함께 CrewAI를 설치하세요:
```bash
uv add crewai[tools]
```
그런 다음 CrewAI AOP 계정으로 CLI를 인증하세요:
```bash
crewai login
```
이 명령은 다음을 수행합니다:
1. 브라우저에서 인증 페이지를 엽니다
2. 장치 코드를 입력하라는 메시지를 표시합니다
3. CrewAI AOP 계정으로 로컬 환경을 인증합니다
4. 로컬 개발을 위한 추적 기능을 활성화합니다
### 3단계: Crew에서 추적 활성화
`tracing` 매개변수를 `True`로 설정하여 Crew에 대한 추적을 활성화할 수 있습니다:
```python
from crewai import Agent, Crew, Process, Task
from crewai_tools import SerperDevTool

# Define your agents
researcher = Agent(
    role="Senior Research Analyst",
    goal="Uncover cutting-edge developments in AI and data science",
    backstory="""You work at a leading tech think tank.
    Your expertise lies in identifying emerging trends.
    You have a knack for dissecting complex data and presenting actionable insights.""",
    verbose=True,
    tools=[SerperDevTool()],
)

writer = Agent(
    role="Tech Content Strategist",
    goal="Craft compelling content on tech advancements",
    backstory="""You are a renowned Content Strategist, known for your insightful and engaging articles.
    You transform complex concepts into compelling narratives.""",
    verbose=True,
)

# Create tasks for your agents
research_task = Task(
    description="""Conduct a comprehensive analysis of the latest advancements in AI in 2024.
    Identify key trends, breakthrough technologies, and potential industry impacts.""",
    expected_output="Full analysis report in bullet points",
    agent=researcher,
)

writing_task = Task(
    description="""Using the insights provided, develop an engaging blog
    post that highlights the most significant AI advancements.
    Your post should be informative yet accessible, catering to a tech-savvy audience.""",
    expected_output="Full blog post of at least 4 paragraphs",
    agent=writer,
)

# Enable tracing in your crew
crew = Crew(
    agents=[researcher, writer],
    tasks=[research_task, writing_task],
    process=Process.sequential,
    tracing=True,  # Enable built-in tracing
    verbose=True
)

# Execute your crew
result = crew.kickoff()
```
### 4단계: Flow에서 추적 활성화
마찬가지로 CrewAI Flows에 대한 추적을 활성화할 수 있습니다:
```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel


class ExampleState(BaseModel):
    counter: int = 0
    message: str = ""


class ExampleFlow(Flow[ExampleState]):
    def __init__(self):
        super().__init__(tracing=True)  # Enable tracing for the flow

    @start()
    def first_method(self):
        print("Starting the flow")
        self.state.counter = 1
        self.state.message = "Flow started"
        return "continue"

    @listen("continue")
    def second_method(self):
        print("Continuing the flow")
        self.state.counter += 1
        self.state.message = "Flow continued"
        return "finish"

    @listen("finish")
    def final_method(self):
        print("Finishing the flow")
        self.state.counter += 1
        self.state.message = "Flow completed"


# Create and run the flow (tracing is already enabled in __init__)
flow = ExampleFlow()
result = flow.kickoff()
```
### 5단계: CrewAI AOP 대시보드에서 추적 보기
crew 또는 flow를 실행한 후 CrewAI AOP 대시보드에서 CrewAI 애플리케이션이 생성한 추적을 볼 수 있습니다. agent 상호 작용, 도구 사용 및 LLM 호출의 세부 단계를 볼 수 있습니다.
추적을 보려면 [여기](https://app.crewai.com/crewai_plus/trace_batches)를 클릭하거나, 대시보드의 Traces 탭으로 이동하세요.
![CrewAI Tracing Interface](/images/view-traces.png)
### 대안: 환경 변수 구성
환경 변수를 설정하여 전역적으로 추적을 활성화할 수도 있습니다:
```bash
export CREWAI_TRACING_ENABLED=true
```
또는 `.env` 파일에 추가하세요:
```env
CREWAI_TRACING_ENABLED=true
```
이 환경 변수가 설정되면 `tracing=True`를 명시적으로 설정하지 않아도 모든 Crews와 Flows에 자동으로 추적이 활성화됩니다.
## 추적 보기
### CrewAI AOP 대시보드 액세스
1. [app.crewai.com](https://app.crewai.com)을 방문하여 계정에 로그인하세요
2. 프로젝트 대시보드로 이동하세요
3. **Traces** 탭을 클릭하여 실행 세부 정보를 확인하세요
### 추적에서 볼 수 있는 내용
CrewAI 추적은 다음에 대한 포괄적인 가시성을 제공합니다:
- **Agent 결정**: agent가 작업을 통해 어떻게 추론하고 결정을 내리는지 확인하세요
- **작업 실행 타임라인**: 작업 시퀀스 및 종속성의 시각적 표현
- **도구 사용**: 어떤 도구가 호출되고 그 결과를 모니터링하세요
- **LLM 호출**: 프롬프트 및 응답을 포함한 모든 언어 모델 상호 작용을 추적하세요
- **성능 메트릭**: 실행 시간, 토큰 사용량 및 비용
- **오류 추적**: 세부 오류 정보 및 스택 추적
### 추적 기능
- **실행 타임라인**: 실행의 다양한 단계를 클릭하여 확인하세요
- **세부 로그**: 디버깅을 위한 포괄적인 로그에 액세스하세요
- **성능 분석**: 실행 패턴을 분석하고 성능을 최적화하세요
- **내보내기 기능**: 추가 분석을 위해 추적을 다운로드하세요
### 인증 문제
인증 문제가 발생하는 경우:
1. 로그인되어 있는지 확인하세요: `crewai login`
2. 인터넷 연결을 확인하세요
3. [app.crewai.com](https://app.crewai.com)에서 계정을 확인하세요
### 추적이 나타나지 않음
대시보드에 추적이 표시되지 않는 경우:
1. Crew/Flow에서 `tracing=True`가 설정되어 있는지 확인하세요
2. 환경 변수를 사용하는 경우 `CREWAI_TRACING_ENABLED=true`인지 확인하세요
3. `crewai login`으로 인증되었는지 확인하세요
4. crew/flow가 실제로 실행되고 있는지 확인하세요

View File

@@ -0,0 +1,154 @@
---
title: Arquitetura de Produção
description: Melhores práticas para construir aplicações de IA prontas para produção com CrewAI
icon: server
mode: "wide"
---
# A Mentalidade Flow-First
Ao construir aplicações de IA de produção com CrewAI, **recomendamos começar com um Flow**.
Embora seja possível executar Crews ou Agentes individuais, envolvê-los em um Flow fornece a estrutura necessária para uma aplicação robusta e escalável.
## Por que Flows?
1. **Gerenciamento de Estado**: Flows fornecem uma maneira integrada de gerenciar o estado em diferentes etapas da sua aplicação. Isso é crucial para passar dados entre Crews, manter o contexto e lidar com entradas do usuário.
2. **Controle**: Flows permitem definir caminhos de execução precisos, incluindo loops, condicionais e lógica de ramificação. Isso é essencial para lidar com casos extremos e garantir que sua aplicação se comporte de maneira previsível.
3. **Observabilidade**: Flows fornecem uma estrutura clara que facilita o rastreamento da execução, a depuração de problemas e o monitoramento do desempenho. Recomendamos o uso do [CrewAI Tracing](/pt-BR/observability/tracing) para insights detalhados. Basta executar `crewai login` para habilitar recursos de observabilidade gratuitos.
## A Arquitetura
Uma aplicação CrewAI de produção típica se parece com isso:
```mermaid
graph TD
    Start((Início)) --> Flow[Orquestrador de Flow]
    Flow --> State{Gerenciamento de Estado}
    State --> Step1[Etapa 1: Coleta de Dados]
    Step1 --> Crew1[Crew de Pesquisa]
    Crew1 --> State
    State --> Step2{Verificação de Condição}
    Step2 -- "Válido" --> Step3[Etapa 3: Execução]
    Step3 --> Crew2[Crew de Ação]
    Step2 -- "Inválido" --> End((Fim))
    Crew2 --> End
```
### 1. A Classe Flow
Sua classe `Flow` é o ponto de entrada. Ela define o esquema de estado e os métodos que executam sua lógica.
```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel


class AppState(BaseModel):
    user_input: str = ""
    research_results: str = ""
    final_report: str = ""


class ProductionFlow(Flow[AppState]):
    @start()
    def gather_input(self):
        # ... lógica para obter entrada ...
        pass

    @listen(gather_input)
    def run_research_crew(self):
        # ... acionar um Crew ...
        pass
```
### 2. Gerenciamento de Estado
Use modelos Pydantic para definir seu estado. Isso garante a segurança de tipos e deixa claro quais dados estão disponíveis em cada etapa.
- **Mantenha o mínimo**: Armazene apenas o que você precisa persistir entre as etapas.
- **Use dados estruturados**: Evite dicionários não estruturados quando possível.
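Como ilustração rápida (os nomes dos campos aqui são hipotéticos), um modelo de estado tipado rejeita erros que um dicionário bruto aceitaria silenciosamente:

```python
from pydantic import BaseModel, ValidationError

class ResearchState(BaseModel):
    topic: str = ""
    sources: list[str] = []  # lista validada de URLs de fontes
    summary: str = ""        # preenchido por uma etapa posterior

try:
    # Um dict bruto aceitaria isso silenciosamente; o modelo rejeita.
    ResearchState(topic="AI agents", sources="not-a-list")
except ValidationError as err:
    print(err)
```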
### 3. Crews como Unidades de Trabalho
Delegue tarefas complexas para Crews. Um Crew deve ser focado em um objetivo específico (por exemplo, "Pesquisar um tópico", "Escrever uma postagem no blog").
- **Não faça superengenharia nos Crews**: Mantenha-os focados.
- **Passe o estado explicitamente**: Passe os dados necessários do estado do Flow para as entradas do Crew.
```python
@listen(gather_input)
def run_research_crew(self):
    crew = ResearchCrew()
    result = crew.kickoff(inputs={"topic": self.state.user_input})
    self.state.research_results = result.raw
```
## Primitivas de Controle
Aproveite as primitivas de controle do CrewAI para adicionar robustez e controle aos seus Crews.
### 1. Task Guardrails
Use [Task Guardrails](/pt-BR/concepts/tasks#task-guardrails) para validar as saídas das tarefas antes que sejam aceitas. Isso garante que seus agentes produzam resultados de alta qualidade.
```python
from typing import Any, Tuple

from crewai import Task
from crewai.tasks.task_output import TaskOutput

def validate_content(result: TaskOutput) -> Tuple[bool, Any]:
    if len(result.raw) < 100:
        return (False, "Content is too short. Please expand.")
    return (True, result.raw)

task = Task(
    ...,
    guardrail=validate_content
)
```
### 2. Saídas Estruturadas
Sempre use saídas estruturadas (`output_pydantic` ou `output_json`) ao passar dados entre tarefas ou para sua aplicação. Isso evita erros de análise e garante a segurança de tipos.
```python
from typing import List

from crewai import Task
from pydantic import BaseModel

class ResearchResult(BaseModel):
    summary: str
    sources: List[str]

task = Task(
    ...,
    output_pydantic=ResearchResult
)
```
### 3. LLM Hooks
Use [LLM Hooks](/pt-BR/learn/llm-hooks) para inspecionar ou modificar mensagens antes que elas sejam enviadas para o LLM, ou para higienizar respostas.
```python
@before_llm_call
def log_request(context):
    print(f"Agent {context.agent.role} is calling the LLM...")
```
## Padrões de Implantação
Ao implantar seu Flow, considere o seguinte:
### CrewAI Enterprise
A maneira mais fácil de implantar seu Flow é usando o CrewAI Enterprise. Ele lida com a infraestrutura, autenticação e monitoramento para você.
Confira o [Guia de Implantação](/pt-BR/enterprise/guides/deploy-crew) para começar.
```bash
crewai deploy create
```
### Execução Assíncrona
Para tarefas de longa duração, use `kickoff_async` para evitar bloquear sua API.
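Como ilustração, um esboço mínimo que inicia o `ProductionFlow` acima sem bloquear o event loop; ele assume que `kickoff_async` aceita o mesmo mapeamento de `inputs` que `kickoff`:

```python
import asyncio

async def main() -> None:
    flow = ProductionFlow()
    # Aguardar kickoff_async permite que o event loop atenda outras
    # requisições enquanto o flow (e seus crews) é executado.
    result = await flow.kickoff_async(inputs={"user_input": "CrewAI flows"})
    print(result)

asyncio.run(main())
```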
### Persistência
Use o decorador `@persist` para salvar o estado do seu Flow em um banco de dados. Isso permite retomar a execução se o processo falhar ou se você precisar esperar pela entrada humana.
```python
@persist
class ProductionFlow(Flow[AppState]):
    # ...
```
## Resumo
- **Comece com um Flow.**
- **Defina um Estado claro.**
- **Use Crews para tarefas complexas.**
- **Implante com uma API e persistência.**

View File

@@ -7,110 +7,89 @@ mode: "wide"
# O que é CrewAI?
**CrewAI é um framework Python enxuto e ultrarrápido, construído totalmente do zero—completamente independente do LangChain ou de outros frameworks de agentes.**
**CrewAI é o principal framework open-source para orquestrar agentes de IA autônomos e construir fluxos de trabalho complexos.**
O CrewAI capacita desenvolvedores tanto com simplicidade de alto nível quanto com controle detalhado de baixo nível, ideal para criar agentes de IA autônomos sob medida para qualquer cenário:
Ele capacita desenvolvedores a construir sistemas multi-agente prontos para produção, combinando a inteligência colaborativa dos **Crews** com o controle preciso dos **Flows**.
- **[Crews do CrewAI](/pt-BR/guides/crews/first-crew)**: Otimizados para autonomia e inteligência colaborativa, permitindo criar equipes de IA onde cada agente possui funções, ferramentas e objetivos específicos.
- **[Flows do CrewAI](/pt-BR/guides/flows/first-flow)**: Proporcionam controle granular, orientado por eventos, com chamadas LLM individuais para uma orquestração precisa das tarefas, além de suportar Crews nativamente.
- **[Flows do CrewAI](/pt-BR/guides/flows/first-flow)**: A espinha dorsal da sua aplicação de IA. Flows permitem criar fluxos de trabalho estruturados e orientados a eventos que gerenciam estado e controlam a execução. Eles fornecem a estrutura para seus agentes de IA trabalharem.
- **[Crews do CrewAI](/pt-BR/guides/crews/first-crew)**: As unidades de trabalho dentro do seu Flow. Crews são equipes de agentes autônomos que colaboram para resolver tarefas específicas delegadas a eles pelo Flow.
Com mais de 100.000 desenvolvedores certificados em nossos cursos comunitários, o CrewAI está se tornando rapidamente o padrão para automação de IA pronta para empresas.
Com mais de 100.000 desenvolvedores certificados em nossos cursos comunitários, o CrewAI é o padrão para automação de IA pronta para empresas.
## A Arquitetura do CrewAI
## Como funcionam os Crews
A arquitetura do CrewAI foi projetada para equilibrar autonomia com controle.
### 1. Flows: A Espinha Dorsal
<Note>
Assim como uma empresa possui departamentos (Vendas, Engenharia, Marketing) trabalhando juntos sob uma liderança para atingir objetivos de negócio, o CrewAI ajuda você a criar uma “organização” de agentes de IA com funções especializadas colaborando para realizar tarefas complexas.
</Note>
<Frame caption="Visão Geral do Framework CrewAI">
<img src="/images/crews.png" alt="Visão Geral do Framework CrewAI" />
</Frame>
| Componente | Descrição | Principais Funcionalidades |
|:-----------|:-----------:|:-------------------------|
| **Crew** | Organização de mais alto nível | • Gerencia equipes de agentes de IA<br/>• Supervisiona fluxos de trabalho<br/>• Garante colaboração<br/>• Entrega resultados |
| **Agentes de IA** | Membros especializados da equipe | • Possuem funções específicas (pesquisador, escritor)<br/>• Utilizam ferramentas designadas<br/>• Podem delegar tarefas<br/>• Tomam decisões autônomas |
| **Process** | Sistema de gestão do fluxo de trabalho | • Define padrões de colaboração<br/>• Controla designação de tarefas<br/>• Gerencia interações<br/>• Garante execução eficiente |
| **Tasks** | Atribuições individuais | • Objetivos claros<br/>• Utilizam ferramentas específicas<br/>• Alimentam processos maiores<br/>• Geram resultados acionáveis |
### Como tudo trabalha junto
1. O **Crew** organiza toda a operação
2. **Agentes de IA** realizam tarefas especializadas
3. O **Process** garante colaboração fluida
4. **Tasks** são concluídas para alcançar o objetivo
## Principais Funcionalidades
<CardGroup cols={2}>
<Card title="Agentes Baseados em Funções" icon="users">
Crie agentes especializados com funções, conhecimentos e objetivos definidos de pesquisadores e analistas a escritores
</Card>
<Card title="Ferramentas Flexíveis" icon="screwdriver-wrench">
Equipe os agentes com ferramentas e APIs personalizadas para interagir com serviços e fontes de dados externas
</Card>
<Card title="Colaboração Inteligente" icon="people-arrows">
Agentes trabalham juntos, compartilhando insights e coordenando tarefas para conquistar objetivos complexos
</Card>
<Card title="Gerenciamento de Tarefas" icon="list-check">
Defina fluxos de trabalho sequenciais ou paralelos, com agentes lidando automaticamente com dependências entre tarefas
</Card>
</CardGroup>
## Como funcionam os Flows
<Note>
Enquanto Crews se destacam na colaboração autônoma, Flows proporcionam automações estruturadas, oferecendo controle granular sobre a execução dos fluxos de trabalho. Flows garantem execução confiável, segura e eficiente, lidando com lógica condicional, loops e gerenciamento dinâmico de estados com precisão. Flows se integram perfeitamente com Crews, permitindo equilibrar alta autonomia com controle rigoroso.
Pense em um Flow como o "gerente" ou a "definição do processo" da sua aplicação. Ele define as etapas, a lógica e como os dados se movem através do seu sistema.
</Note>
<Frame caption="Visão Geral do Framework CrewAI">
<img src="/images/flows.png" alt="Visão Geral do Framework CrewAI" />
</Frame>
| Componente | Descrição | Principais Funcionalidades |
|:-----------|:-----------:|:-------------------------|
| **Flow** | Orquestração de fluxo de trabalho estruturada | • Gerencia caminhos de execução<br/>• Lida com transições de estado<br/>• Controla a sequência de tarefas<br/>• Garante execução confiável |
| **Events** | Gatilhos para ações nos fluxos | • Iniciam processos específicos<br/>• Permitem respostas dinâmicas<br/>• Suportam ramificações condicionais<br/>• Adaptam-se em tempo real |
| **States** | Contextos de execução dos fluxos | • Mantêm dados de execução<br/>• Permitem persistência<br/>• Suportam retomada<br/>• Garantem integridade na execução |
| **Crew Support** | Aprimora automação de fluxos | • Injeta autonomia quando necessário<br/>• Complementa fluxos estruturados<br/>• Equilibra automação e inteligência<br/>• Permite tomada de decisão adaptativa |
Flows fornecem:
- **Gerenciamento de Estado**: Persistem dados através de etapas e execuções.
- **Execução Orientada a Eventos**: Acionam ações com base em eventos ou entradas externas.
- **Controle de Fluxo**: Usam lógica condicional, loops e ramificações.
### Capacidades-Chave
### 2. Crews: A Inteligência
<Note>
Crews são as "equipes" que fazem o trabalho pesado. Dentro de um Flow, você pode acionar um Crew para lidar com um problema complexo que requer criatividade e colaboração.
</Note>
<Frame caption="Visão Geral do Framework CrewAI">
<img src="/images/crews.png" alt="Visão Geral do Framework CrewAI" />
</Frame>
Crews fornecem:
- **Agentes com Funções**: Agentes especializados com objetivos e ferramentas específicas.
- **Colaboração Autônoma**: Agentes trabalham juntos para resolver tarefas.
- **Delegação de Tarefas**: Tarefas são atribuídas e executadas com base nas capacidades dos agentes.
## Como Tudo Funciona Junto
1. **O Flow** aciona um evento ou inicia um processo.
2. **O Flow** gerencia o estado e decide o que fazer a seguir.
3. **O Flow** delega uma tarefa complexa para um **Crew**.
4. Os agentes do **Crew** colaboram para completar a tarefa.
5. **O Crew** retorna o resultado para o **Flow**.
6. **O Flow** continua a execução com base no resultado.
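O mesmo ciclo, como um esboço mínimo; `ResearchCrew` aqui é um Crew hipotético definido em outro lugar, e os campos de estado são ilustrativos:

```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel

class State(BaseModel):
    topic: str = "AI agents"
    findings: str = ""

class AppFlow(Flow[State]):
    @start()
    def begin(self):
        # Etapas 1-2: o Flow inicia e gerencia o estado
        return self.state.topic

    @listen(begin)
    def research(self, topic):
        # Etapas 3-6: delega a um Crew, armazena o resultado e continua
        result = ResearchCrew().kickoff(inputs={"topic": topic})
        self.state.findings = result.raw
```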
## Principais Funcionalidades
<CardGroup cols={2}>
<Card title="Orquestração Orientada por Eventos" icon="bolt">
Defina caminhos de execução precisos respondendo dinamicamente a eventos
<Card title="Flows de Nível de Produção" icon="arrow-progress">
Construa fluxos de trabalho confiáveis e com estado que podem lidar com processos de longa duração e lógica complexa.
</Card>
<Card title="Controle Detalhado" icon="sliders">
Gerencie estados de fluxo de trabalho e execução condicional de forma segura e eficiente
<Card title="Crews Autônomos" icon="users">
Implante equipes de agentes que podem planejar, executar e colaborar para alcançar objetivos de alto nível.
</Card>
<Card title="Integração Nativa com Crew" icon="puzzle-piece">
Combine de forma simples com Crews para maior autonomia e inteligência
<Card title="Ferramentas Flexíveis" icon="screwdriver-wrench">
Conecte seus agentes a qualquer API, banco de dados ou ferramenta local.
</Card>
<Card title="Execução Determinística" icon="route">
Garanta resultados previsíveis com controle explícito de fluxo e tratamento de erros
<Card title="Segurança Empresarial" icon="lock">
Projetado com segurança e conformidade em mente para implantações empresariais.
</Card>
</CardGroup>
## Quando usar Crews versus Flows
## Quando usar Crews vs. Flows
<Note>
Entender quando utilizar [Crews](/pt-BR/guides/crews/first-crew) ou [Flows](/pt-BR/guides/flows/first-flow) é fundamental para maximizar o potencial do CrewAI em suas aplicações.
</Note>
**A resposta curta: Use ambos.**
| Caso de uso | Abordagem recomendada | Por quê? |
|:------------|:---------------------|:---------|
| **Pesquisa aberta** | [Crews](/pt-BR/guides/crews/first-crew) | Quando as tarefas exigem criatividade, exploração e adaptação |
| **Geração de conteúdo** | [Crews](/pt-BR/guides/crews/first-crew) | Para criação colaborativa de artigos, relatórios ou materiais de marketing |
| **Fluxos de decisão** | [Flows](/pt-BR/guides/flows/first-flow) | Quando é necessário caminhos de decisão previsíveis, auditáveis e com controle preciso |
| **Orquestração de APIs** | [Flows](/pt-BR/guides/flows/first-flow) | Para integração confiável com múltiplos serviços externos em sequência específica |
| **Aplicações híbridas** | Abordagem combinada | Use [Flows](/pt-BR/guides/flows/first-flow) para orquestrar o processo geral com [Crews](/pt-BR/guides/crews/first-crew) lidando com subtarefas complexas |
Para qualquer aplicação pronta para produção, **comece com um Flow**.
### Framework de Decisão
- **Use um Flow** para definir a estrutura geral, estado e lógica da sua aplicação.
- **Use um Crew** dentro de uma etapa do Flow quando precisar de uma equipe de agentes para realizar uma tarefa específica e complexa que requer autonomia.
- **Escolha [Crews](/pt-BR/guides/crews/first-crew) quando:** Precisa de resolução autônoma de problemas, colaboração criativa ou tarefas exploratórias
- **Escolha [Flows](/pt-BR/guides/flows/first-flow) quando:** Requer resultados determinísticos, auditabilidade ou controle preciso sobre a execução
- **Combine ambos quando:** Sua aplicação precisa de processos estruturados e também de bolsões de inteligência autônoma
| Caso de Uso | Arquitetura |
| :--- | :--- |
| **Automação Simples** | Flow único com tarefas Python |
| **Pesquisa Complexa** | Flow gerenciando estado -> Crew realizando pesquisa |
| **Backend de Aplicação** | Flow lidando com requisições API -> Crew gerando conteúdo -> Flow salvando no BD |
## Por que escolher o CrewAI?
@@ -124,13 +103,6 @@ Com mais de 100.000 desenvolvedores certificados em nossos cursos comunitários,
## Pronto para começar a construir?
<CardGroup cols={2}>
<Card
title="Crie Seu Primeiro Crew"
icon="users-gear"
href="/pt-BR/guides/crews/first-crew"
>
Tutorial passo a passo para criar uma equipe de IA colaborativa que trabalha junto para resolver problemas complexos.
</Card>
<Card
title="Crie Seu Primeiro Flow"
icon="diagram-project"
@@ -138,6 +110,13 @@ Com mais de 100.000 desenvolvedores certificados em nossos cursos comunitários,
>
Aprenda a criar fluxos de trabalho estruturados e orientados por eventos com controle preciso de execução.
</Card>
<Card
title="Crie Seu Primeiro Crew"
icon="users-gear"
href="/pt-BR/guides/crews/first-crew"
>
Tutorial passo a passo para criar uma equipe de IA colaborativa que trabalha junto para resolver problemas complexos.
</Card>
</CardGroup>
<CardGroup cols={3}>

View File

@@ -0,0 +1,213 @@
---
title: CrewAI Tracing
description: Rastreamento integrado para Crews e Flows do CrewAI com a plataforma CrewAI AOP
icon: magnifying-glass-chart
mode: "wide"
---
# Rastreamento Integrado do CrewAI
O CrewAI fornece recursos de rastreamento integrados que permitem monitorar e depurar seus Crews e Flows em tempo real. Este guia demonstra como habilitar o rastreamento para **Crews** e **Flows** usando a plataforma de observabilidade integrada do CrewAI.
> **O que é o CrewAI Tracing?** O rastreamento integrado do CrewAI fornece observabilidade abrangente para seus agentes de IA, incluindo decisões de agentes, cronogramas de execução de tarefas, uso de ferramentas e chamadas de LLM - tudo acessível através da [plataforma CrewAI AOP](https://app.crewai.com).
![CrewAI Tracing Interface](/images/crewai-tracing.png)
## Pré-requisitos
Antes de usar o rastreamento do CrewAI, você precisa:
1. **Conta CrewAI AOP**: Cadastre-se para uma conta gratuita em [app.crewai.com](https://app.crewai.com)
2. **Autenticação CLI**: Use a CLI do CrewAI para autenticar seu ambiente local
```bash
crewai login
```
## Instruções de Configuração
### Passo 1: Crie sua Conta CrewAI AOP
Visite [app.crewai.com](https://app.crewai.com) e crie sua conta gratuita. Isso lhe dará acesso à plataforma CrewAI AOP, onde você pode visualizar rastreamentos, métricas e gerenciar seus crews.
### Passo 2: Instale a CLI do CrewAI e Autentique
Se você ainda não o fez, instale o CrewAI com as ferramentas CLI:
```bash
uv add crewai[tools]
```
Em seguida, autentique sua CLI com sua conta CrewAI AOP:
```bash
crewai login
```
Este comando irá:
1. Abrir seu navegador na página de autenticação
2. Solicitar que você insira um código de dispositivo
3. Autenticar seu ambiente local com sua conta CrewAI AOP
4. Habilitar recursos de rastreamento para seu desenvolvimento local
### Passo 3: Habilite o Rastreamento em seu Crew
Você pode habilitar o rastreamento para seu Crew definindo o parâmetro `tracing` como `True`:
```python
from crewai import Agent, Crew, Process, Task
from crewai_tools import SerperDevTool

# Define your agents
researcher = Agent(
    role="Senior Research Analyst",
    goal="Uncover cutting-edge developments in AI and data science",
    backstory="""You work at a leading tech think tank.
    Your expertise lies in identifying emerging trends.
    You have a knack for dissecting complex data and presenting actionable insights.""",
    verbose=True,
    tools=[SerperDevTool()],
)

writer = Agent(
    role="Tech Content Strategist",
    goal="Craft compelling content on tech advancements",
    backstory="""You are a renowned Content Strategist, known for your insightful and engaging articles.
    You transform complex concepts into compelling narratives.""",
    verbose=True,
)

# Create tasks for your agents
research_task = Task(
    description="""Conduct a comprehensive analysis of the latest advancements in AI in 2024.
    Identify key trends, breakthrough technologies, and potential industry impacts.""",
    expected_output="Full analysis report in bullet points",
    agent=researcher,
)

writing_task = Task(
    description="""Using the insights provided, develop an engaging blog
    post that highlights the most significant AI advancements.
    Your post should be informative yet accessible, catering to a tech-savvy audience.""",
    expected_output="Full blog post of at least 4 paragraphs",
    agent=writer,
)

# Enable tracing in your crew
crew = Crew(
    agents=[researcher, writer],
    tasks=[research_task, writing_task],
    process=Process.sequential,
    tracing=True,  # Enable built-in tracing
    verbose=True
)

# Execute your crew
result = crew.kickoff()
```
### Passo 4: Habilite o Rastreamento em seu Flow
Da mesma forma, você pode habilitar o rastreamento para Flows do CrewAI:
```python
from crewai.flow.flow import Flow, listen, start
from pydantic import BaseModel


class ExampleState(BaseModel):
    counter: int = 0
    message: str = ""


class ExampleFlow(Flow[ExampleState]):
    def __init__(self):
        super().__init__(tracing=True)  # Enable tracing for the flow

    @start()
    def first_method(self):
        print("Starting the flow")
        self.state.counter = 1
        self.state.message = "Flow started"
        return "continue"

    @listen("continue")
    def second_method(self):
        print("Continuing the flow")
        self.state.counter += 1
        self.state.message = "Flow continued"
        return "finish"

    @listen("finish")
    def final_method(self):
        print("Finishing the flow")
        self.state.counter += 1
        self.state.message = "Flow completed"


# Create and run the flow (tracing is already enabled in __init__)
flow = ExampleFlow()
result = flow.kickoff()
```
### Step 5: View the Traces in the CrewAI AOP Dashboard
After running the crew or flow, you can view the traces generated by your CrewAI application in the CrewAI AOP dashboard. You will see detailed steps of the agent interactions, tool usage, and LLM calls.
Just click the link below to view the traces, or go to the traces tab in the dashboard [here](https://app.crewai.com/crewai_plus/trace_batches)
![CrewAI Tracing Interface](/images/view-traces.png)
### Alternative: Environment Variable Configuration
You can also enable tracing globally by setting an environment variable:
```bash
export CREWAI_TRACING_ENABLED=true
```
Or add it to your `.env` file:
```env
CREWAI_TRACING_ENABLED=true
```
When this environment variable is set, all Crews and Flows automatically have tracing enabled, even without explicitly setting `tracing=True`.
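The same flag can also be set programmatically. Below is a minimal sketch, assuming the variable is read when the Crew is constructed, that reuses the `researcher`/`writer` agents and tasks defined in Step 3:
```python
import os

from crewai import Crew, Process

# Setting the flag before any Crew or Flow is created has the same effect
# as `export CREWAI_TRACING_ENABLED=true` in the shell.
os.environ["CREWAI_TRACING_ENABLED"] = "true"

# Reuses the agents and tasks from Step 3; note that tracing=True
# is not needed here because the environment variable enables it.
crew = Crew(
    agents=[researcher, writer],
    tasks=[research_task, writing_task],
    process=Process.sequential,
)
result = crew.kickoff()
```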
## Viewing Your Traces
### Access the CrewAI AOP Dashboard
1. Visit [app.crewai.com](https://app.crewai.com) and log in to your account
2. Navigate to your project dashboard
3. Click the **Traces** tab to view execution details
### What You Will See in the Traces
CrewAI tracing provides comprehensive visibility into:
- **Agent Decisions**: See how agents reason through tasks and make decisions
- **Task Execution Timeline**: Visual representation of task sequences and dependencies
- **Tool Usage**: Monitor which tools are called and their results
- **LLM Calls**: Track all language model interactions, including prompts and responses
- **Performance Metrics**: Execution times, token usage, and costs
- **Error Tracking**: Detailed error information and stack traces
### Tracing Features
- **Execution Timeline**: Click through the different stages of execution
- **Detailed Logs**: Access comprehensive logs for debugging
- **Performance Analysis**: Analyze execution patterns and optimize performance
- **Export Capabilities**: Download traces for further analysis
## Troubleshooting
### Authentication Issues
If you encounter authentication issues:
1. Make sure you are logged in: `crewai login`
2. Check your internet connection
3. Verify your account at [app.crewai.com](https://app.crewai.com)
### Traces Not Appearing
If traces are not showing up in the dashboard:
1. Confirm that `tracing=True` is set on your Crew/Flow
2. Check that `CREWAI_TRACING_ENABLED=true` is set if you are using environment variables
3. Make sure you are authenticated via `crewai login`
4. Verify that your crew/flow is actually executing
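If all four checks pass and traces still do not appear, a quick isolation test is to run a trivial flow with tracing explicitly enabled. This is a minimal sketch using the same `Flow` API shown in Step 4 (the `PingFlow` name is just an example); if a trace for it shows up in the dashboard, the tracing setup itself is working and the issue lies in your crew/flow configuration:
```python
from crewai.flow.flow import Flow, start


class PingFlow(Flow):
    """Trivial flow used only to verify that tracing is wired up."""

    @start()
    def ping(self):
        return "pong"


# With authentication and tracing configured correctly, this run
# should produce a new trace in the CrewAI AOP dashboard.
flow = PingFlow(tracing=True)
print(flow.kickoff())
```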

View File

@@ -1,7 +1,7 @@
from __future__ import annotations
import asyncio
from collections.abc import Sequence
from collections.abc import Callable, Sequence
import shutil
import subprocess
import time
@@ -44,6 +44,7 @@ from crewai.events.types.memory_events import (
MemoryRetrievalCompletedEvent,
MemoryRetrievalStartedEvent,
)
from crewai.experimental.crew_agent_executor_flow import CrewAgentExecutorFlow
from crewai.knowledge.knowledge import Knowledge
from crewai.knowledge.source.base_knowledge_source import BaseKnowledgeSource
from crewai.lite_agent import LiteAgent
@@ -105,7 +106,7 @@ class Agent(BaseAgent):
The agent can also have memory, can operate in verbose mode, and can delegate tasks to other agents.
Attributes:
agent_executor: An instance of the CrewAgentExecutor class.
agent_executor: An instance of the CrewAgentExecutor or CrewAgentExecutorFlow class.
role: The role of the agent.
goal: The objective of the agent.
backstory: The backstory of the agent.
@@ -221,6 +222,10 @@ class Agent(BaseAgent):
default=None,
description="A2A (Agent-to-Agent) configuration for delegating tasks to remote agents. Can be a single A2AConfig or a dict mapping agent IDs to configs.",
)
executor_class: type[CrewAgentExecutor] | type[CrewAgentExecutorFlow] = Field(
default=CrewAgentExecutor,
description="Class to use for the agent executor. Defaults to CrewAgentExecutor, can optionally use CrewAgentExecutorFlow.",
)
@model_validator(mode="before")
def validate_from_repository(cls, v: Any) -> dict[str, Any] | None | Any: # noqa: N805
@@ -721,29 +726,83 @@ class Agent(BaseAgent):
self.response_template.split("{{ .Response }}")[1].strip()
)
self.agent_executor = CrewAgentExecutor(
llm=self.llm, # type: ignore[arg-type]
task=task, # type: ignore[arg-type]
agent=self,
crew=self.crew,
tools=parsed_tools,
prompt=prompt,
original_tools=raw_tools,
stop_words=stop_words,
max_iter=self.max_iter,
tools_handler=self.tools_handler,
tools_names=get_tool_names(parsed_tools),
tools_description=render_text_description_and_args(parsed_tools),
step_callback=self.step_callback,
function_calling_llm=self.function_calling_llm,
respect_context_window=self.respect_context_window,
request_within_rpm_limit=(
self._rpm_controller.check_or_wait if self._rpm_controller else None
),
callbacks=[TokenCalcHandler(self._token_process)],
response_model=task.response_model if task else None,
rpm_limit_fn = (
self._rpm_controller.check_or_wait if self._rpm_controller else None
)
if self.agent_executor is not None:
self._update_executor_parameters(
task=task,
tools=parsed_tools,
raw_tools=raw_tools,
prompt=prompt,
stop_words=stop_words,
rpm_limit_fn=rpm_limit_fn,
)
else:
self.agent_executor = self.executor_class(
llm=cast(BaseLLM, self.llm),
task=task,
i18n=self.i18n,
agent=self,
crew=self.crew,
tools=parsed_tools,
prompt=prompt,
original_tools=raw_tools,
stop_words=stop_words,
max_iter=self.max_iter,
tools_handler=self.tools_handler,
tools_names=get_tool_names(parsed_tools),
tools_description=render_text_description_and_args(parsed_tools),
step_callback=self.step_callback,
function_calling_llm=self.function_calling_llm,
respect_context_window=self.respect_context_window,
request_within_rpm_limit=rpm_limit_fn,
callbacks=[TokenCalcHandler(self._token_process)],
response_model=task.response_model if task else None,
)
def _update_executor_parameters(
self,
task: Task | None,
tools: list,
raw_tools: list[BaseTool],
prompt: dict,
stop_words: list[str],
rpm_limit_fn: Callable | None,
) -> None:
"""Update executor parameters without recreating instance.
Args:
task: Task to execute.
tools: Parsed tools.
raw_tools: Original tools.
prompt: Generated prompt.
stop_words: Stop words list.
rpm_limit_fn: RPM limit callback function.
"""
self.agent_executor.task = task
self.agent_executor.tools = tools
self.agent_executor.original_tools = raw_tools
self.agent_executor.prompt = prompt
self.agent_executor.stop = stop_words
self.agent_executor.tools_names = get_tool_names(tools)
self.agent_executor.tools_description = render_text_description_and_args(tools)
self.agent_executor.response_model = task.response_model if task else None
self.agent_executor.tools_handler = self.tools_handler
self.agent_executor.request_within_rpm_limit = rpm_limit_fn
if self.agent_executor.llm:
existing_stop = getattr(self.agent_executor.llm, "stop", [])
self.agent_executor.llm.stop = list(
set(
existing_stop + stop_words
if isinstance(existing_stop, list)
else stop_words
)
)
def get_delegation_tools(self, agents: list[BaseAgent]) -> list[BaseTool]:
agent_tools = AgentTools(agents=agents)
return agent_tools.tools()

View File

@@ -457,7 +457,6 @@ class BaseAgent(BaseModel, ABC, metaclass=AgentMeta):
if self.cache:
self.cache_handler = cache_handler
self.tools_handler.cache = cache_handler
self.create_agent_executor()
def set_rpm_controller(self, rpm_controller: RPMController) -> None:
"""Set the rpm controller for the agent.
@@ -467,7 +466,6 @@ class BaseAgent(BaseModel, ABC, metaclass=AgentMeta):
"""
if not self._rpm_controller:
self._rpm_controller = rpm_controller
self.create_agent_executor()
def set_knowledge(self, crew_embedder: EmbedderConfig | None = None) -> None:
pass

View File

@@ -3,6 +3,7 @@ from __future__ import annotations
import time
from typing import TYPE_CHECKING
from crewai.agents.parser import AgentFinish
from crewai.events.event_listener import event_listener
from crewai.memory.entity.entity_memory_item import EntityMemoryItem
from crewai.memory.long_term.long_term_memory_item import LongTermMemoryItem
@@ -29,7 +30,7 @@ class CrewAgentExecutorMixin:
_i18n: I18N
_printer: Printer = Printer()
def _create_short_term_memory(self, output) -> None:
def _create_short_term_memory(self, output: AgentFinish) -> None:
"""Create and save a short-term memory item if conditions are met."""
if (
self.crew
@@ -53,7 +54,7 @@ class CrewAgentExecutorMixin:
"error", f"Failed to add to short term memory: {e}"
)
def _create_external_memory(self, output) -> None:
def _create_external_memory(self, output: AgentFinish) -> None:
"""Create and save a external-term memory item if conditions are met."""
if (
self.crew
@@ -75,7 +76,7 @@ class CrewAgentExecutorMixin:
"error", f"Failed to add to external memory: {e}"
)
def _create_long_term_memory(self, output) -> None:
def _create_long_term_memory(self, output: AgentFinish) -> None:
"""Create and save long-term and entity memory items based on evaluation."""
if (
self.crew
@@ -136,40 +137,50 @@ class CrewAgentExecutorMixin:
)
def _ask_human_input(self, final_answer: str) -> str:
"""Prompt human input with mode-appropriate messaging."""
event_listener.formatter.pause_live_updates()
try:
self._printer.print(
content=f"\033[1m\033[95m ## Final Result:\033[00m \033[92m{final_answer}\033[00m"
)
"""Prompt human input with mode-appropriate messaging.
Note: The final answer is already displayed via the AgentLogsExecutionEvent
panel, so we only show the feedback prompt here.
"""
from rich.panel import Panel
from rich.text import Text
formatter = event_listener.formatter
formatter.pause_live_updates()
try:
# Training mode prompt (single iteration)
if self.crew and getattr(self.crew, "_train", False):
prompt = (
"\n\n=====\n"
"## TRAINING MODE: Provide feedback to improve the agent's performance.\n"
prompt_text = (
"TRAINING MODE: Provide feedback to improve the agent's performance.\n\n"
"This will be used to train better versions of the agent.\n"
"Please provide detailed feedback about the result quality and reasoning process.\n"
"=====\n"
"Please provide detailed feedback about the result quality and reasoning process."
)
title = "🎓 Training Feedback Required"
# Regular human-in-the-loop prompt (multiple iterations)
else:
prompt = (
"\n\n=====\n"
"## HUMAN FEEDBACK: Provide feedback on the Final Result and Agent's actions.\n"
"Please follow these guidelines:\n"
" - If you are happy with the result, simply hit Enter without typing anything.\n"
" - Otherwise, provide specific improvement requests.\n"
" - You can provide multiple rounds of feedback until satisfied.\n"
"=====\n"
prompt_text = (
"Provide feedback on the Final Result above.\n\n"
"• If you are happy with the result, simply hit Enter without typing anything.\n"
"• Otherwise, provide specific improvement requests.\n"
"• You can provide multiple rounds of feedback until satisfied."
)
title = "💬 Human Feedback Required"
content = Text()
content.append(prompt_text, style="yellow")
prompt_panel = Panel(
content,
title=title,
border_style="yellow",
padding=(1, 2),
)
formatter.console.print(prompt_panel)
self._printer.print(content=prompt, color="bold_yellow")
response = input()
if response.strip() != "":
self._printer.print(
content="\nProcessing your feedback...", color="cyan"
)
formatter.console.print("\n[cyan]Processing your feedback...[/cyan]")
return response
finally:
event_listener.formatter.resume_live_updates()
formatter.resume_live_updates()

View File

@@ -7,6 +7,7 @@ and memory management.
from __future__ import annotations
from collections.abc import Callable
import logging
from typing import TYPE_CHECKING, Any, Literal, cast
from pydantic import BaseModel, GetCoreSchemaHandler
@@ -51,6 +52,8 @@ from crewai.utilities.tool_utils import (
from crewai.utilities.training_handler import CrewTrainingHandler
logger = logging.getLogger(__name__)
if TYPE_CHECKING:
from crewai.agent import Agent
from crewai.agents.tools_handler import ToolsHandler
@@ -91,6 +94,7 @@ class CrewAgentExecutor(CrewAgentExecutorMixin):
request_within_rpm_limit: Callable[[], bool] | None = None,
callbacks: list[Any] | None = None,
response_model: type[BaseModel] | None = None,
i18n: I18N | None = None,
) -> None:
"""Initialize executor.
@@ -114,7 +118,7 @@ class CrewAgentExecutor(CrewAgentExecutorMixin):
callbacks: Optional callbacks list.
response_model: Optional Pydantic model for structured outputs.
"""
self._i18n: I18N = get_i18n()
self._i18n: I18N = i18n or get_i18n()
self.llm = llm
self.task = task
self.agent = agent
@@ -540,7 +544,7 @@ class CrewAgentExecutor(CrewAgentExecutorMixin):
if self.agent is None:
raise ValueError("Agent cannot be None")
crewai_event_bus.emit(
future = crewai_event_bus.emit(
self.agent,
AgentLogsExecutionEvent(
agent_role=self.agent.role,
@@ -550,6 +554,12 @@ class CrewAgentExecutor(CrewAgentExecutorMixin):
),
)
if future is not None:
try:
future.result(timeout=5.0)
except Exception as e:
logger.error(f"Failed to show logs for agent execution event: {e}")
def _handle_crew_training_output(
self, result: AgentFinish, human_feedback: str | None = None
) -> None:

View File

@@ -1,7 +1,6 @@
from __future__ import annotations
from io import StringIO
import threading
from typing import TYPE_CHECKING, Any
from pydantic import Field, PrivateAttr
@@ -17,8 +16,6 @@ from crewai.events.types.a2a_events import (
A2AResponseReceivedEvent,
)
from crewai.events.types.agent_events import (
AgentExecutionCompletedEvent,
AgentExecutionStartedEvent,
LiteAgentExecutionCompletedEvent,
LiteAgentExecutionErrorEvent,
LiteAgentExecutionStartedEvent,
@@ -48,7 +45,6 @@ from crewai.events.types.flow_events import (
from crewai.events.types.knowledge_events import (
KnowledgeQueryCompletedEvent,
KnowledgeQueryFailedEvent,
KnowledgeQueryStartedEvent,
KnowledgeRetrievalCompletedEvent,
KnowledgeRetrievalStartedEvent,
KnowledgeSearchQueryFailedEvent,
@@ -112,7 +108,6 @@ class EventListener(BaseEventListener):
text_stream: StringIO = StringIO()
knowledge_retrieval_in_progress: bool = False
knowledge_query_in_progress: bool = False
method_branches: dict[str, Any] = Field(default_factory=dict)
def __new__(cls) -> EventListener:
if cls._instance is None:
@@ -126,10 +121,8 @@ class EventListener(BaseEventListener):
self._telemetry = Telemetry()
self._telemetry.set_tracer()
self.execution_spans = {}
self.method_branches = {}
self._initialized = True
self.formatter = ConsoleFormatter(verbose=True)
self._crew_tree_lock = threading.Condition()
# Initialize trace listener with formatter for memory event handling
trace_listener = TraceCollectionListener()
@@ -140,12 +133,10 @@ class EventListener(BaseEventListener):
def setup_listeners(self, crewai_event_bus: CrewAIEventsBus) -> None:
@crewai_event_bus.on(CrewKickoffStartedEvent)
def on_crew_started(source: Any, event: CrewKickoffStartedEvent) -> None:
with self._crew_tree_lock:
self.formatter.create_crew_tree(event.crew_name or "Crew", source.id)
source._execution_span = self._telemetry.crew_execution_span(
source, event.inputs
)
self._crew_tree_lock.notify_all()
self.formatter.handle_crew_started(event.crew_name or "Crew", source.id)
source._execution_span = self._telemetry.crew_execution_span(
source, event.inputs
)
@crewai_event_bus.on(CrewKickoffCompletedEvent)
def on_crew_completed(source: Any, event: CrewKickoffCompletedEvent) -> None:
@@ -153,8 +144,7 @@ class EventListener(BaseEventListener):
final_string_output = event.output.raw
self._telemetry.end_crew(source, final_string_output)
self.formatter.update_crew_tree(
self.formatter.current_crew_tree,
self.formatter.handle_crew_status(
event.crew_name or "Crew",
source.id,
"completed",
@@ -163,8 +153,7 @@ class EventListener(BaseEventListener):
@crewai_event_bus.on(CrewKickoffFailedEvent)
def on_crew_failed(source: Any, event: CrewKickoffFailedEvent) -> None:
self.formatter.update_crew_tree(
self.formatter.current_crew_tree,
self.formatter.handle_crew_status(
event.crew_name or "Crew",
source.id,
"failed",
@@ -197,23 +186,22 @@ class EventListener(BaseEventListener):
# ----------- TASK EVENTS -----------
def get_task_name(source: Any) -> str | None:
return (
source.name
if hasattr(source, "name") and source.name
else source.description
if hasattr(source, "description") and source.description
else None
)
@crewai_event_bus.on(TaskStartedEvent)
def on_task_started(source: Any, event: TaskStartedEvent) -> None:
span = self._telemetry.task_started(crew=source.agent.crew, task=source)
self.execution_spans[source] = span
with self._crew_tree_lock:
self._crew_tree_lock.wait_for(
lambda: self.formatter.current_crew_tree is not None, timeout=5.0
)
if self.formatter.current_crew_tree is not None:
task_name = (
source.name if hasattr(source, "name") and source.name else None
)
self.formatter.create_task_branch(
self.formatter.current_crew_tree, source.id, task_name
)
task_name = get_task_name(source)
self.formatter.handle_task_started(source.id, task_name)
@crewai_event_bus.on(TaskCompletedEvent)
def on_task_completed(source: Any, event: TaskCompletedEvent) -> None:
@@ -224,13 +212,9 @@ class EventListener(BaseEventListener):
self.execution_spans[source] = None
# Pass task name if it exists
task_name = source.name if hasattr(source, "name") and source.name else None
self.formatter.update_task_status(
self.formatter.current_crew_tree,
source.id,
source.agent.role,
"completed",
task_name,
task_name = get_task_name(source)
self.formatter.handle_task_status(
source.id, source.agent.role, "completed", task_name
)
@crewai_event_bus.on(TaskFailedEvent)
@@ -242,37 +226,12 @@ class EventListener(BaseEventListener):
self.execution_spans[source] = None
# Pass task name if it exists
task_name = source.name if hasattr(source, "name") and source.name else None
self.formatter.update_task_status(
self.formatter.current_crew_tree,
source.id,
source.agent.role,
"failed",
task_name,
task_name = get_task_name(source)
self.formatter.handle_task_status(
source.id, source.agent.role, "failed", task_name
)
# ----------- AGENT EVENTS -----------
@crewai_event_bus.on(AgentExecutionStartedEvent)
def on_agent_execution_started(
_: Any, event: AgentExecutionStartedEvent
) -> None:
self.formatter.create_agent_branch(
self.formatter.current_task_branch,
event.agent.role,
self.formatter.current_crew_tree,
)
@crewai_event_bus.on(AgentExecutionCompletedEvent)
def on_agent_execution_completed(
_: Any, event: AgentExecutionCompletedEvent
) -> None:
self.formatter.update_agent_status(
self.formatter.current_agent_branch,
event.agent.role,
self.formatter.current_crew_tree,
)
# ----------- LITE AGENT EVENTS -----------
@crewai_event_bus.on(LiteAgentExecutionStartedEvent)
@@ -316,79 +275,61 @@ class EventListener(BaseEventListener):
self._telemetry.flow_execution_span(
event.flow_name, list(source._methods.keys())
)
tree = self.formatter.create_flow_tree(event.flow_name, str(source.flow_id))
self.formatter.current_flow_tree = tree
self.formatter.start_flow(event.flow_name, str(source.flow_id))
self.formatter.handle_flow_created(event.flow_name, str(source.flow_id))
self.formatter.handle_flow_started(event.flow_name, str(source.flow_id))
@crewai_event_bus.on(FlowFinishedEvent)
def on_flow_finished(source: Any, event: FlowFinishedEvent) -> None:
self.formatter.update_flow_status(
self.formatter.current_flow_tree, event.flow_name, source.flow_id
self.formatter.handle_flow_status(
event.flow_name,
source.flow_id,
)
@crewai_event_bus.on(MethodExecutionStartedEvent)
def on_method_execution_started(
_: Any, event: MethodExecutionStartedEvent
) -> None:
method_branch = self.method_branches.get(event.method_name)
updated_branch = self.formatter.update_method_status(
method_branch,
self.formatter.current_flow_tree,
self.formatter.handle_method_status(
event.method_name,
"running",
)
self.method_branches[event.method_name] = updated_branch
@crewai_event_bus.on(MethodExecutionFinishedEvent)
def on_method_execution_finished(
_: Any, event: MethodExecutionFinishedEvent
) -> None:
method_branch = self.method_branches.get(event.method_name)
updated_branch = self.formatter.update_method_status(
method_branch,
self.formatter.current_flow_tree,
self.formatter.handle_method_status(
event.method_name,
"completed",
)
self.method_branches[event.method_name] = updated_branch
@crewai_event_bus.on(MethodExecutionFailedEvent)
def on_method_execution_failed(
_: Any, event: MethodExecutionFailedEvent
) -> None:
method_branch = self.method_branches.get(event.method_name)
updated_branch = self.formatter.update_method_status(
method_branch,
self.formatter.current_flow_tree,
self.formatter.handle_method_status(
event.method_name,
"failed",
)
self.method_branches[event.method_name] = updated_branch
@crewai_event_bus.on(MethodExecutionPausedEvent)
def on_method_execution_paused(
_: Any, event: MethodExecutionPausedEvent
) -> None:
method_branch = self.method_branches.get(event.method_name)
updated_branch = self.formatter.update_method_status(
method_branch,
self.formatter.current_flow_tree,
self.formatter.handle_method_status(
event.method_name,
"paused",
)
self.method_branches[event.method_name] = updated_branch
@crewai_event_bus.on(FlowPausedEvent)
def on_flow_paused(_: Any, event: FlowPausedEvent) -> None:
self.formatter.update_flow_status(
self.formatter.current_flow_tree,
self.formatter.handle_flow_status(
event.flow_name,
event.flow_id,
"paused",
)
# ----------- TOOL USAGE EVENTS -----------
@crewai_event_bus.on(ToolUsageStartedEvent)
def on_tool_usage_started(source: Any, event: ToolUsageStartedEvent) -> None:
if isinstance(source, LLM):
@@ -398,9 +339,9 @@ class EventListener(BaseEventListener):
)
else:
self.formatter.handle_tool_usage_started(
self.formatter.current_agent_branch,
event.tool_name,
self.formatter.current_crew_tree,
event.tool_args,
event.run_attempts,
)
@crewai_event_bus.on(ToolUsageFinishedEvent)
@@ -409,12 +350,6 @@ class EventListener(BaseEventListener):
self.formatter.handle_llm_tool_usage_finished(
event.tool_name,
)
else:
self.formatter.handle_tool_usage_finished(
self.formatter.current_tool_branch,
event.tool_name,
self.formatter.current_crew_tree,
)
@crewai_event_bus.on(ToolUsageErrorEvent)
def on_tool_usage_error(source: Any, event: ToolUsageErrorEvent) -> None:
@@ -425,10 +360,9 @@ class EventListener(BaseEventListener):
)
else:
self.formatter.handle_tool_usage_error(
self.formatter.current_tool_branch,
event.tool_name,
event.error,
self.formatter.current_crew_tree,
event.run_attempts,
)
# ----------- LLM EVENTS -----------
@@ -437,32 +371,15 @@ class EventListener(BaseEventListener):
def on_llm_call_started(_: Any, event: LLMCallStartedEvent) -> None:
self.text_stream = StringIO()
self.next_chunk = 0
# Capture the returned tool branch and update the current_tool_branch reference
thinking_branch = self.formatter.handle_llm_call_started(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
)
# Update the formatter's current_tool_branch to ensure proper cleanup
if thinking_branch is not None:
self.formatter.current_tool_branch = thinking_branch
@crewai_event_bus.on(LLMCallCompletedEvent)
def on_llm_call_completed(_: Any, event: LLMCallCompletedEvent) -> None:
self.formatter.handle_llm_stream_completed()
self.formatter.handle_llm_call_completed(
self.formatter.current_tool_branch,
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
)
@crewai_event_bus.on(LLMCallFailedEvent)
def on_llm_call_failed(_: Any, event: LLMCallFailedEvent) -> None:
self.formatter.handle_llm_stream_completed()
self.formatter.handle_llm_call_failed(
self.formatter.current_tool_branch,
event.error,
self.formatter.current_crew_tree,
)
self.formatter.handle_llm_call_failed(event.error)
@crewai_event_bus.on(LLMStreamChunkEvent)
def on_llm_stream_chunk(_: Any, event: LLMStreamChunkEvent) -> None:
@@ -473,9 +390,7 @@ class EventListener(BaseEventListener):
accumulated_text = self.text_stream.getvalue()
self.formatter.handle_llm_stream_chunk(
event.chunk,
accumulated_text,
self.formatter.current_crew_tree,
event.call_type,
)
@@ -515,7 +430,6 @@ class EventListener(BaseEventListener):
@crewai_event_bus.on(CrewTestCompletedEvent)
def on_crew_test_completed(_: Any, event: CrewTestCompletedEvent) -> None:
self.formatter.handle_crew_test_completed(
self.formatter.current_flow_tree,
event.crew_name or "Crew",
)
@@ -532,10 +446,7 @@ class EventListener(BaseEventListener):
self.knowledge_retrieval_in_progress = True
self.formatter.handle_knowledge_retrieval_started(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
)
self.formatter.handle_knowledge_retrieval_started()
@crewai_event_bus.on(KnowledgeRetrievalCompletedEvent)
def on_knowledge_retrieval_completed(
@@ -546,24 +457,13 @@ class EventListener(BaseEventListener):
self.knowledge_retrieval_in_progress = False
self.formatter.handle_knowledge_retrieval_completed(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
event.retrieved_knowledge,
event.query,
)
@crewai_event_bus.on(KnowledgeQueryStartedEvent)
def on_knowledge_query_started(
_: Any, event: KnowledgeQueryStartedEvent
) -> None:
pass
@crewai_event_bus.on(KnowledgeQueryFailedEvent)
def on_knowledge_query_failed(_: Any, event: KnowledgeQueryFailedEvent) -> None:
self.formatter.handle_knowledge_query_failed(
self.formatter.current_agent_branch,
event.error,
self.formatter.current_crew_tree,
)
self.formatter.handle_knowledge_query_failed(event.error)
@crewai_event_bus.on(KnowledgeQueryCompletedEvent)
def on_knowledge_query_completed(
@@ -575,11 +475,7 @@ class EventListener(BaseEventListener):
def on_knowledge_search_query_failed(
_: Any, event: KnowledgeSearchQueryFailedEvent
) -> None:
self.formatter.handle_knowledge_search_query_failed(
self.formatter.current_agent_branch,
event.error,
self.formatter.current_crew_tree,
)
self.formatter.handle_knowledge_search_query_failed(event.error)
# ----------- REASONING EVENTS -----------
@@ -587,11 +483,7 @@ class EventListener(BaseEventListener):
def on_agent_reasoning_started(
_: Any, event: AgentReasoningStartedEvent
) -> None:
self.formatter.handle_reasoning_started(
self.formatter.current_agent_branch,
event.attempt,
self.formatter.current_crew_tree,
)
self.formatter.handle_reasoning_started(event.attempt)
@crewai_event_bus.on(AgentReasoningCompletedEvent)
def on_agent_reasoning_completed(
@@ -600,14 +492,12 @@ class EventListener(BaseEventListener):
self.formatter.handle_reasoning_completed(
event.plan,
event.ready,
self.formatter.current_crew_tree,
)
@crewai_event_bus.on(AgentReasoningFailedEvent)
def on_agent_reasoning_failed(_: Any, event: AgentReasoningFailedEvent) -> None:
self.formatter.handle_reasoning_failed(
event.error,
self.formatter.current_crew_tree,
)
# ----------- AGENT LOGGING EVENTS -----------
@@ -734,18 +624,6 @@ class EventListener(BaseEventListener):
event.tool_args,
)
@crewai_event_bus.on(MCPToolExecutionCompletedEvent)
def on_mcp_tool_execution_completed(
_: Any, event: MCPToolExecutionCompletedEvent
) -> None:
self.formatter.handle_mcp_tool_execution_completed(
event.server_name,
event.tool_name,
event.tool_args,
event.result,
event.execution_duration_ms,
)
@crewai_event_bus.on(MCPToolExecutionFailedEvent)
def on_mcp_tool_execution_failed(
_: Any, event: MCPToolExecutionFailedEvent

View File

@@ -1,7 +1,7 @@
"""Trace collection listener for orchestrating trace collection."""
import os
from typing import Any, ClassVar
from typing import Any, ClassVar, cast
import uuid
from typing_extensions import Self
@@ -105,7 +105,7 @@ class TraceCollectionListener(BaseEventListener):
"""Create or return singleton instance."""
if cls._instance is None:
cls._instance = super().__new__(cls)
return cls._instance
return cast(Self, cls._instance)
def __init__(
self,
@@ -319,21 +319,12 @@ class TraceCollectionListener(BaseEventListener):
source: Any, event: MemoryQueryCompletedEvent
) -> None:
self._handle_action_event("memory_query_completed", source, event)
if self.formatter and self.memory_retrieval_in_progress:
self.formatter.handle_memory_query_completed(
self.formatter.current_agent_branch,
event.source_type or "memory",
event.query_time_ms,
self.formatter.current_crew_tree,
)
@event_bus.on(MemoryQueryFailedEvent)
def on_memory_query_failed(source: Any, event: MemoryQueryFailedEvent) -> None:
self._handle_action_event("memory_query_failed", source, event)
if self.formatter and self.memory_retrieval_in_progress:
self.formatter.handle_memory_query_failed(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
event.error,
event.source_type or "memory",
)
@@ -347,10 +338,7 @@ class TraceCollectionListener(BaseEventListener):
self.memory_save_in_progress = True
self.formatter.handle_memory_save_started(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
)
self.formatter.handle_memory_save_started()
@event_bus.on(MemorySaveCompletedEvent)
def on_memory_save_completed(
@@ -364,8 +352,6 @@ class TraceCollectionListener(BaseEventListener):
self.memory_save_in_progress = False
self.formatter.handle_memory_save_completed(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
event.save_time_ms,
event.source_type or "memory",
)
@@ -375,10 +361,8 @@ class TraceCollectionListener(BaseEventListener):
self._handle_action_event("memory_save_failed", source, event)
if self.formatter and self.memory_save_in_progress:
self.formatter.handle_memory_save_failed(
self.formatter.current_agent_branch,
event.error,
event.source_type or "memory",
self.formatter.current_crew_tree,
)
@event_bus.on(MemoryRetrievalStartedEvent)
@@ -391,10 +375,7 @@ class TraceCollectionListener(BaseEventListener):
self.memory_retrieval_in_progress = True
self.formatter.handle_memory_retrieval_started(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
)
self.formatter.handle_memory_retrieval_started()
@event_bus.on(MemoryRetrievalCompletedEvent)
def on_memory_retrieval_completed(
@@ -406,8 +387,6 @@ class TraceCollectionListener(BaseEventListener):
self.memory_retrieval_in_progress = False
self.formatter.handle_memory_retrieval_completed(
self.formatter.current_agent_branch,
self.formatter.current_crew_tree,
event.memory_content,
event.retrieval_time_ms,
)

File diff suppressed because it is too large

View File

@@ -1,3 +1,4 @@
from crewai.experimental.crew_agent_executor_flow import CrewAgentExecutorFlow
from crewai.experimental.evaluation import (
AgentEvaluationResult,
AgentEvaluator,
@@ -23,6 +24,7 @@ __all__ = [
"AgentEvaluationResult",
"AgentEvaluator",
"BaseEvaluator",
"CrewAgentExecutorFlow",
"EvaluationScore",
"EvaluationTraceCallback",
"ExperimentResult",

View File

@@ -0,0 +1,808 @@
from __future__ import annotations
from collections.abc import Callable
import threading
from typing import TYPE_CHECKING, Any, Literal, cast
from uuid import uuid4
from pydantic import BaseModel, Field, GetCoreSchemaHandler
from pydantic_core import CoreSchema, core_schema
from rich.console import Console
from rich.text import Text
from crewai.agents.agent_builder.base_agent_executor_mixin import CrewAgentExecutorMixin
from crewai.agents.parser import (
AgentAction,
AgentFinish,
OutputParserError,
)
from crewai.events.event_bus import crewai_event_bus
from crewai.events.types.logging_events import (
AgentLogsExecutionEvent,
AgentLogsStartedEvent,
)
from crewai.flow.flow import Flow, listen, or_, router, start
from crewai.hooks.llm_hooks import (
get_after_llm_call_hooks,
get_before_llm_call_hooks,
)
from crewai.utilities.agent_utils import (
enforce_rpm_limit,
format_message_for_llm,
get_llm_response,
handle_agent_action_core,
handle_context_length,
handle_max_iterations_exceeded,
handle_output_parser_exception,
handle_unknown_error,
has_reached_max_iterations,
is_context_length_exceeded,
process_llm_response,
)
from crewai.utilities.constants import TRAINING_DATA_FILE
from crewai.utilities.i18n import I18N, get_i18n
from crewai.utilities.printer import Printer
from crewai.utilities.tool_utils import execute_tool_and_check_finality
from crewai.utilities.training_handler import CrewTrainingHandler
from crewai.utilities.types import LLMMessage
if TYPE_CHECKING:
from crewai.agent import Agent
from crewai.agents.tools_handler import ToolsHandler
from crewai.crew import Crew
from crewai.llms.base_llm import BaseLLM
from crewai.task import Task
from crewai.tools.base_tool import BaseTool
from crewai.tools.structured_tool import CrewStructuredTool
from crewai.tools.tool_types import ToolResult
from crewai.utilities.prompts import StandardPromptResult, SystemPromptResult
class AgentReActState(BaseModel):
"""Structured state for agent ReAct flow execution.
Replaces scattered instance variables with validated immutable state.
Maps to: self.messages, self.iterations, formatted_answer in current executor.
"""
messages: list[LLMMessage] = Field(default_factory=list)
iterations: int = Field(default=0)
current_answer: AgentAction | AgentFinish | None = Field(default=None)
is_finished: bool = Field(default=False)
ask_for_human_input: bool = Field(default=False)
class CrewAgentExecutorFlow(Flow[AgentReActState], CrewAgentExecutorMixin):
"""Flow-based executor matching CrewAgentExecutor interface.
Inherits from:
- Flow[AgentReActState]: Provides flow orchestration capabilities
- CrewAgentExecutorMixin: Provides memory methods (short/long/external term)
Note: Multiple instances may be created during agent initialization
(cache setup, RPM controller setup, etc.) but only the final instance
should execute tasks via invoke().
"""
def __init__(
self,
llm: BaseLLM,
task: Task,
crew: Crew,
agent: Agent,
prompt: SystemPromptResult | StandardPromptResult,
max_iter: int,
tools: list[CrewStructuredTool],
tools_names: str,
stop_words: list[str],
tools_description: str,
tools_handler: ToolsHandler,
step_callback: Any = None,
original_tools: list[BaseTool] | None = None,
function_calling_llm: BaseLLM | Any | None = None,
respect_context_window: bool = False,
request_within_rpm_limit: Callable[[], bool] | None = None,
callbacks: list[Any] | None = None,
response_model: type[BaseModel] | None = None,
i18n: I18N | None = None,
) -> None:
"""Initialize the flow-based agent executor.
Args:
llm: Language model instance.
task: Task to execute.
crew: Crew instance.
agent: Agent to execute.
prompt: Prompt templates.
max_iter: Maximum iterations.
tools: Available tools.
tools_names: Tool names string.
stop_words: Stop word list.
tools_description: Tool descriptions.
tools_handler: Tool handler instance.
step_callback: Optional step callback.
original_tools: Original tool list.
function_calling_llm: Optional function calling LLM.
respect_context_window: Respect context limits.
request_within_rpm_limit: RPM limit check function.
callbacks: Optional callbacks list.
response_model: Optional Pydantic model for structured outputs.
"""
self._i18n: I18N = i18n or get_i18n()
self.llm = llm
self.task = task
self.agent = agent
self.crew = crew
self.prompt = prompt
self.tools = tools
self.tools_names = tools_names
self.stop = stop_words
self.max_iter = max_iter
self.callbacks = callbacks or []
self._printer: Printer = Printer()
self.tools_handler = tools_handler
self.original_tools = original_tools or []
self.step_callback = step_callback
self.tools_description = tools_description
self.function_calling_llm = function_calling_llm
self.respect_context_window = respect_context_window
self.request_within_rpm_limit = request_within_rpm_limit
self.response_model = response_model
self.log_error_after = 3
self._console: Console = Console()
# Error context storage for recovery
self._last_parser_error: OutputParserError | None = None
self._last_context_error: Exception | None = None
# Execution guard to prevent concurrent/duplicate executions
self._execution_lock = threading.Lock()
self._is_executing: bool = False
self._has_been_invoked: bool = False
self._flow_initialized: bool = False
self._instance_id = str(uuid4())[:8]
self.before_llm_call_hooks: list[Callable] = []
self.after_llm_call_hooks: list[Callable] = []
self.before_llm_call_hooks.extend(get_before_llm_call_hooks())
self.after_llm_call_hooks.extend(get_after_llm_call_hooks())
if self.llm:
existing_stop = getattr(self.llm, "stop", [])
self.llm.stop = list(
set(
existing_stop + self.stop
if isinstance(existing_stop, list)
else self.stop
)
)
self._state = AgentReActState()
def _ensure_flow_initialized(self) -> None:
"""Ensure Flow.__init__() has been called.
This is deferred from __init__ to prevent FlowCreatedEvent emission
during agent setup when multiple executor instances are created.
Only the instance that actually executes via invoke() will emit events.
"""
if not self._flow_initialized:
# Now call Flow's __init__ which will replace self._state
# with Flow's managed state. Suppress flow events since this is
# an agent executor, not a user-facing flow.
super().__init__(
suppress_flow_events=True,
)
self._flow_initialized = True
@property
def use_stop_words(self) -> bool:
"""Check to determine if stop words are being used.
Returns:
bool: True if stop words should be used.
"""
return self.llm.supports_stop_words() if self.llm else False
@property
def state(self) -> AgentReActState:
"""Get state - returns temporary state if Flow not yet initialized.
Flow initialization is deferred to prevent event emission during agent setup.
Returns the temporary state until invoke() is called.
"""
return self._state
@property
def messages(self) -> list[LLMMessage]:
"""Compatibility property for mixin - returns state messages."""
return self._state.messages
@property
def iterations(self) -> int:
"""Compatibility property for mixin - returns state iterations."""
return self._state.iterations
@start()
def initialize_reasoning(self) -> Literal["initialized"]:
"""Initialize the reasoning flow and emit agent start logs."""
self._show_start_logs()
return "initialized"
@listen("force_final_answer")
def force_final_answer(self) -> Literal["agent_finished"]:
"""Force agent to provide final answer when max iterations exceeded."""
formatted_answer = handle_max_iterations_exceeded(
formatted_answer=None,
printer=self._printer,
i18n=self._i18n,
messages=list(self.state.messages),
llm=self.llm,
callbacks=self.callbacks,
)
self.state.current_answer = formatted_answer
self.state.is_finished = True
return "agent_finished"
@listen("continue_reasoning")
def call_llm_and_parse(self) -> Literal["parsed", "parser_error", "context_error"]:
"""Execute LLM call with hooks and parse the response.
Returns routing decision based on parsing result.
"""
try:
enforce_rpm_limit(self.request_within_rpm_limit)
answer = get_llm_response(
llm=self.llm,
messages=list(self.state.messages),
callbacks=self.callbacks,
printer=self._printer,
from_task=self.task,
from_agent=self.agent,
response_model=self.response_model,
executor_context=self,
)
# Parse the LLM response
formatted_answer = process_llm_response(answer, self.use_stop_words)
self.state.current_answer = formatted_answer
if "Final Answer:" in answer and isinstance(formatted_answer, AgentAction):
warning_text = Text()
warning_text.append("⚠️ ", style="yellow bold")
warning_text.append(
f"LLM returned 'Final Answer:' but parsed as AgentAction (tool: {formatted_answer.tool})",
style="yellow",
)
self._console.print(warning_text)
preview_text = Text()
preview_text.append("Answer preview: ", style="yellow")
preview_text.append(f"{answer[:200]}...", style="yellow dim")
self._console.print(preview_text)
return "parsed"
except OutputParserError as e:
# Store error context for recovery
self._last_parser_error = e or OutputParserError(
error="Unknown parser error"
)
return "parser_error"
except Exception as e:
if is_context_length_exceeded(e):
self._last_context_error = e
return "context_error"
if e.__class__.__module__.startswith("litellm"):
raise e
handle_unknown_error(self._printer, e)
raise
@router(call_llm_and_parse)
def route_by_answer_type(self) -> Literal["execute_tool", "agent_finished"]:
"""Route based on whether answer is AgentAction or AgentFinish."""
if isinstance(self.state.current_answer, AgentAction):
return "execute_tool"
return "agent_finished"
@listen("execute_tool")
def execute_tool_action(self) -> Literal["tool_completed", "tool_result_is_final"]:
"""Execute the tool action and handle the result."""
try:
action = cast(AgentAction, self.state.current_answer)
# Extract fingerprint context for tool execution
fingerprint_context = {}
if (
self.agent
and hasattr(self.agent, "security_config")
and hasattr(self.agent.security_config, "fingerprint")
):
fingerprint_context = {
"agent_fingerprint": str(self.agent.security_config.fingerprint)
}
# Execute the tool
tool_result = execute_tool_and_check_finality(
agent_action=action,
fingerprint_context=fingerprint_context,
tools=self.tools,
i18n=self._i18n,
agent_key=self.agent.key if self.agent else None,
agent_role=self.agent.role if self.agent else None,
tools_handler=self.tools_handler,
task=self.task,
agent=self.agent,
function_calling_llm=self.function_calling_llm,
crew=self.crew,
)
# Handle agent action and append observation to messages
result = self._handle_agent_action(action, tool_result)
self.state.current_answer = result
# Invoke step callback if configured
self._invoke_step_callback(result)
# Append result message to conversation state
if hasattr(result, "text"):
self._append_message_to_state(result.text)
# Check if tool result became a final answer (result_as_answer flag)
if isinstance(result, AgentFinish):
self.state.is_finished = True
return "tool_result_is_final"
return "tool_completed"
except Exception as e:
error_text = Text()
error_text.append("❌ Error in tool execution: ", style="red bold")
error_text.append(str(e), style="red")
self._console.print(error_text)
raise
@listen("initialized")
def continue_iteration(self) -> Literal["check_iteration"]:
"""Bridge listener that connects iteration loop back to iteration check."""
return "check_iteration"
@router(or_(initialize_reasoning, continue_iteration))
def check_max_iterations(
self,
) -> Literal["force_final_answer", "continue_reasoning"]:
"""Check if max iterations reached before proceeding with reasoning."""
if has_reached_max_iterations(self.state.iterations, self.max_iter):
return "force_final_answer"
return "continue_reasoning"
@router(execute_tool_action)
def increment_and_continue(self) -> Literal["initialized"]:
"""Increment iteration counter and loop back for next iteration."""
self.state.iterations += 1
return "initialized"
@listen(or_("agent_finished", "tool_result_is_final"))
def finalize(self) -> Literal["completed", "skipped"]:
"""Finalize execution and emit completion logs."""
if self.state.current_answer is None:
skip_text = Text()
skip_text.append("⚠️ ", style="yellow bold")
skip_text.append(
"Finalize called but no answer in state - skipping", style="yellow"
)
self._console.print(skip_text)
return "skipped"
if not isinstance(self.state.current_answer, AgentFinish):
skip_text = Text()
skip_text.append("⚠️ ", style="yellow bold")
skip_text.append(
f"Finalize called with {type(self.state.current_answer).__name__} instead of AgentFinish - skipping",
style="yellow",
)
self._console.print(skip_text)
return "skipped"
self.state.is_finished = True
self._show_logs(self.state.current_answer)
return "completed"
@listen("parser_error")
def recover_from_parser_error(self) -> Literal["initialized"]:
"""Recover from output parser errors and retry."""
formatted_answer = handle_output_parser_exception(
e=self._last_parser_error,
messages=list(self.state.messages),
iterations=self.state.iterations,
log_error_after=self.log_error_after,
printer=self._printer,
)
if formatted_answer:
self.state.current_answer = formatted_answer
self.state.iterations += 1
return "initialized"
@listen("context_error")
def recover_from_context_length(self) -> Literal["initialized"]:
"""Recover from context length errors and retry."""
handle_context_length(
respect_context_window=self.respect_context_window,
printer=self._printer,
messages=self.state.messages,
llm=self.llm,
callbacks=self.callbacks,
i18n=self._i18n,
)
self.state.iterations += 1
return "initialized"
def invoke(self, inputs: dict[str, Any]) -> dict[str, Any]:
"""Execute agent with given inputs.
Args:
inputs: Input dictionary containing prompt variables.
Returns:
Dictionary with agent output.
"""
self._ensure_flow_initialized()
with self._execution_lock:
if self._is_executing:
raise RuntimeError(
"Executor is already running. "
"Cannot invoke the same executor instance concurrently."
)
self._is_executing = True
self._has_been_invoked = True
try:
# Reset state for fresh execution
self.state.messages.clear()
self.state.iterations = 0
self.state.current_answer = None
self.state.is_finished = False
if "system" in self.prompt:
prompt = cast("SystemPromptResult", self.prompt)
system_prompt = self._format_prompt(prompt["system"], inputs)
user_prompt = self._format_prompt(prompt["user"], inputs)
self.state.messages.append(
format_message_for_llm(system_prompt, role="system")
)
self.state.messages.append(format_message_for_llm(user_prompt))
else:
user_prompt = self._format_prompt(self.prompt["prompt"], inputs)
self.state.messages.append(format_message_for_llm(user_prompt))
self.state.ask_for_human_input = bool(
inputs.get("ask_for_human_input", False)
)
self.kickoff()
formatted_answer = self.state.current_answer
if not isinstance(formatted_answer, AgentFinish):
raise RuntimeError(
"Agent execution ended without reaching a final answer."
)
if self.state.ask_for_human_input:
formatted_answer = self._handle_human_feedback(formatted_answer)
self._create_short_term_memory(formatted_answer)
self._create_long_term_memory(formatted_answer)
self._create_external_memory(formatted_answer)
return {"output": formatted_answer.output}
except AssertionError:
fail_text = Text()
fail_text.append("", style="red bold")
fail_text.append(
"Agent failed to reach a final answer. This is likely a bug - please report it.",
style="red",
)
self._console.print(fail_text)
raise
except Exception as e:
handle_unknown_error(self._printer, e)
raise
finally:
self._is_executing = False
def _handle_agent_action(
self, formatted_answer: AgentAction, tool_result: ToolResult
) -> AgentAction | AgentFinish:
"""Process agent action and tool execution result.
Args:
formatted_answer: Agent's action to execute.
tool_result: Result from tool execution.
Returns:
Updated action or final answer.
"""
add_image_tool = self._i18n.tools("add_image")
if (
isinstance(add_image_tool, dict)
and formatted_answer.tool.casefold().strip()
== add_image_tool.get("name", "").casefold().strip()
):
self.state.messages.append(
{"role": "assistant", "content": tool_result.result}
)
return formatted_answer
return handle_agent_action_core(
formatted_answer=formatted_answer,
tool_result=tool_result,
messages=self.state.messages,
step_callback=self.step_callback,
show_logs=self._show_logs,
)
def _invoke_step_callback(
self, formatted_answer: AgentAction | AgentFinish
) -> None:
"""Invoke step callback if configured.
Args:
formatted_answer: Current agent response.
"""
if self.step_callback:
self.step_callback(formatted_answer)
def _append_message_to_state(
self, text: str, role: Literal["user", "assistant", "system"] = "assistant"
) -> None:
"""Add message to state conversation history.
Args:
text: Message content.
role: Message role (default: assistant).
"""
self.state.messages.append(format_message_for_llm(text, role=role))
def _show_start_logs(self) -> None:
"""Emit agent start event."""
if self.agent is None:
raise ValueError("Agent cannot be None")
crewai_event_bus.emit(
self.agent,
AgentLogsStartedEvent(
agent_role=self.agent.role,
task_description=(self.task.description if self.task else "Not Found"),
verbose=self.agent.verbose
or (hasattr(self, "crew") and getattr(self.crew, "verbose", False)),
),
)
def _show_logs(self, formatted_answer: AgentAction | AgentFinish) -> None:
"""Emit agent execution event.
Args:
formatted_answer: Agent's response to log.
"""
if self.agent is None:
raise ValueError("Agent cannot be None")
crewai_event_bus.emit(
self.agent,
AgentLogsExecutionEvent(
agent_role=self.agent.role,
formatted_answer=formatted_answer,
verbose=self.agent.verbose
or (hasattr(self, "crew") and getattr(self.crew, "verbose", False)),
),
)
def _handle_crew_training_output(
self, result: AgentFinish, human_feedback: str | None = None
) -> None:
"""Save training data for crew training mode.
Args:
result: Agent's final output.
human_feedback: Optional feedback from human.
"""
agent_id = str(self.agent.id)
train_iteration = (
getattr(self.crew, "_train_iteration", None) if self.crew else None
)
if train_iteration is None or not isinstance(train_iteration, int):
train_error = Text()
train_error.append("", style="red bold")
train_error.append(
"Invalid or missing train iteration. Cannot save training data.",
style="red",
)
self._console.print(train_error)
return
training_handler = CrewTrainingHandler(TRAINING_DATA_FILE)
training_data = training_handler.load() or {}
# Initialize or retrieve agent's training data
agent_training_data = training_data.get(agent_id, {})
if human_feedback is not None:
# Save initial output and human feedback
agent_training_data[train_iteration] = {
"initial_output": result.output,
"human_feedback": human_feedback,
}
else:
# Save improved output
if train_iteration in agent_training_data:
agent_training_data[train_iteration]["improved_output"] = result.output
else:
train_error = Text()
train_error.append("", style="red bold")
train_error.append(
f"No existing training data for agent {agent_id} and iteration "
f"{train_iteration}. Cannot save improved output.",
style="red",
)
self._console.print(train_error)
return
# Update the training data and save
training_data[agent_id] = agent_training_data
training_handler.save(training_data)
@staticmethod
def _format_prompt(prompt: str, inputs: dict[str, str]) -> str:
"""Format prompt template with input values.
Args:
prompt: Template string.
inputs: Values to substitute.
Returns:
Formatted prompt.
"""
prompt = prompt.replace("{input}", inputs["input"])
prompt = prompt.replace("{tool_names}", inputs["tool_names"])
return prompt.replace("{tools}", inputs["tools"])
def _handle_human_feedback(self, formatted_answer: AgentFinish) -> AgentFinish:
"""Process human feedback and refine answer.
Args:
formatted_answer: Initial agent result.
Returns:
Final answer after feedback.
"""
human_feedback = self._ask_human_input(formatted_answer.output)
if self._is_training_mode():
return self._handle_training_feedback(formatted_answer, human_feedback)
return self._handle_regular_feedback(formatted_answer, human_feedback)
def _is_training_mode(self) -> bool:
"""Check if training mode is active.
Returns:
True if in training mode.
"""
return bool(self.crew and self.crew._train)
def _handle_training_feedback(
self, initial_answer: AgentFinish, feedback: str
) -> AgentFinish:
"""Process training feedback and generate improved answer.
Args:
initial_answer: Initial agent output.
feedback: Training feedback.
Returns:
Improved answer.
"""
self._handle_crew_training_output(initial_answer, feedback)
self.state.messages.append(
format_message_for_llm(
self._i18n.slice("feedback_instructions").format(feedback=feedback)
)
)
# Re-run flow for improved answer
self.state.iterations = 0
self.state.is_finished = False
self.state.current_answer = None
self.kickoff()
# Get improved answer from state
improved_answer = self.state.current_answer
if not isinstance(improved_answer, AgentFinish):
raise RuntimeError(
"Training feedback iteration did not produce final answer"
)
self._handle_crew_training_output(improved_answer)
self.state.ask_for_human_input = False
return improved_answer
def _handle_regular_feedback(
self, current_answer: AgentFinish, initial_feedback: str
) -> AgentFinish:
"""Process regular feedback iteratively until user is satisfied.
Args:
current_answer: Current agent output.
initial_feedback: Initial user feedback.
Returns:
Final answer after iterations.
"""
feedback = initial_feedback
answer = current_answer
while self.state.ask_for_human_input:
if feedback.strip() == "":
self.state.ask_for_human_input = False
else:
answer = self._process_feedback_iteration(feedback)
feedback = self._ask_human_input(answer.output)
return answer
def _process_feedback_iteration(self, feedback: str) -> AgentFinish:
"""Process a single feedback iteration and generate updated response.
Args:
feedback: User feedback.
Returns:
Updated agent response.
"""
self.state.messages.append(
format_message_for_llm(
self._i18n.slice("feedback_instructions").format(feedback=feedback)
)
)
# Re-run flow
self.state.iterations = 0
self.state.is_finished = False
self.state.current_answer = None
self.kickoff()
# Get answer from state
answer = self.state.current_answer
if not isinstance(answer, AgentFinish):
raise RuntimeError("Feedback iteration did not produce final answer")
return answer
@classmethod
def __get_pydantic_core_schema__(
cls, _source_type: Any, _handler: GetCoreSchemaHandler
) -> CoreSchema:
"""Generate Pydantic core schema for Protocol compatibility.
Allows the executor to be used in Pydantic models without
requiring arbitrary_types_allowed=True.
"""
return core_schema.any_schema()

View File

@@ -1,8 +1,9 @@
from __future__ import annotations
from collections.abc import Sequence
import threading
from typing import Any
from typing import TYPE_CHECKING, Any
from crewai.agent.core import Agent
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.events.event_bus import crewai_event_bus
from crewai.events.types.agent_events import (
@@ -28,6 +29,10 @@ from crewai.experimental.evaluation.evaluation_listener import (
from crewai.task import Task
if TYPE_CHECKING:
from crewai.agent import Agent
class ExecutionState:
current_agent_id: str | None = None
current_task_id: str | None = None

View File

@@ -1,17 +1,22 @@
from __future__ import annotations
import abc
import enum
from enum import Enum
from typing import Any
from typing import TYPE_CHECKING, Any
from pydantic import BaseModel, Field
from crewai.agent import Agent
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.llm import BaseLLM
from crewai.task import Task
from crewai.utilities.llm_utils import create_llm
if TYPE_CHECKING:
from crewai.agent import Agent
class MetricCategory(enum.Enum):
GOAL_ALIGNMENT = "goal_alignment"
SEMANTIC_QUALITY = "semantic_quality"

View File

@@ -1,8 +1,9 @@
from __future__ import annotations
from collections import defaultdict
from hashlib import md5
from typing import Any
from typing import TYPE_CHECKING, Any
from crewai import Agent, Crew
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.experimental.evaluation import AgentEvaluator, create_default_evaluator
from crewai.experimental.evaluation.evaluation_display import (
@@ -17,6 +18,11 @@ from crewai.experimental.evaluation.experiment.result_display import (
)
if TYPE_CHECKING:
from crewai.agent import Agent
from crewai.crew import Crew
class ExperimentRunner:
def __init__(self, dataset: list[dict[str, Any]]):
self.dataset = dataset or []

View File

@@ -1,6 +1,7 @@
from typing import Any
from __future__ import annotations
from typing import TYPE_CHECKING, Any
from crewai.agent import Agent
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.experimental.evaluation.base_evaluator import (
BaseEvaluator,
@@ -12,6 +13,10 @@ from crewai.task import Task
from crewai.utilities.types import LLMMessage
if TYPE_CHECKING:
from crewai.agent import Agent
class GoalAlignmentEvaluator(BaseEvaluator):
@property
def metric_category(self) -> MetricCategory:

View File

@@ -6,15 +6,16 @@ This module provides evaluator implementations for:
- Thinking-to-action ratio
"""
from __future__ import annotations
from collections.abc import Sequence
from enum import Enum
import logging
import re
from typing import Any
from typing import TYPE_CHECKING, Any
import numpy as np
from crewai.agent import Agent
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.experimental.evaluation.base_evaluator import (
BaseEvaluator,
@@ -27,6 +28,10 @@ from crewai.tasks.task_output import TaskOutput
from crewai.utilities.types import LLMMessage
if TYPE_CHECKING:
from crewai.agent import Agent
class ReasoningPatternType(Enum):
EFFICIENT = "efficient" # Good reasoning flow
LOOP = "loop" # Agent is stuck in a loop

View File

@@ -1,6 +1,7 @@
from typing import Any
from __future__ import annotations
from typing import TYPE_CHECKING, Any
from crewai.agent import Agent
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.experimental.evaluation.base_evaluator import (
BaseEvaluator,
@@ -12,6 +13,10 @@ from crewai.task import Task
from crewai.utilities.types import LLMMessage
if TYPE_CHECKING:
from crewai.agent import Agent
class SemanticQualityEvaluator(BaseEvaluator):
@property
def metric_category(self) -> MetricCategory:

View File

@@ -1,7 +1,8 @@
import json
from typing import Any
from __future__ import annotations
import json
from typing import TYPE_CHECKING, Any
from crewai.agent import Agent
from crewai.agents.agent_builder.base_agent import BaseAgent
from crewai.experimental.evaluation.base_evaluator import (
BaseEvaluator,
@@ -13,6 +14,10 @@ from crewai.task import Task
from crewai.utilities.types import LLMMessage
if TYPE_CHECKING:
from crewai.agent import Agent
class ToolSelectionEvaluator(BaseEvaluator):
@property
def metric_category(self) -> MetricCategory:
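
Each of the evaluator hunks above is the same refactor: the eager `from crewai.agent import Agent` import moves behind a `TYPE_CHECKING` guard (with `from __future__ import annotations` keeping annotations lazy), so the annotation-only dependency can no longer create an import cycle at module load time. A minimal sketch of the pattern — `ExampleEvaluator` is illustrative, not a class from this diff:

```python
from __future__ import annotations  # annotations become lazy strings

from typing import TYPE_CHECKING, Any

if TYPE_CHECKING:
    # Only type checkers execute this import; at runtime it is skipped,
    # so crewai.agent never has to be importable while this module loads.
    from crewai.agent import Agent


class ExampleEvaluator:
    def evaluate(self, agent: Agent, output: Any) -> float:
        """Annotations referencing Agent resolve only during type checking."""
        raise NotImplementedError
```

The `ExperimentRunner` hunk applies the same guard to `Crew` as well.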

View File

@@ -459,7 +459,10 @@ class FlowMeta(type):
):
routers.add(attr_name)
# Get router paths from the decorator attribute
if hasattr(attr_value, "__router_paths__") and attr_value.__router_paths__:
if (
hasattr(attr_value, "__router_paths__")
and attr_value.__router_paths__
):
router_paths[attr_name] = attr_value.__router_paths__
else:
possible_returns = get_possible_return_constants(attr_value)
@@ -501,6 +504,7 @@ class Flow(Generic[T], metaclass=FlowMeta):
self,
persistence: FlowPersistence | None = None,
tracing: bool | None = None,
suppress_flow_events: bool = False,
**kwargs: Any,
) -> None:
"""Initialize a new Flow instance.
@@ -508,6 +512,7 @@ class Flow(Generic[T], metaclass=FlowMeta):
Args:
persistence: Optional persistence backend for storing flow states
tracing: Whether to enable tracing. True=always enable, False=always disable, None=check environment/user settings
suppress_flow_events: Whether to suppress flow event emissions (internal use)
**kwargs: Additional state values to initialize or override
"""
# Initialize basic instance attributes
@@ -526,6 +531,7 @@ class Flow(Generic[T], metaclass=FlowMeta):
self.human_feedback_history: list[HumanFeedbackResult] = []
self.last_human_feedback: HumanFeedbackResult | None = None
self._pending_feedback_context: PendingFeedbackContext | None = None
self.suppress_flow_events: bool = suppress_flow_events
# Initialize state with initial values
self._state = self._create_initial_state()
@@ -539,13 +545,14 @@ class Flow(Generic[T], metaclass=FlowMeta):
if kwargs:
self._initialize_state(kwargs)
crewai_event_bus.emit(
self,
FlowCreatedEvent(
type="flow_created",
flow_name=self.name or self.__class__.__name__,
),
)
if not self.suppress_flow_events:
crewai_event_bus.emit(
self,
FlowCreatedEvent(
type="flow_created",
flow_name=self.name or self.__class__.__name__,
),
)
# Register all flow-related methods
for method_name in dir(self):
@@ -672,6 +679,7 @@ class Flow(Generic[T], metaclass=FlowMeta):
result = flow.resume(feedback)
return result
# In an async handler, use resume_async instead:
async def handle_feedback_async(flow_id: str, feedback: str):
flow = MyFlow.from_pending(flow_id)
@@ -1307,19 +1315,20 @@ class Flow(Generic[T], metaclass=FlowMeta):
self._initialize_state(filtered_inputs)
# Emit FlowStartedEvent and log the start of the flow.
future = crewai_event_bus.emit(
self,
FlowStartedEvent(
type="flow_started",
flow_name=self.name or self.__class__.__name__,
inputs=inputs,
),
)
if future:
self._event_futures.append(future)
self._log_flow_event(
f"Flow started with ID: {self.flow_id}", color="bold magenta"
)
if not self.suppress_flow_events:
future = crewai_event_bus.emit(
self,
FlowStartedEvent(
type="flow_started",
flow_name=self.name or self.__class__.__name__,
inputs=inputs,
),
)
if future:
self._event_futures.append(future)
self._log_flow_event(
f"Flow started with ID: {self.flow_id}", color="bold magenta"
)
if inputs is not None and "id" not in inputs:
self._initialize_state(inputs)
@@ -1391,17 +1400,18 @@ class Flow(Generic[T], metaclass=FlowMeta):
final_output = self._method_outputs[-1] if self._method_outputs else None
future = crewai_event_bus.emit(
self,
FlowFinishedEvent(
type="flow_finished",
flow_name=self.name or self.__class__.__name__,
result=final_output,
state=self._copy_and_serialize_state(),
),
)
if future:
self._event_futures.append(future)
if not self.suppress_flow_events:
future = crewai_event_bus.emit(
self,
FlowFinishedEvent(
type="flow_finished",
flow_name=self.name or self.__class__.__name__,
result=final_output,
state=self._copy_and_serialize_state(),
),
)
if future:
self._event_futures.append(future)
if self._event_futures:
await asyncio.gather(
@@ -1537,18 +1547,19 @@ class Flow(Generic[T], metaclass=FlowMeta):
kwargs or {}
)
future = crewai_event_bus.emit(
self,
MethodExecutionStartedEvent(
type="method_execution_started",
method_name=method_name,
flow_name=self.name or self.__class__.__name__,
params=dumped_params,
state=self._copy_and_serialize_state(),
),
)
if future:
self._event_futures.append(future)
if not self.suppress_flow_events:
future = crewai_event_bus.emit(
self,
MethodExecutionStartedEvent(
type="method_execution_started",
method_name=method_name,
flow_name=self.name or self.__class__.__name__,
params=dumped_params,
state=self._copy_and_serialize_state(),
),
)
if future:
self._event_futures.append(future)
result = (
await method(*args, **kwargs)
@@ -1563,41 +1574,32 @@ class Flow(Generic[T], metaclass=FlowMeta):
self._completed_methods.add(method_name)
future = crewai_event_bus.emit(
self,
MethodExecutionFinishedEvent(
type="method_execution_finished",
method_name=method_name,
flow_name=self.name or self.__class__.__name__,
state=self._copy_and_serialize_state(),
result=result,
),
)
if future:
self._event_futures.append(future)
return result
except Exception as e:
# Check if this is a HumanFeedbackPending exception (paused, not failed)
from crewai.flow.async_feedback.types import HumanFeedbackPending
if isinstance(e, HumanFeedbackPending):
# Emit paused event instead of failed
if not self.suppress_flow_events:
future = crewai_event_bus.emit(
self,
MethodExecutionPausedEvent(
type="method_execution_paused",
MethodExecutionFinishedEvent(
type="method_execution_finished",
method_name=method_name,
flow_name=self.name or self.__class__.__name__,
state=self._copy_and_serialize_state(),
flow_id=e.context.flow_id,
message=e.context.message,
emit=e.context.emit,
result=result,
),
)
if future:
self._event_futures.append(future)
raise e
return result
except Exception as e:
if not self.suppress_flow_events:
# Check if this is a HumanFeedbackPending exception (paused, not failed)
from crewai.flow.async_feedback.types import HumanFeedbackPending
if isinstance(e, HumanFeedbackPending):
# Auto-save pending feedback (create default persistence if needed)
if self._persistence is None:
from crewai.flow.persistence import SQLiteFlowPersistence
self._persistence = SQLiteFlowPersistence()
# Regular failure
future = crewai_event_bus.emit(
@@ -1644,7 +1646,9 @@ class Flow(Generic[T], metaclass=FlowMeta):
"""
# First, handle routers repeatedly until no router triggers anymore
router_results = []
router_result_to_feedback: dict[str, Any] = {} # Map outcome -> HumanFeedbackResult
router_result_to_feedback: dict[
str, Any
] = {} # Map outcome -> HumanFeedbackResult
current_trigger = trigger_method
current_result = result # Track the result to pass to each router
@@ -1963,7 +1967,9 @@ class Flow(Generic[T], metaclass=FlowMeta):
# Show message and prompt for feedback
formatter.console.print(message, style="yellow")
formatter.console.print("(Press Enter to skip, or type your feedback)\n", style="cyan")
formatter.console.print(
"(Press Enter to skip, or type your feedback)\n", style="cyan"
)
feedback = input("Your feedback: ").strip()
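
Taken together, the `flow.py` hunks gate every lifecycle emission — `FlowCreatedEvent`, `FlowStartedEvent`, the `MethodExecution*` events, and `FlowFinishedEvent` — behind the new `suppress_flow_events` flag. The docstring marks the flag as internal, so the following is a hedged illustration of the behavior rather than a recommended public API:

```python
from crewai.flow.flow import Flow, start


class QuietFlow(Flow):
    @start()
    def begin(self) -> str:
        return "done"


# suppress_flow_events=True skips FlowCreatedEvent here, and kickoff()
# below skips FlowStartedEvent / MethodExecution* / FlowFinishedEvent.
flow = QuietFlow(suppress_flow_events=True)
print(flow.kickoff())  # "done" — execution itself is unchanged
```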

View File

@@ -249,6 +249,7 @@ class ToolUsage:
"tool_args": self.action.tool_input,
"tool_class": self.action.tool,
"agent": self.agent,
"run_attempts": self._run_attempts,
}
if self.agent.fingerprint: # type: ignore
@@ -435,6 +436,7 @@ class ToolUsage:
"tool_args": self.action.tool_input,
"tool_class": self.action.tool,
"agent": self.agent,
"run_attempts": self._run_attempts,
}
# TODO: Investigate fingerprint attribute availability on BaseAgent/LiteAgent
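
Both `ToolUsage` hunks thread `self._run_attempts` into the payload dicts used to build tool-usage events, so observers can tell how many attempts a tool call consumed. A sketch of consuming it — the event module path and the `run_attempts` attribute on the emitted event are assumptions inferred from this diff and the import style shown above:

```python
from crewai.events.event_bus import crewai_event_bus
from crewai.events.types.tool_usage_events import ToolUsageFinishedEvent


@crewai_event_bus.on(ToolUsageFinishedEvent)
def log_retries(source, event) -> None:
    # run_attempts is the field newly added to the payload in this diff
    attempts = getattr(event, "run_attempts", None)
    if attempts and attempts > 1:
        print(f"{event.tool_name} succeeded after {attempts} attempts")
```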

View File

@@ -1178,6 +1178,7 @@ def test_system_and_prompt_template():
{{ .Response }}<|eot_id|>""",
)
agent.create_agent_executor()
expected_prompt = """<|start_header_id|>system<|end_header_id|>
@@ -1442,6 +1443,8 @@ def test_agent_max_retry_limit():
human_input=True,
)
agent.create_agent_executor(task=task)
error_message = "Error happening while sending prompt to model."
with patch.object(
CrewAgentExecutor, "invoke", wraps=agent.agent_executor.invoke
@@ -1503,9 +1506,8 @@ def test_agent_with_custom_stop_words():
)
assert isinstance(agent.llm, BaseLLM)
assert set(agent.llm.stop) == set([*stop_words, "\nObservation:"])
assert set(agent.llm.stop) == set(stop_words)
assert all(word in agent.llm.stop for word in stop_words)
assert "\nObservation:" in agent.llm.stop
def test_agent_with_callbacks():
@@ -1629,6 +1631,8 @@ def test_handle_context_length_exceeds_limit_cli_no():
)
task = Task(description="test task", agent=agent, expected_output="test output")
agent.create_agent_executor(task=task)
with patch.object(
CrewAgentExecutor, "invoke", wraps=agent.agent_executor.invoke
) as private_mock:
@@ -1679,8 +1683,8 @@ def test_agent_with_all_llm_attributes():
assert agent.llm.temperature == 0.7
assert agent.llm.top_p == 0.9
# assert agent.llm.n == 1
assert set(agent.llm.stop) == set(["STOP", "END", "\nObservation:"])
assert all(word in agent.llm.stop for word in ["STOP", "END", "\nObservation:"])
assert set(agent.llm.stop) == set(["STOP", "END"])
assert all(word in agent.llm.stop for word in ["STOP", "END"])
assert agent.llm.max_tokens == 100
assert agent.llm.presence_penalty == 0.1
assert agent.llm.frequency_penalty == 0.1
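
The loosened assertions stop pinning the exact contents of `agent.llm.stop`: user-supplied stop words must be present, while the ReAct sentinel `"\nObservation:"` is checked separately (or dropped from the expected set) instead of being baked into an exact-set comparison. A small sketch of the configuration under test — the model name is illustrative:

```python
from crewai import Agent
from crewai.llm import LLM

agent = Agent(
    role="Researcher",
    goal="Find facts",
    backstory="Thorough and concise.",
    llm=LLM(model="gpt-4o-mini", stop=["STOP", "END"]),
)

# User stop words survive; whether "\nObservation:" gets appended is an
# executor concern, so membership checks are more robust than equality.
assert all(word in agent.llm.stop for word in ["STOP", "END"])
```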

View File

@@ -0,0 +1,479 @@
"""Unit tests for CrewAgentExecutorFlow.
Tests the Flow-based agent executor implementation including state management,
flow methods, routing logic, and error handling.
"""
from unittest.mock import Mock, patch
import pytest
from crewai.experimental.crew_agent_executor_flow import (
AgentReActState,
CrewAgentExecutorFlow,
)
from crewai.agents.parser import AgentAction, AgentFinish
class TestAgentReActState:
"""Test AgentReActState Pydantic model."""
def test_state_initialization(self):
"""Test AgentReActState initialization with defaults."""
state = AgentReActState()
assert state.iterations == 0
assert state.messages == []
assert state.current_answer is None
assert state.is_finished is False
assert state.ask_for_human_input is False
def test_state_with_values(self):
"""Test AgentReActState initialization with values."""
messages = [{"role": "user", "content": "test"}]
state = AgentReActState(
messages=messages,
iterations=5,
current_answer=AgentFinish(thought="thinking", output="done", text="final"),
is_finished=True,
ask_for_human_input=True,
)
assert state.messages == messages
assert state.iterations == 5
assert isinstance(state.current_answer, AgentFinish)
assert state.is_finished is True
assert state.ask_for_human_input is True
class TestCrewAgentExecutorFlow:
"""Test CrewAgentExecutorFlow class."""
@pytest.fixture
def mock_dependencies(self):
"""Create mock dependencies for executor."""
llm = Mock()
llm.supports_stop_words.return_value = True
task = Mock()
task.description = "Test task"
task.human_input = False
task.response_model = None
crew = Mock()
crew.verbose = False
crew._train = False
agent = Mock()
agent.id = "test-agent-id"
agent.role = "Test Agent"
agent.verbose = False
agent.key = "test-key"
prompt = {"prompt": "Test prompt with {input}, {tool_names}, {tools}"}
tools = []
tools_handler = Mock()
return {
"llm": llm,
"task": task,
"crew": crew,
"agent": agent,
"prompt": prompt,
"max_iter": 10,
"tools": tools,
"tools_names": "",
"stop_words": ["Observation"],
"tools_description": "",
"tools_handler": tools_handler,
}
def test_executor_initialization(self, mock_dependencies):
"""Test CrewAgentExecutorFlow initialization."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
assert executor.llm == mock_dependencies["llm"]
assert executor.task == mock_dependencies["task"]
assert executor.agent == mock_dependencies["agent"]
assert executor.crew == mock_dependencies["crew"]
assert executor.max_iter == 10
assert executor.use_stop_words is True
def test_initialize_reasoning(self, mock_dependencies):
"""Test flow entry point."""
with patch.object(
CrewAgentExecutorFlow, "_show_start_logs"
) as mock_show_start:
executor = CrewAgentExecutorFlow(**mock_dependencies)
result = executor.initialize_reasoning()
assert result == "initialized"
mock_show_start.assert_called_once()
def test_check_max_iterations_not_reached(self, mock_dependencies):
"""Test routing when iterations < max."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.iterations = 5
result = executor.check_max_iterations()
assert result == "continue_reasoning"
def test_check_max_iterations_reached(self, mock_dependencies):
"""Test routing when iterations >= max."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.iterations = 10
result = executor.check_max_iterations()
assert result == "force_final_answer"
def test_route_by_answer_type_action(self, mock_dependencies):
"""Test routing for AgentAction."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.current_answer = AgentAction(
thought="thinking", tool="search", tool_input="query", text="action text"
)
result = executor.route_by_answer_type()
assert result == "execute_tool"
def test_route_by_answer_type_finish(self, mock_dependencies):
"""Test routing for AgentFinish."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.current_answer = AgentFinish(
thought="final thoughts", output="Final answer", text="complete"
)
result = executor.route_by_answer_type()
assert result == "agent_finished"
def test_continue_iteration(self, mock_dependencies):
"""Test iteration continuation."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
result = executor.continue_iteration()
assert result == "check_iteration"
def test_finalize_success(self, mock_dependencies):
"""Test finalize with valid AgentFinish."""
with patch.object(CrewAgentExecutorFlow, "_show_logs") as mock_show_logs:
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.current_answer = AgentFinish(
thought="final thinking", output="Done", text="complete"
)
result = executor.finalize()
assert result == "completed"
assert executor.state.is_finished is True
mock_show_logs.assert_called_once()
def test_finalize_failure(self, mock_dependencies):
"""Test finalize skips when given AgentAction instead of AgentFinish."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.current_answer = AgentAction(
thought="thinking", tool="search", tool_input="query", text="action text"
)
result = executor.finalize()
# Should return "skipped" and not set is_finished
assert result == "skipped"
assert executor.state.is_finished is False
def test_format_prompt(self, mock_dependencies):
"""Test prompt formatting."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
inputs = {"input": "test input", "tool_names": "tool1, tool2", "tools": "desc"}
result = executor._format_prompt("Prompt {input} {tool_names} {tools}", inputs)
assert "test input" in result
assert "tool1, tool2" in result
assert "desc" in result
def test_is_training_mode_false(self, mock_dependencies):
"""Test training mode detection when not in training."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
assert executor._is_training_mode() is False
def test_is_training_mode_true(self, mock_dependencies):
"""Test training mode detection when in training."""
mock_dependencies["crew"]._train = True
executor = CrewAgentExecutorFlow(**mock_dependencies)
assert executor._is_training_mode() is True
def test_append_message_to_state(self, mock_dependencies):
"""Test message appending to state."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
initial_count = len(executor.state.messages)
executor._append_message_to_state("test message")
assert len(executor.state.messages) == initial_count + 1
assert executor.state.messages[-1]["content"] == "test message"
def test_invoke_step_callback(self, mock_dependencies):
"""Test step callback invocation."""
callback = Mock()
mock_dependencies["step_callback"] = callback
executor = CrewAgentExecutorFlow(**mock_dependencies)
answer = AgentFinish(thought="thinking", output="test", text="final")
executor._invoke_step_callback(answer)
callback.assert_called_once_with(answer)
def test_invoke_step_callback_none(self, mock_dependencies):
"""Test step callback when none provided."""
mock_dependencies["step_callback"] = None
executor = CrewAgentExecutorFlow(**mock_dependencies)
# Should not raise error
executor._invoke_step_callback(
AgentFinish(thought="thinking", output="test", text="final")
)
@patch("crewai.experimental.crew_agent_executor_flow.handle_output_parser_exception")
def test_recover_from_parser_error(
self, mock_handle_exception, mock_dependencies
):
"""Test recovery from OutputParserError."""
from crewai.agents.parser import OutputParserError
mock_handle_exception.return_value = None
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor._last_parser_error = OutputParserError("test error")
initial_iterations = executor.state.iterations
result = executor.recover_from_parser_error()
assert result == "initialized"
assert executor.state.iterations == initial_iterations + 1
mock_handle_exception.assert_called_once()
@patch("crewai.experimental.crew_agent_executor_flow.handle_context_length")
def test_recover_from_context_length(
self, mock_handle_context, mock_dependencies
):
"""Test recovery from context length error."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor._last_context_error = Exception("context too long")
initial_iterations = executor.state.iterations
result = executor.recover_from_context_length()
assert result == "initialized"
assert executor.state.iterations == initial_iterations + 1
mock_handle_context.assert_called_once()
def test_use_stop_words_property(self, mock_dependencies):
"""Test use_stop_words property."""
mock_dependencies["llm"].supports_stop_words.return_value = True
executor = CrewAgentExecutorFlow(**mock_dependencies)
assert executor.use_stop_words is True
mock_dependencies["llm"].supports_stop_words.return_value = False
executor = CrewAgentExecutorFlow(**mock_dependencies)
assert executor.use_stop_words is False
def test_compatibility_properties(self, mock_dependencies):
"""Test compatibility properties for mixin."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.messages = [{"role": "user", "content": "test"}]
executor.state.iterations = 5
# Test that compatibility properties return state values
assert executor.messages == executor.state.messages
assert executor.iterations == executor.state.iterations
class TestFlowErrorHandling:
"""Test error handling in flow methods."""
@pytest.fixture
def mock_dependencies(self):
"""Create mock dependencies."""
llm = Mock()
llm.supports_stop_words.return_value = True
task = Mock()
task.description = "Test task"
crew = Mock()
agent = Mock()
agent.role = "Test Agent"
agent.verbose = False
prompt = {"prompt": "Test {input}"}
return {
"llm": llm,
"task": task,
"crew": crew,
"agent": agent,
"prompt": prompt,
"max_iter": 10,
"tools": [],
"tools_names": "",
"stop_words": [],
"tools_description": "",
"tools_handler": Mock(),
}
@patch("crewai.experimental.crew_agent_executor_flow.get_llm_response")
@patch("crewai.experimental.crew_agent_executor_flow.enforce_rpm_limit")
def test_call_llm_parser_error(
self, mock_enforce_rpm, mock_get_llm, mock_dependencies
):
"""Test call_llm_and_parse handles OutputParserError."""
from crewai.agents.parser import OutputParserError
mock_enforce_rpm.return_value = None
mock_get_llm.side_effect = OutputParserError("parse failed")
executor = CrewAgentExecutorFlow(**mock_dependencies)
result = executor.call_llm_and_parse()
assert result == "parser_error"
assert executor._last_parser_error is not None
@patch("crewai.experimental.crew_agent_executor_flow.get_llm_response")
@patch("crewai.experimental.crew_agent_executor_flow.enforce_rpm_limit")
@patch("crewai.experimental.crew_agent_executor_flow.is_context_length_exceeded")
def test_call_llm_context_error(
self,
mock_is_context_exceeded,
mock_enforce_rpm,
mock_get_llm,
mock_dependencies,
):
"""Test call_llm_and_parse handles context length error."""
mock_enforce_rpm.return_value = None
mock_get_llm.side_effect = Exception("context length")
mock_is_context_exceeded.return_value = True
executor = CrewAgentExecutorFlow(**mock_dependencies)
result = executor.call_llm_and_parse()
assert result == "context_error"
assert executor._last_context_error is not None
class TestFlowInvoke:
"""Test the invoke method that maintains backward compatibility."""
@pytest.fixture
def mock_dependencies(self):
"""Create mock dependencies."""
llm = Mock()
task = Mock()
task.description = "Test"
task.human_input = False
crew = Mock()
crew._short_term_memory = None
crew._long_term_memory = None
crew._entity_memory = None
crew._external_memory = None
agent = Mock()
agent.role = "Test"
agent.verbose = False
prompt = {"prompt": "Test {input} {tool_names} {tools}"}
return {
"llm": llm,
"task": task,
"crew": crew,
"agent": agent,
"prompt": prompt,
"max_iter": 10,
"tools": [],
"tools_names": "",
"stop_words": [],
"tools_description": "",
"tools_handler": Mock(),
}
@patch.object(CrewAgentExecutorFlow, "kickoff")
@patch.object(CrewAgentExecutorFlow, "_create_short_term_memory")
@patch.object(CrewAgentExecutorFlow, "_create_long_term_memory")
@patch.object(CrewAgentExecutorFlow, "_create_external_memory")
def test_invoke_success(
self,
mock_external_memory,
mock_long_term_memory,
mock_short_term_memory,
mock_kickoff,
mock_dependencies,
):
"""Test successful invoke without human feedback."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
# Mock kickoff to set the final answer in state
def mock_kickoff_side_effect():
executor.state.current_answer = AgentFinish(
thought="final thinking", output="Final result", text="complete"
)
mock_kickoff.side_effect = mock_kickoff_side_effect
inputs = {"input": "test", "tool_names": "", "tools": ""}
result = executor.invoke(inputs)
assert result == {"output": "Final result"}
mock_kickoff.assert_called_once()
mock_short_term_memory.assert_called_once()
mock_long_term_memory.assert_called_once()
mock_external_memory.assert_called_once()
@patch.object(CrewAgentExecutorFlow, "kickoff")
def test_invoke_failure_no_agent_finish(self, mock_kickoff, mock_dependencies):
"""Test invoke fails without AgentFinish."""
executor = CrewAgentExecutorFlow(**mock_dependencies)
executor.state.current_answer = AgentAction(
thought="thinking", tool="test", tool_input="test", text="action text"
)
inputs = {"input": "test", "tool_names": "", "tools": ""}
with pytest.raises(RuntimeError, match="without reaching a final answer"):
executor.invoke(inputs)
@patch.object(CrewAgentExecutorFlow, "kickoff")
@patch.object(CrewAgentExecutorFlow, "_create_short_term_memory")
@patch.object(CrewAgentExecutorFlow, "_create_long_term_memory")
@patch.object(CrewAgentExecutorFlow, "_create_external_memory")
def test_invoke_with_system_prompt(
self,
mock_external_memory,
mock_long_term_memory,
mock_short_term_memory,
mock_kickoff,
mock_dependencies,
):
"""Test invoke with system prompt configuration."""
mock_dependencies["prompt"] = {
"system": "System: {input}",
"user": "User: {input} {tool_names} {tools}",
}
executor = CrewAgentExecutorFlow(**mock_dependencies)
def mock_kickoff_side_effect():
executor.state.current_answer = AgentFinish(
thought="final thoughts", output="Done", text="complete"
)
mock_kickoff.side_effect = mock_kickoff_side_effect
inputs = {"input": "test", "tool_names": "", "tools": ""}
result = executor.invoke(inputs)
mock_short_term_memory.assert_called_once()
mock_long_term_memory.assert_called_once()
mock_external_memory.assert_called_once()
mock_kickoff.assert_called_once()
assert result == {"output": "Done"}
assert len(executor.state.messages) >= 2

View File

@@ -7,22 +7,19 @@ from crewai.events.event_listener import event_listener
class TestFlowHumanInputIntegration:
"""Test integration between Flow execution and human input functionality."""
def test_console_formatter_pause_resume_methods(self):
"""Test that ConsoleFormatter pause/resume methods work correctly."""
def test_console_formatter_pause_resume_methods_exist(self):
"""Test that ConsoleFormatter pause/resume methods exist and are callable."""
formatter = event_listener.formatter
original_paused_state = formatter._live_paused
# Methods should exist and be callable
assert hasattr(formatter, "pause_live_updates")
assert hasattr(formatter, "resume_live_updates")
assert callable(formatter.pause_live_updates)
assert callable(formatter.resume_live_updates)
try:
formatter._live_paused = False
formatter.pause_live_updates()
assert formatter._live_paused
formatter.resume_live_updates()
assert not formatter._live_paused
finally:
formatter._live_paused = original_paused_state
# Should not raise
formatter.pause_live_updates()
formatter.resume_live_updates()
@patch("builtins.input", return_value="")
def test_human_input_pauses_flow_updates(self, mock_input):
@@ -38,23 +35,16 @@ class TestFlowHumanInputIntegration:
formatter = event_listener.formatter
original_paused_state = formatter._live_paused
with (
patch.object(formatter, "pause_live_updates") as mock_pause,
patch.object(formatter, "resume_live_updates") as mock_resume,
):
result = executor._ask_human_input("Test result")
try:
formatter._live_paused = False
with (
patch.object(formatter, "pause_live_updates") as mock_pause,
patch.object(formatter, "resume_live_updates") as mock_resume,
):
result = executor._ask_human_input("Test result")
mock_pause.assert_called_once()
mock_resume.assert_called_once()
mock_input.assert_called_once()
assert result == ""
finally:
formatter._live_paused = original_paused_state
mock_pause.assert_called_once()
mock_resume.assert_called_once()
mock_input.assert_called_once()
assert result == ""
@patch("builtins.input", side_effect=["feedback", ""])
def test_multiple_human_input_rounds(self, mock_input):
@@ -70,53 +60,46 @@ class TestFlowHumanInputIntegration:
formatter = event_listener.formatter
original_paused_state = formatter._live_paused
pause_calls = []
resume_calls = []
try:
pause_calls = []
resume_calls = []
def track_pause():
pause_calls.append(True)
def track_pause():
pause_calls.append(True)
def track_resume():
resume_calls.append(True)
def track_resume():
resume_calls.append(True)
with (
patch.object(formatter, "pause_live_updates", side_effect=track_pause),
patch.object(
formatter, "resume_live_updates", side_effect=track_resume
),
):
result1 = executor._ask_human_input("Test result 1")
assert result1 == "feedback"
with (
patch.object(formatter, "pause_live_updates", side_effect=track_pause),
patch.object(
formatter, "resume_live_updates", side_effect=track_resume
),
):
result1 = executor._ask_human_input("Test result 1")
assert result1 == "feedback"
result2 = executor._ask_human_input("Test result 2")
assert result2 == ""
result2 = executor._ask_human_input("Test result 2")
assert result2 == ""
assert len(pause_calls) == 2
assert len(resume_calls) == 2
finally:
formatter._live_paused = original_paused_state
assert len(pause_calls) == 2
assert len(resume_calls) == 2
def test_pause_resume_with_no_live_session(self):
"""Test pause/resume methods handle case when no Live session exists."""
formatter = event_listener.formatter
original_live = formatter._live
original_paused_state = formatter._live_paused
original_streaming_live = formatter._streaming_live
try:
formatter._live = None
formatter._live_paused = False
formatter._streaming_live = None
# Should not raise when no session exists
formatter.pause_live_updates()
formatter.resume_live_updates()
assert not formatter._live_paused
assert formatter._streaming_live is None
finally:
formatter._live = original_live
formatter._live_paused = original_paused_state
formatter._streaming_live = original_streaming_live
def test_pause_resume_exception_handling(self):
"""Test that resume is called even if exception occurs during human input."""
@@ -131,23 +114,18 @@ class TestFlowHumanInputIntegration:
formatter = event_listener.formatter
original_paused_state = formatter._live_paused
with (
patch.object(formatter, "pause_live_updates") as mock_pause,
patch.object(formatter, "resume_live_updates") as mock_resume,
patch(
"builtins.input", side_effect=KeyboardInterrupt("Test exception")
),
):
with pytest.raises(KeyboardInterrupt):
executor._ask_human_input("Test result")
try:
with (
patch.object(formatter, "pause_live_updates") as mock_pause,
patch.object(formatter, "resume_live_updates") as mock_resume,
patch(
"builtins.input", side_effect=KeyboardInterrupt("Test exception")
),
):
with pytest.raises(KeyboardInterrupt):
executor._ask_human_input("Test result")
mock_pause.assert_called_once()
mock_resume.assert_called_once()
finally:
formatter._live_paused = original_paused_state
mock_pause.assert_called_once()
mock_resume.assert_called_once()
def test_training_mode_human_input(self):
"""Test human input in training mode."""
@@ -162,28 +140,25 @@ class TestFlowHumanInputIntegration:
formatter = event_listener.formatter
original_paused_state = formatter._live_paused
with (
patch.object(formatter, "pause_live_updates") as mock_pause,
patch.object(formatter, "resume_live_updates") as mock_resume,
patch.object(formatter.console, "print") as mock_console_print,
patch("builtins.input", return_value="training feedback"),
):
result = executor._ask_human_input("Test result")
try:
with (
patch.object(formatter, "pause_live_updates") as mock_pause,
patch.object(formatter, "resume_live_updates") as mock_resume,
patch("builtins.input", return_value="training feedback"),
):
result = executor._ask_human_input("Test result")
mock_pause.assert_called_once()
mock_resume.assert_called_once()
assert result == "training feedback"
mock_pause.assert_called_once()
mock_resume.assert_called_once()
assert result == "training feedback"
executor._printer.print.assert_called()
call_args = [
call[1]["content"]
for call in executor._printer.print.call_args_list
]
training_prompt_found = any(
"TRAINING MODE" in content for content in call_args
)
assert training_prompt_found
finally:
formatter._live_paused = original_paused_state
# Verify the training panel was printed via formatter's console
mock_console_print.assert_called()
# Check that a Panel with training title was printed
call_args = mock_console_print.call_args_list
training_panel_found = any(
hasattr(call[0][0], "title") and "Training" in str(call[0][0].title)
for call in call_args
if call[0]
)
assert training_panel_found

View File

@@ -1,116 +1,107 @@
from unittest.mock import MagicMock, patch
from rich.tree import Tree
from rich.live import Live
from crewai.events.utils.console_formatter import ConsoleFormatter
class TestConsoleFormatterPauseResume:
"""Test ConsoleFormatter pause/resume functionality."""
"""Test ConsoleFormatter pause/resume functionality for HITL features."""
def test_pause_live_updates_with_active_session(self):
"""Test pausing when Live session is active."""
def test_pause_stops_active_streaming_session(self):
"""Test pausing stops an active streaming Live session."""
formatter = ConsoleFormatter()
mock_live = MagicMock(spec=Live)
formatter._live = mock_live
formatter._live_paused = False
formatter._streaming_live = mock_live
formatter.pause_live_updates()
mock_live.stop.assert_called_once()
assert formatter._live_paused
assert formatter._streaming_live is None
def test_pause_live_updates_when_already_paused(self):
"""Test pausing when already paused does nothing."""
def test_pause_is_safe_when_no_session(self):
"""Test pausing when no streaming session exists doesn't error."""
formatter = ConsoleFormatter()
formatter._streaming_live = None
# Should not raise
formatter.pause_live_updates()
assert formatter._streaming_live is None
def test_multiple_pauses_are_safe(self):
"""Test calling pause multiple times is safe."""
formatter = ConsoleFormatter()
mock_live = MagicMock(spec=Live)
formatter._live = mock_live
formatter._live_paused = True
formatter._streaming_live = mock_live
formatter.pause_live_updates()
mock_live.stop.assert_called_once()
assert formatter._streaming_live is None
mock_live.stop.assert_not_called()
assert formatter._live_paused
def test_pause_live_updates_with_no_session(self):
"""Test pausing when no Live session exists."""
formatter = ConsoleFormatter()
formatter._live = None
formatter._live_paused = False
# Second pause should not error (no session to stop)
formatter.pause_live_updates()
assert formatter._live_paused
def test_resume_live_updates_when_paused(self):
"""Test resuming when paused."""
def test_resume_is_safe(self):
"""Test resume method exists and doesn't error."""
formatter = ConsoleFormatter()
formatter._live_paused = True
# Should not raise
formatter.resume_live_updates()
assert not formatter._live_paused
def test_resume_live_updates_when_not_paused(self):
"""Test resuming when not paused does nothing."""
def test_streaming_after_pause_resume_creates_new_session(self):
"""Test that streaming after pause/resume creates new Live session."""
formatter = ConsoleFormatter()
formatter.verbose = True
formatter._live_paused = False
# Simulate having an active session
mock_live = MagicMock(spec=Live)
formatter._streaming_live = mock_live
# Pause stops the session
formatter.pause_live_updates()
assert formatter._streaming_live is None
# Resume (no-op, sessions created on demand)
formatter.resume_live_updates()
assert not formatter._live_paused
# After resume, streaming should be able to start a new session
with patch("crewai.events.utils.console_formatter.Live") as mock_live_class:
mock_live_instance = MagicMock()
mock_live_class.return_value = mock_live_instance
def test_print_after_resume_restarts_live_session(self):
"""Test that printing a Tree after resume creates new Live session."""
# Simulate streaming chunk (this creates a new Live session)
formatter.handle_llm_stream_chunk("test chunk", call_type=None)
mock_live_class.assert_called_once()
mock_live_instance.start.assert_called_once()
assert formatter._streaming_live == mock_live_instance
def test_pause_resume_cycle_with_streaming(self):
"""Test full pause/resume cycle during streaming."""
formatter = ConsoleFormatter()
formatter._live_paused = True
formatter._live = None
formatter.resume_live_updates()
assert not formatter._live_paused
tree = Tree("Test")
formatter.verbose = True
with patch("crewai.events.utils.console_formatter.Live") as mock_live_class:
mock_live_instance = MagicMock()
mock_live_class.return_value = mock_live_instance
formatter.print(tree)
# Start streaming
formatter.handle_llm_stream_chunk("chunk 1", call_type=None)
assert formatter._streaming_live == mock_live_instance
mock_live_class.assert_called_once()
mock_live_instance.start.assert_called_once()
assert formatter._live == mock_live_instance
# Pause should stop the session
formatter.pause_live_updates()
mock_live_instance.stop.assert_called_once()
assert formatter._streaming_live is None
def test_multiple_pause_resume_cycles(self):
"""Test multiple pause/resume cycles work correctly."""
formatter = ConsoleFormatter()
# Resume (no-op)
formatter.resume_live_updates()
mock_live = MagicMock(spec=Live)
formatter._live = mock_live
formatter._live_paused = False
# Create a new mock for the next session
mock_live_instance_2 = MagicMock()
mock_live_class.return_value = mock_live_instance_2
formatter.pause_live_updates()
assert formatter._live_paused
mock_live.stop.assert_called_once()
assert formatter._live is None # Live session should be cleared
formatter.resume_live_updates()
assert not formatter._live_paused
formatter.pause_live_updates()
assert formatter._live_paused
formatter.resume_live_updates()
assert not formatter._live_paused
def test_pause_resume_state_initialization(self):
"""Test that _live_paused is properly initialized."""
formatter = ConsoleFormatter()
assert hasattr(formatter, "_live_paused")
assert not formatter._live_paused
# Streaming again creates new session
formatter.handle_llm_stream_chunk("chunk 2", call_type=None)
assert formatter._streaming_live == mock_live_instance_2
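
The rewritten formatter tests pin down a simpler contract: `pause_live_updates()` stops and clears any active streaming `Live` session and is safe to call repeatedly, while `resume_live_updates()` is effectively a no-op because the next stream chunk creates a fresh session on demand. A hedged sketch of the intended call pattern around blocking human input, mirroring the HITL tests above:

```python
from crewai.events.event_listener import event_listener

formatter = event_listener.formatter

# Stop the rich Live session so it doesn't fight input() for the terminal.
formatter.pause_live_updates()
try:
    feedback = input("Your feedback: ").strip()
finally:
    # Currently a no-op: the next handle_llm_stream_chunk() call
    # starts a new Live session on demand.
    formatter.resume_live_updates()
```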

uv.lock generated

File diff suppressed because it is too large