Compare commits

..

1 Commit

Author SHA1 Message Date
Devin AI
a32a8955ef fix: replace @lru_cache with instance-level caching in Repository.is_git_repo()
This fixes a memory leak where using @lru_cache on the is_git_repo() instance
method prevented garbage collection of Repository instances. The cache
dictionary held references to self, keeping instances alive indefinitely.

The fix replaces the @lru_cache decorator with instance-level caching using
a _is_git_repo_cache attribute. This maintains O(1) performance for repeated
calls while allowing proper garbage collection when instances go out of scope.

Fixes #4210

Co-Authored-By: João <joao@crewai.com>
2026-01-10 16:02:14 +00:00
13 changed files with 105 additions and 309 deletions
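To make the mechanism described in the commit message concrete, here is a minimal, self-contained sketch (illustrative only, not code from this repository) of why `functools.lru_cache` on an instance method keeps instances alive, and of the instance-attribute caching pattern the commit switches to:

```python
import gc
import weakref
from functools import lru_cache


class CachedWithLru:
    @lru_cache(maxsize=None)  # the cache lives on the function object and keys on `self`
    def is_ready(self) -> bool:
        return True


class CachedOnInstance:
    def __init__(self) -> None:
        self._is_ready_cache: bool | None = None  # cache stored on the instance itself

    def is_ready(self) -> bool:
        if self._is_ready_cache is None:
            self._is_ready_cache = True  # compute once, then reuse
        return self._is_ready_cache


for cls in (CachedWithLru, CachedOnInstance):
    obj = cls()
    obj.is_ready()
    ref = weakref.ref(obj)
    del obj
    gc.collect()
    # lru_cache keeps a strong reference to the instance via its cache key,
    # so only the instance-level variant is collected here.
    print(cls.__name__, "collected:", ref() is None)
```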

View File

@@ -574,10 +574,6 @@ When you run this Flow, the output will change based on the random boolean value
### Human in the Loop (human feedback)
<Note>
The `@human_feedback` decorator requires **CrewAI version 1.8.0 or higher**.
</Note>
The `@human_feedback` decorator enables human-in-the-loop workflows by pausing flow execution to collect feedback from a human. This is useful for approval gates, quality review, and decision points that require human judgment.
```python Code

View File

@@ -7,10 +7,6 @@ mode: "wide"
## Overview
<Note>
The `@human_feedback` decorator requires **CrewAI version 1.8.0 or higher**. Make sure to update your installation before using this feature.
</Note>
The `@human_feedback` decorator enables human-in-the-loop (HITL) workflows directly within CrewAI Flows. It allows you to pause flow execution, present output to a human for review, collect their feedback, and optionally route to different listeners based on the feedback outcome.
This is particularly valuable for:

View File

@@ -11,10 +11,10 @@ Human-in-the-Loop (HITL) is a powerful approach that combines artificial intelli
CrewAI offers two main approaches for implementing human-in-the-loop workflows:
| Approach | Best For | Integration | Version |
|----------|----------|-------------|---------|
| **Flow-based** (`@human_feedback` decorator) | Local development, console-based review, synchronous workflows | [Human Feedback in Flows](/en/learn/human-feedback-in-flows) | **1.8.0+** |
| **Webhook-based** (Enterprise) | Production deployments, async workflows, external integrations (Slack, Teams, etc.) | This guide | - |
| Approach | Best For | Integration |
|----------|----------|-------------|
| **Flow-based** (`@human_feedback` decorator) | Local development, console-based review, synchronous workflows | [Human Feedback in Flows](/en/learn/human-feedback-in-flows) |
| **Webhook-based** (Enterprise) | Production deployments, async workflows, external integrations (Slack, Teams, etc.) | This guide |
<Tip>
If you're building flows and want to add human review steps with routing based on feedback, check out the [Human Feedback in Flows](/en/learn/human-feedback-in-flows) guide for the `@human_feedback` decorator.

View File

@@ -567,10 +567,6 @@ Fourth method running
### Human in the Loop (인간 피드백)
<Note>
`@human_feedback` 데코레이터는 **CrewAI 버전 1.8.0 이상**이 필요합니다.
</Note>
`@human_feedback` 데코레이터는 인간의 피드백을 수집하기 위해 플로우 실행을 일시 중지하는 human-in-the-loop 워크플로우를 가능하게 합니다. 이는 승인 게이트, 품질 검토, 인간의 판단이 필요한 결정 지점에 유용합니다.
```python Code

View File

@@ -7,10 +7,6 @@ mode: "wide"
## 개요
<Note>
`@human_feedback` 데코레이터는 **CrewAI 버전 1.8.0 이상**이 필요합니다. 이 기능을 사용하기 전에 설치를 업데이트하세요.
</Note>
`@human_feedback` 데코레이터는 CrewAI Flow 내에서 직접 human-in-the-loop(HITL) 워크플로우를 가능하게 합니다. Flow 실행을 일시 중지하고, 인간에게 검토를 위해 출력을 제시하고, 피드백을 수집하고, 선택적으로 피드백 결과에 따라 다른 리스너로 라우팅할 수 있습니다.
이는 특히 다음과 같은 경우에 유용합니다:

View File

@@ -5,22 +5,9 @@ icon: "user-check"
mode: "wide"
---
휴먼 인 더 루프(HITL, Human-in-the-Loop)는 인공지능과 인간의 전문 지식을 결합하여 의사결정을 강화하고 작업 결과를 향상시키는 강력한 접근 방식입니다. CrewAI는 필요에 따라 HITL을 구현하는 여러 가지 방법을 제공합니다.
휴먼 인 더 루프(HITL, Human-in-the-Loop)는 인공지능과 인간의 전문 지식을 결합하여 의사결정을 강화하고 작업 결과를 향상시키는 강력한 접근 방식입니다. 이 가이드에서는 CrewAI 내에서 HITL을 구현하는 방법을 안내합니다.
## HITL 접근 방식 선택
CrewAI는 human-in-the-loop 워크플로우를 구현하기 위한 두 가지 주요 접근 방식을 제공합니다:
| 접근 방식 | 적합한 용도 | 통합 | 버전 |
|----------|----------|-------------|---------|
| **Flow 기반** (`@human_feedback` 데코레이터) | 로컬 개발, 콘솔 기반 검토, 동기식 워크플로우 | [Flow에서 인간 피드백](/ko/learn/human-feedback-in-flows) | **1.8.0+** |
| **Webhook 기반** (Enterprise) | 프로덕션 배포, 비동기 워크플로우, 외부 통합 (Slack, Teams 등) | 이 가이드 | - |
<Tip>
Flow를 구축하면서 피드백을 기반으로 라우팅하는 인간 검토 단계를 추가하려면 `@human_feedback` 데코레이터에 대한 [Flow에서 인간 피드백](/ko/learn/human-feedback-in-flows) 가이드를 참조하세요.
</Tip>
## Webhook 기반 HITL 워크플로우 설정
## HITL 워크플로우 설정
<Steps>
<Step title="작업 구성">

View File

@@ -309,10 +309,6 @@ Ao executar esse Flow, a saída será diferente dependendo do valor booleano ale
### Human in the Loop (feedback humano)
<Note>
O decorador `@human_feedback` requer **CrewAI versão 1.8.0 ou superior**.
</Note>
O decorador `@human_feedback` permite fluxos de trabalho human-in-the-loop, pausando a execução do flow para coletar feedback de um humano. Isso é útil para portões de aprovação, revisão de qualidade e pontos de decisão que requerem julgamento humano.
```python Code

View File

@@ -7,10 +7,6 @@ mode: "wide"
## Visão Geral
<Note>
O decorador `@human_feedback` requer **CrewAI versão 1.8.0 ou superior**. Certifique-se de atualizar sua instalação antes de usar este recurso.
</Note>
O decorador `@human_feedback` permite fluxos de trabalho human-in-the-loop (HITL) diretamente nos CrewAI Flows. Ele permite pausar a execução do flow, apresentar a saída para um humano revisar, coletar seu feedback e, opcionalmente, rotear para diferentes listeners com base no resultado do feedback.
Isso é particularmente valioso para:

View File

@@ -5,22 +5,9 @@ icon: "user-check"
mode: "wide"
---
Human-in-the-Loop (HITL) é uma abordagem poderosa que combina a inteligência artificial com a experiência humana para aprimorar a tomada de decisões e melhorar os resultados das tarefas. CrewAI oferece várias maneiras de implementar HITL dependendo das suas necessidades.
Human-in-the-Loop (HITL) é uma abordagem poderosa que combina a inteligência artificial com a experiência humana para aprimorar a tomada de decisões e melhorar os resultados das tarefas. Este guia mostra como implementar HITL dentro da CrewAI.
## Escolhendo Sua Abordagem HITL
CrewAI oferece duas abordagens principais para implementar workflows human-in-the-loop:
| Abordagem | Melhor Para | Integração | Versão |
|----------|----------|-------------|---------|
| **Baseada em Flow** (decorador `@human_feedback`) | Desenvolvimento local, revisão via console, workflows síncronos | [Feedback Humano em Flows](/pt-BR/learn/human-feedback-in-flows) | **1.8.0+** |
| **Baseada em Webhook** (Enterprise) | Deployments em produção, workflows assíncronos, integrações externas (Slack, Teams, etc.) | Este guia | - |
<Tip>
Se você está construindo flows e deseja adicionar etapas de revisão humana com roteamento baseado em feedback, confira o guia [Feedback Humano em Flows](/pt-BR/learn/human-feedback-in-flows) para o decorador `@human_feedback`.
</Tip>
## Configurando Workflows HITL Baseados em Webhook
## Configurando Workflows HITL
<Steps>
<Step title="Configure sua Tarefa">

View File

@@ -1,10 +1,10 @@
-from functools import lru_cache
 import subprocess


 class Repository:
     def __init__(self, path: str = ".") -> None:
         self.path = path
+        self._is_git_repo_cache: bool | None = None

         if not self.is_git_installed():
             raise ValueError("Git is not installed or not found in your PATH.")
@@ -40,22 +40,26 @@ class Repository:
encoding="utf-8",
).strip()
@lru_cache(maxsize=None) # noqa: B019
def is_git_repo(self) -> bool:
"""Check if the current directory is a git repository.
Notes:
- TODO: This method is cached to avoid redundant checks, but using lru_cache on methods can lead to memory leaks
The result is cached at the instance level to avoid redundant checks
while allowing proper garbage collection of Repository instances.
"""
if self._is_git_repo_cache is not None:
return self._is_git_repo_cache
try:
subprocess.check_output(
["git", "rev-parse", "--is-inside-work-tree"], # noqa: S607
cwd=self.path,
encoding="utf-8",
)
return True
self._is_git_repo_cache = True
except subprocess.CalledProcessError:
return False
self._is_git_repo_cache = False
return self._is_git_repo_cache
def has_uncommitted_changes(self) -> bool:
"""Check if the repository has uncommitted changes."""

View File

@@ -209,9 +209,10 @@ class EventListener(BaseEventListener):
         @crewai_event_bus.on(TaskCompletedEvent)
         def on_task_completed(source: Any, event: TaskCompletedEvent) -> None:
             # Handle telemetry
-            span = self.execution_spans.pop(source, None)
+            span = self.execution_spans.get(source)
             if span:
                 self._telemetry.task_ended(span, source, source.agent.crew)
+            self.execution_spans[source] = None

             # Pass task name if it exists
             task_name = get_task_name(source)
@@ -221,10 +222,11 @@ class EventListener(BaseEventListener):
         @crewai_event_bus.on(TaskFailedEvent)
         def on_task_failed(source: Any, event: TaskFailedEvent) -> None:
-            span = self.execution_spans.pop(source, None)
+            span = self.execution_spans.get(source)
             if span:
                 if source.agent and source.agent.crew:
                     self._telemetry.task_ended(span, source, source.agent.crew)
+                self.execution_spans[source] = None

             # Pass task name if it exists
             task_name = get_task_name(source)
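For context on this hunk: `dict.pop(key, None)` removes the entry outright, while `get(key)` followed by assigning `None` keeps the key in the mapping, so `execution_spans` retains one slot per task ever seen; the test file deleted at the end of this diff exercised the former behavior. A generic sketch (not CrewAI code) of the difference:

```python
spans: dict[str, object | None] = {}

# Pattern A: pop(key, None) removes the entry entirely.
spans["task-1"] = object()
span = spans.pop("task-1", None)
assert span is not None and "task-1" not in spans

# Pattern B: get(key) plus assigning None keeps the key, so the dict never shrinks.
spans["task-2"] = object()
span = spans.get("task-2")
spans["task-2"] = None
assert "task-2" in spans and len(spans) == 1
```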

View File

@@ -1,4 +1,8 @@
import gc
import weakref
import pytest
from crewai.cli.git import Repository
@@ -99,3 +103,82 @@ def test_origin_url(fp, repository):
stdout="https://github.com/user/repo.git\n",
)
assert repository.origin_url() == "https://github.com/user/repo.git"
def test_repository_garbage_collection(fp):
"""Test that Repository instances can be garbage collected.
This test verifies the fix for the memory leak issue where using
@lru_cache on the is_git_repo() method prevented garbage collection
of Repository instances.
"""
fp.register(["git", "--version"], stdout="git version 2.30.0\n")
fp.register(["git", "rev-parse", "--is-inside-work-tree"], stdout="true\n")
fp.register(["git", "fetch"], stdout="")
repo = Repository(path=".")
weak_ref = weakref.ref(repo)
assert weak_ref() is not None
del repo
gc.collect()
assert weak_ref() is None, (
"Repository instance was not garbage collected. "
"This indicates a memory leak, likely from @lru_cache on instance methods."
)
def test_is_git_repo_caching(fp):
"""Test that is_git_repo() result is cached at the instance level.
This verifies that the instance-level caching works correctly,
only calling the subprocess once per instance.
"""
fp.register(["git", "--version"], stdout="git version 2.30.0\n")
fp.register(["git", "rev-parse", "--is-inside-work-tree"], stdout="true\n")
fp.register(["git", "fetch"], stdout="")
repo = Repository(path=".")
assert repo._is_git_repo_cache is True
result1 = repo.is_git_repo()
result2 = repo.is_git_repo()
assert result1 is True
assert result2 is True
assert repo._is_git_repo_cache is True
def test_multiple_repository_instances_independent_caches(fp):
"""Test that multiple Repository instances have independent caches.
This verifies that the instance-level caching doesn't share state
between different Repository instances.
"""
fp.register(["git", "--version"], stdout="git version 2.30.0\n")
fp.register(["git", "rev-parse", "--is-inside-work-tree"], stdout="true\n")
fp.register(["git", "fetch"], stdout="")
fp.register(["git", "--version"], stdout="git version 2.30.0\n")
fp.register(["git", "rev-parse", "--is-inside-work-tree"], stdout="true\n")
fp.register(["git", "fetch"], stdout="")
repo1 = Repository(path=".")
repo2 = Repository(path=".")
assert repo1._is_git_repo_cache is True
assert repo2._is_git_repo_cache is True
assert repo1._is_git_repo_cache is not repo2._is_git_repo_cache or (
repo1._is_git_repo_cache == repo2._is_git_repo_cache
)
weak_ref1 = weakref.ref(repo1)
del repo1
gc.collect()
assert weak_ref1() is None
assert repo2._is_git_repo_cache is True

View File

@@ -1,243 +0,0 @@
"""Tests for EventListener execution_spans cleanup to prevent memory leaks."""
import asyncio
from unittest.mock import MagicMock, patch
import pytest
from crewai.events.event_bus import crewai_event_bus
from crewai.events.event_listener import EventListener
from crewai.events.types.task_events import (
TaskCompletedEvent,
TaskFailedEvent,
TaskStartedEvent,
)
from crewai.tasks.task_output import TaskOutput
class MockAgent:
"""Mock agent for testing."""
def __init__(self, role: str = "test_role"):
self.role = role
self.crew = MagicMock()
class MockTask:
"""Mock task for testing."""
def __init__(self, task_id: str = "test_task"):
self.id = task_id
self.name = "Test Task"
self.description = "A test task description"
self.agent = MockAgent()
@pytest.fixture
def event_listener():
"""Create a fresh EventListener instance for testing."""
EventListener._instance = None
EventListener._initialized = False
listener = EventListener()
listener.setup_listeners(crewai_event_bus)
return listener
@pytest.fixture
def mock_task():
"""Create a mock task for testing."""
return MockTask()
@pytest.fixture
def mock_task_output():
"""Create a mock task output for testing."""
return TaskOutput(
description="Test task description",
raw="Test output",
agent="test_agent",
)
@pytest.mark.asyncio
async def test_execution_spans_removed_on_task_completed(
    event_listener, mock_task, mock_task_output
):
    """Test that execution_spans entries are properly removed when a task completes.

    This test verifies the fix for the memory leak where completed tasks were
    setting execution_spans[source] = None instead of removing the key entirely.
    """
    with patch.object(event_listener._telemetry, "task_started") as mock_task_started:
        with patch.object(event_listener._telemetry, "task_ended"):
            mock_span = MagicMock()
            mock_task_started.return_value = mock_span

            start_event = TaskStartedEvent(context="test context", task=mock_task)
            future = crewai_event_bus.emit(mock_task, start_event)
            if future:
                await asyncio.wrap_future(future)

            assert mock_task in event_listener.execution_spans
            assert event_listener.execution_spans[mock_task] == mock_span

            completed_event = TaskCompletedEvent(output=mock_task_output, task=mock_task)
            future = crewai_event_bus.emit(mock_task, completed_event)
            if future:
                await asyncio.wrap_future(future)

            assert mock_task not in event_listener.execution_spans
@pytest.mark.asyncio
async def test_execution_spans_removed_on_task_failed(event_listener, mock_task):
    """Test that execution_spans entries are properly removed when a task fails.

    This test verifies the fix for the memory leak where failed tasks were
    setting execution_spans[source] = None instead of removing the key entirely.
    """
    with patch.object(event_listener._telemetry, "task_started") as mock_task_started:
        with patch.object(event_listener._telemetry, "task_ended"):
            mock_span = MagicMock()
            mock_task_started.return_value = mock_span

            start_event = TaskStartedEvent(context="test context", task=mock_task)
            future = crewai_event_bus.emit(mock_task, start_event)
            if future:
                await asyncio.wrap_future(future)

            assert mock_task in event_listener.execution_spans
            assert event_listener.execution_spans[mock_task] == mock_span

            failed_event = TaskFailedEvent(error="Test error", task=mock_task)
            future = crewai_event_bus.emit(mock_task, failed_event)
            if future:
                await asyncio.wrap_future(future)

            assert mock_task not in event_listener.execution_spans
@pytest.mark.asyncio
async def test_execution_spans_dict_size_does_not_grow_unbounded(
    event_listener, mock_task_output
):
    """Test that execution_spans dictionary size remains bounded after many tasks.

    This test simulates the memory leak scenario where many tasks complete/fail
    and verifies that the dictionary doesn't grow unboundedly.
    """
    num_tasks = 100

    with patch.object(event_listener._telemetry, "task_started") as mock_task_started:
        with patch.object(event_listener._telemetry, "task_ended"):
            mock_task_started.return_value = MagicMock()

            for i in range(num_tasks):
                task = MockTask(task_id=f"task_{i}")

                start_event = TaskStartedEvent(context="test context", task=task)
                future = crewai_event_bus.emit(task, start_event)
                if future:
                    await asyncio.wrap_future(future)

                if i % 2 == 0:
                    completed_event = TaskCompletedEvent(
                        output=mock_task_output, task=task
                    )
                    future = crewai_event_bus.emit(task, completed_event)
                else:
                    failed_event = TaskFailedEvent(error="Test error", task=task)
                    future = crewai_event_bus.emit(task, failed_event)

                if future:
                    await asyncio.wrap_future(future)

    assert len(event_listener.execution_spans) == 0
@pytest.mark.asyncio
async def test_execution_spans_handles_missing_task_gracefully(
    event_listener, mock_task, mock_task_output
):
    """Test that completing/failing a task not in execution_spans doesn't cause errors.

    This ensures the fix using pop(source, None) handles edge cases gracefully.
    """
    with patch.object(event_listener._telemetry, "task_ended"):
        assert mock_task not in event_listener.execution_spans

        completed_event = TaskCompletedEvent(output=mock_task_output, task=mock_task)
        future = crewai_event_bus.emit(mock_task, completed_event)
        if future:
            await asyncio.wrap_future(future)

        assert mock_task not in event_listener.execution_spans


@pytest.mark.asyncio
async def test_execution_spans_handles_missing_task_on_failure_gracefully(
    event_listener, mock_task
):
    """Test that failing a task not in execution_spans doesn't cause errors.

    This ensures the fix using pop(source, None) handles edge cases gracefully.
    """
    with patch.object(event_listener._telemetry, "task_ended"):
        assert mock_task not in event_listener.execution_spans

        failed_event = TaskFailedEvent(error="Test error", task=mock_task)
        future = crewai_event_bus.emit(mock_task, failed_event)
        if future:
            await asyncio.wrap_future(future)

        assert mock_task not in event_listener.execution_spans
@pytest.mark.asyncio
async def test_telemetry_task_ended_called_with_span_on_completion(
    event_listener, mock_task, mock_task_output
):
    """Test that telemetry.task_ended is called with the correct span on completion."""
    with patch.object(event_listener._telemetry, "task_started") as mock_task_started:
        with patch.object(event_listener._telemetry, "task_ended") as mock_task_ended:
            mock_span = MagicMock()
            mock_task_started.return_value = mock_span

            start_event = TaskStartedEvent(context="test context", task=mock_task)
            future = crewai_event_bus.emit(mock_task, start_event)
            if future:
                await asyncio.wrap_future(future)

            completed_event = TaskCompletedEvent(output=mock_task_output, task=mock_task)
            future = crewai_event_bus.emit(mock_task, completed_event)
            if future:
                await asyncio.wrap_future(future)

            mock_task_ended.assert_called_once_with(
                mock_span, mock_task, mock_task.agent.crew
            )


@pytest.mark.asyncio
async def test_telemetry_task_ended_called_with_span_on_failure(
    event_listener, mock_task
):
    """Test that telemetry.task_ended is called with the correct span on failure."""
    with patch.object(event_listener._telemetry, "task_started") as mock_task_started:
        with patch.object(event_listener._telemetry, "task_ended") as mock_task_ended:
            mock_span = MagicMock()
            mock_task_started.return_value = mock_span

            start_event = TaskStartedEvent(context="test context", task=mock_task)
            future = crewai_event_bus.emit(mock_task, start_event)
            if future:
                await asyncio.wrap_future(future)

            failed_event = TaskFailedEvent(error="Test error", task=mock_task)
            future = crewai_event_bus.emit(mock_task, failed_event)
            if future:
                await asyncio.wrap_future(future)

            mock_task_ended.assert_called_once_with(
                mock_span, mock_task, mock_task.agent.crew
            )