crewAI

mirror of https://github.com/crewAIInc/crewAI.git synced 2026-07-02 21:58:11 +00:00

Author	SHA1	Message	Date
Joao Moura	3eb1da2d9c	fix: resolve CI test flakiness from event bus state pollution Root cause: test_gap_implementations.py assigned directly to crewai_event_bus.emit (instance attribute), which shadowed the class method even after restoration. Later tests using patch.object on the class couldn't intercept calls. Also converts all 19 positional crewai_event_bus.emit() calls across 8 new_agent files to use the event= keyword argument, matching the pattern in llm.py. Adds <summary> tag stripping for both ainvoke() and astream() to prevent summarization prompt leakage in agent responses. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-05-14 16:10:00 -04:00
Joao Moura	16488f5fe5	fix: address PR #5788 review comments - Remove dead `env_vars.get("MODEL")` check in _setup_env (always truthy since MODEL is set two lines above) - Fix test_sync_delegation mock: use return_value instead of side_effect list and disable planning to prevent StopAsyncIteration on Python 3.10 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-05-14 13:08:49 -04:00
alex-clawd	84568860c3	fix: set VCR record_mode=none for test_hierarchical_verbose_manager_agent	2026-05-13 14:02:56 -07:00
alex-clawd	744a07cc0f	fix: pin vcr record_mode=none + bump gitpython/langchain-core/urllib3 vulns - test_streaming_properties_from_docs: add record_mode="none" so VCR never falls through to the real OpenAI API; cassette already exists. - gitpython >=3.1.50 (GHSA-mv93-w799-cj2w) - langchain-core >=1.3.1 (GHSA-pjwx-r37v-7724; resolves to 1.3.3) - urllib3 >=2.7.0 (GHSA-qccp-gfcp-xxvc, GHSA-mf9v-mfxr-j63j; 2.6.4 was never released) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 13:37:38 -07:00
alex-clawd	74bf197ccb	fix: resolve lint, test, and review issues - Replace S101 assert guards with explicit if/raise RuntimeError in benchmark.py and cli.py (3 locations) - Fix test_create_llm_from_env_with_unaccepted_attributes to use DEFAULT_LLM_MODEL with clear=True so the assertion isn't brittle against the hardcoded model name - Add n_iterations loop to _test_new_agents (was unused, now mirrors _train_new_agents iteration pattern) - Consolidate dotenv loading in cli.py and agent_tui.py to use the existing load_env_vars() from utils.py instead of duplicating logic Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 12:28:25 -07:00
alex-clawd	e2d66c524b	fix: disable VCR and memory for standalone agent test to prevent real API calls	2026-05-13 12:28:25 -07:00
alex-clawd	18e599b0f2	fix: resolve CI failures — mock test LLM and fix mypy type errors - test_lite_agent_standalone_still_works: replace real LLM with Mock to avoid ConnectionError hitting OpenAI in CI - coworker_tools.py:352: add type: ignore[import-not-found] for crewai.a2a.client - coworker_tools.py:415: filter BaseException instances from gather results so return type matches list[str] - executor.py:740: add type: ignore[import-not-found] for checkpoint_events - executor.py:2245: guard r.content access with isinstance(r, Message) check - flow.py:3259: cast model_dump() result to dict[str, Any] - flow.py: fix response/future no-redef errors by hoisting declarations and renaming coro_future to avoid duplicate type annotations Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 12:28:25 -07:00
alex-clawd	94b5e2ea7b	fix: address CI failures — ruff, mypy, mock OpenAI tests, JSONC support Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 12:28:25 -07:00
alex-clawd	2ddc348ad2	fix: resolve lint, type-check, and test failures - B904: raise KeyboardInterrupt from err in cli_provider.py - mypy: add TYPE_CHECKING import for SQLiteConversationStorage, annotate _initialized class var in TaskScheduler, fix Match type params and Returning Any in create_agent.py - tests: mock aget_llm_response in 3 integration tests that fail when network is blocked but OPENAI_API_KEY is set - flow.py: use asyncio.run_coroutine_threadsafe() instead of asyncio.run() when a loop is already running in ask() and say() - cli.py: fix threshold=0.0 treated as falsy by using `is not None` check Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-13 12:28:25 -07:00
Joao Moura	fc85637e60	feat: enhance benchmark case loading and CLI threshold handling - Introduced a new `LoadedCases` class to encapsulate benchmark cases and optional thresholds, improving data management. - Updated `load_benchmark_cases` function to support loading cases from both bare arrays and object wrappers with a threshold. - Modified CLI options to allow dynamic threshold configuration, defaulting to a value from `config.json` if not specified. - Enhanced error handling for invalid benchmark case formats and added tests to validate new functionality. These changes aim to improve the flexibility and usability of benchmark case management within the CrewAI framework.	2026-05-13 12:28:25 -07:00
Joao Moura	6cb29dce65	feat: enhance agent TUI and CLI with streaming responses and model selection improvements - Added a `_safe_render` function to escape Rich markup and convert markdown to Rich format. - Implemented token-by-token streaming for agent responses in the TUI, improving user experience during interactions. - Updated the CLI to allow selection of LLM providers and models, enhancing flexibility in agent creation. - Refactored benchmark case paths to use a `tests` directory instead of `benchmarks`. - Introduced a `last_stream_result` property in the `NewAgent` class to retrieve the latest streaming response. These changes aim to provide a more interactive and user-friendly experience in managing agents within the CrewAI framework.	2026-05-13 12:28:25 -07:00
Joao Moura	fe7f730546	feat: add interactive agent creation and TUI for multi-agent interaction - Introduced a new `create_agent` command for interactive agent definition. - Added `agent_tui.py` for a conversational TUI supporting multi-agent interactions. - Updated CLI to support agent creation and training workflows. - Enhanced `.gitignore` to exclude demo files and configuration artifacts. - Implemented a benchmark runner for testing agent performance against defined cases. This commit lays the groundwork for a more interactive and user-friendly experience in managing agents within the CrewAI framework.	2026-05-13 12:28:25 -07:00
Lorenze Jay	264da8245a	Lorenze/imp/prompt layering (#5774 ) Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Check Documentation Broken Links / Check broken links (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details Nightly Canary Release / Check for new commits (push) Has been cancelled Details Nightly Canary Release / Build nightly packages (push) Has been cancelled Details Nightly Canary Release / Publish nightly to PyPI (push) Has been cancelled Details Mark stale issues and pull requests / stale (push) Has been cancelled Details * improving prompt structure especially for prompt caching * addressing comments	2026-05-12 12:39:12 -07:00
iris-clawd	3322634625	feat: deprecate CrewAgentExecutor, default Crew agents to AgentExecutor (#5745 ) * feat: deprecate CrewAgentExecutor, default Crew agents to AgentExecutor * regen cassettes * fix tests * addressing pr comments --------- Co-authored-by: Lorenze Jay <63378463+lorenzejay@users.noreply.github.com> Co-authored-by: lorenzejay <lorenzejaytech@gmail.com> Co-authored-by: Greyson LaLonde <greyson.r.lalonde@gmail.com>	2026-05-12 11:22:13 -07:00
Greyson LaLonde	5d757cb626	fix(flow): log HITL pre-review and distillation failures, add learn_strict	2026-05-12 00:26:31 +08:00
Greyson LaLonde	93e786d263	refactor: extract CLI into standalone crewai-cli package	2026-05-06 20:46:46 +08:00
Greyson LaLonde	6494d68ffc	fix(gemini): include thoughts_token_count in completion tokens	2026-05-04 21:03:38 +08:00
Greyson LaLonde	f579aa53ae	fix: preserve task outputs across async batch flush	2026-05-04 20:24:24 +08:00
Greyson LaLonde	095f796922	fix: prevent result_as_answer from returning hook-block message as final answer	2026-05-04 19:42:07 +08:00
Zamuldinov Nikita	bfbdba426f	fix: prevent result_as_answer from returning error as final answer When a tool with result_as_answer=True raises an exception, the agent was receiving result_as_answer=True and returning the error string as the final answer. Now we set result_as_answer=False when an error event is emitted, allowing the agent to reflect and retry. Fixes crewAIInc/crewAI#5156 --------- Co-authored-by: NIK-TIGER-BILL <nik.tiger.bill@github.com> Co-authored-by: Greyson LaLonde <greyson.r.lalonde@gmail.com>	2026-05-04 19:28:21 +08:00
Greyson LaLonde	a058a3b15b	fix(task): use acall for output conversion in async paths	2026-05-04 18:42:12 +08:00
Greyson LaLonde	184c228ae9	fix: prevent shared LLM stop words mutation across agents Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details Mark stale issues and pull requests / stale (push) Has been cancelled Details	2026-05-04 14:23:17 +08:00
Greyson LaLonde	17e82743f6	fix: handle BaseModel input in convert_to_model	2026-05-03 14:17:03 +08:00
Tiago Freire	cd2b9ee38a	feat(flow): add `restore_from_state_id` kickoff parameter (#5674 ) ## Summary - Reverts `b0e2fda` ("fix(flow): add execution_id separate from state.id", COR-48): removes `Flow.execution_id` and points `current_flow_id` / `current_flow_request_id` back at `flow_id` (i.e. `state.id`). The separate per-run tracking id was no longer the right abstraction once `restore_from_state_id` reshapes how `state.id` is assigned; - Adds an optional `restore_from_state_id` kwarg to `Flow.kickoff` / `Flow.kickoff_async` that hydrates state from a previously-persisted flow's latest snapshot - Reassigns `state.id` to a fresh value (or `inputs["id"]` if pinned) so the new run's `@persist` writes don't extend the source's history - Existing `inputs["id"]` resume, `@persist`, and `from_checkpoint` paths are unchanged ## Problem `@persist` only supports resume today: `kickoff(inputs={"id": <uuid>})` hydrates state and continues writing under the same `flow_uuid`. There's no way to fork — hydrate from a snapshot but persist under a separate key, leaving the source's history intact. This PR adds that. \| \| `state.id` after kickoff \| `@persist` writes land under \| \|---\|---\|---\| \| `inputs["id"]` (resume) \| supplied id \| supplied id (extends history) \| \| `restore_from_state_id` (fork) \| fresh id, or `inputs["id"]` if pinned \| new id (source preserved) \| ## Behavior \| `inputs.id` \| `restore_from_state_id` \| Effect \| \|---\|---\|---\| \| — \| — \| Fresh kickoff \| \| set \| — \| Existing resume \| \| — \| UUID \| Fork — new `state.id`, hydrated from source \| \| set \| UUID \| Fork into a pinned `state.id`, hydrated from source \| - Source not found → silent fallback (mirrors existing resume) - Both `from_checkpoint` and `restore_from_state_id` set → `ValueError` - `restore_from_state_id=None` → byte-identical to current main ## Design Fork hydration runs before the existing `inputs` block in `kickoff_async`. On a hit, it calls the same `_restore_state` primitive used by resume, then overwrites `state.id` with a fresh UUID (or `inputs["id"]`). A `fork_succeeded` flag gates the existing `inputs["id"]` path so we don't double-load. `_completed_methods` / `_is_execution_resuming` are intentionally untouched — skip-completed-methods remains the territory of `apply_checkpoint` and `from_pending`. ## Test plan - [ ] `pytest tests/test_flow_persistence.py` — 5 new tests (four-row matrix, not-found fallback, default no-op, conflict raise) + 6 existing as regression - [ ] `pytest tests/test_flow.py` — broader flow suite - [ ] Manual end-to-end against an HITL `@persist` flow	2026-05-01 11:46:07 -04:00
Greyson LaLonde	70f391994e	fix(converter): fall through when JSON regex match isn't valid JSON	2026-05-01 00:48:09 +08:00
Vini Brasil	864f0a8a91	Revert "feat(flow): support custom persistence key in @persist (#5649 )" (#5668 ) This reverts commit `e2deac5575`.	2026-04-30 12:04:57 -03:00
Greyson LaLonde	9f13235037	fix(llm): preserve tool_calls when response also contains text	2026-04-30 22:53:01 +08:00
Matt Aitchison	c7f01048b7	feat(azure): forward credential_scopes to Azure AI Inference client (#5661 ) Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Check Documentation Broken Links / Check broken links (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details Nightly Canary Release / Check for new commits (push) Has been cancelled Details Nightly Canary Release / Build nightly packages (push) Has been cancelled Details Nightly Canary Release / Publish nightly to PyPI (push) Has been cancelled Details Mark stale issues and pull requests / stale (push) Has been cancelled Details * feat(azure): forward credential_scopes to Azure AI Inference client Adds a credential_scopes field to the native Azure AI Inference provider and a matching AZURE_CREDENTIAL_SCOPES env var (comma-separated). The value is forwarded to ChatCompletionsClient / AsyncChatCompletionsClient when set, letting keyless / Entra-based callers target a specific Azure AD audience (e.g. https://cognitiveservices.azure.com/.default) without subclassing the provider. Matches the upstream azure.ai.inference SDK kwarg of the same name. Lazy build re-reads the env var so an LLM constructed at module import (before deployment env vars are set) still picks up scopes — same pattern as the existing AZURE_API_KEY / AZURE_ENDPOINT lazy reads. to_config_dict round-trips the field. * refactor(azure): tighten credential_scopes env handling Address review feedback: - Move os.getenv into the helper so AZURE_CREDENTIAL_SCOPES appears once - Match the surrounding api_key/endpoint `or` style in the validator - Drop the list() defensive copy in to_config_dict — every other field in that method (and the base class's `stop`) is assigned by reference	2026-04-29 16:52:29 -05:00
Greyson LaLonde	14c3963d2c	fix(instructor): forward base_url and api_key to instructor.from_provider	2026-04-30 03:00:39 +08:00
Greyson LaLonde	feb2e715a3	fix(mcp): warn and return empty when native MCP server returns no tools	2026-04-30 02:41:01 +08:00
Kunal Karmakar	e0b86750c2	feat(azure): add Responses API support for Azure OpenAI provider (#5201 ) * Support azure openai responses * Revert function supported condition * Revert comment deletion * Update support stop words * Add cassette based tests * Fix linting	2026-04-29 11:12:11 -07:00
Lucas Gomide	e2deac5575	feat(flow): support custom persistence key in @persist (#5649 ) * feat(flow): add optional key param to @persist decorator Allows users to specify which state attribute to use as the persistence key instead of always defaulting to state.id. Usage: @persist(key='conversation_id') Falls back to state.id when key is not provided (no breaking change). Raises ValueError if the specified key is missing or falsy on state. * docs(flow): document @persist key parameter for custom persistence keys * fix(flow): use explicit None check for persist key to avoid empty-string fallback --------- Co-authored-by: iris-clawd <iris-clawd@anthropic.com> Co-authored-by: iris-clawd <iris@crewai.com> Co-authored-by: Lorenze Jay <63378463+lorenzejay@users.noreply.github.com>	2026-04-29 12:41:20 -04:00
Greyson LaLonde	07667829e9	fix(cli): guard crew chat description helpers against LLM failures Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details Build uv cache / build-cache (3.10) (push) Has been cancelled Details Build uv cache / build-cache (3.11) (push) Has been cancelled Details Build uv cache / build-cache (3.12) (push) Has been cancelled Details Build uv cache / build-cache (3.13) (push) Has been cancelled Details Mark stale issues and pull requests / stale (push) Has been cancelled Details	2026-04-29 10:30:24 +08:00
Greyson LaLonde	4c74dc0f86	fix(executor): reset messages and iterations between invocations CrewAgentExecutor is reused across sequential tasks but invoke/ainvoke only appended to self.messages and never reset self.iterations, so task 2 inherited task 1's history and iteration count.	2026-04-29 02:10:17 +08:00
Greyson LaLonde	45497478c0	fix(cli): forward trained-agents file through replay and test	2026-04-28 22:46:41 +08:00
Greyson LaLonde	4e9331a2c8	fix(agent): honor custom trained-agents file at inference	2026-04-28 22:09:34 +08:00
Greyson LaLonde	a29977f4f6	fix(crew): bind task-only agents to crew so multimodal input_files reach the LLM Some checks failed Build uv cache / build-cache (3.10) (push) Has been cancelled Details Build uv cache / build-cache (3.11) (push) Has been cancelled Details Build uv cache / build-cache (3.12) (push) Has been cancelled Details Build uv cache / build-cache (3.13) (push) Has been cancelled Details CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details	2026-04-28 20:53:39 +08:00
Greyson LaLonde	7a0a8cf56f	fix: serialize guardrail callables as null for JSON checkpointing	2026-04-28 14:57:49 +08:00
Greyson LaLonde	de0b2a4fe0	fix(deps): bump litellm for SSTI fix; ignore unfixable pip CVE	2026-04-28 04:34:17 +08:00
Tiago Freire	b0e2fda105	fix(flow): add execution_id separate from state.id Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details Nightly Canary Release / Check for new commits (push) Has been cancelled Details Nightly Canary Release / Build nightly packages (push) Has been cancelled Details Nightly Canary Release / Publish nightly to PyPI (push) Has been cancelled Details Mark stale issues and pull requests / stale (push) Has been cancelled Details * fix(flow): add execution_id separate from state.id (COR-48) When a consumer passes `id` in `kickoff(inputs=...)`, that value overwrites the flow's state.id — which was also being used as the execution tracking identity for telemetry, tracing, and external correlation. Two kickoffs sharing the same consumer id ended up with the same tracking id, breaking any downstream system that joins on it. Introduces `Flow.execution_id`: a stable per-run identifier stored as a `PrivateAttr` on the `Flow` model, exposed via property + setter. It defaults to a fresh `uuid4` per instance, is never touched by `inputs["id"]`, and can be assigned by outer systems that already have an execution identity (e.g. a task id). Switches the `current_flow_id` / `current_flow_request_id` ContextVars to seed from `execution_id` so OTel spans emitted by `FlowTrackable` children correlate on the stable tracking key. `state.id` keeps its existing override semantics for persistence/restore — consumers resuming a persisted flow via `inputs["id"]` work exactly as before. Adds tests covering default uniqueness per instance, immunity to consumer `inputs["id"]`, context-var propagation, absence from serialized state, and parity for dict-state flows. Co-authored-by: Greyson LaLonde <greyson.r.lalonde@gmail.com>	2026-04-24 04:48:14 +08:00
Greyson LaLonde	69d777ca50	fix(flow): replay recorded method events on checkpoint resume	2026-04-24 03:41:55 +08:00
Greyson LaLonde	77b2835a1d	fix(flow): serialize initial_state class refs as JSON schema Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details	2026-04-23 21:55:50 +08:00
Lorenze Jay	c77f1632dd	fix: preserve metadata-only agent skills Co-authored-by: Greyson LaLonde <greyson.r.lalonde@gmail.com>	2026-04-23 19:58:12 +08:00
Matt Aitchison	fdf3101b39	feat(azure): fall back to DefaultAzureCredential when no API key Enables keyless Azure auth (OIDC Workload Identity Federation, Managed Identity, Azure CLI, env-configured Service Principal) without any crewAI-specific configuration. Customers whose deployment environment already sets the standard azure-identity env vars get keyless auth for free; the existing API-key path is unchanged. Linear: FAC-40	2026-04-23 04:21:35 +08:00
Renato Nitta	42d6c03ebc	fix: propagate implicit @CrewBase names to crew events (#5574 ) * fix: propagate implicit @CrewBase names to crew events * test: appease static analysis for @CrewBase kickoff test --------- Co-authored-by: Greyson LaLonde <greyson.r.lalonde@gmail.com>	2026-04-21 15:57:19 -03:00
Lorenze Jay	6d153284d4	fix: merge execution metadata on duplicate batch initialization in Tr… (#5573 ) * fix: merge execution metadata on duplicate batch initialization in TraceBatchManager - Updated TraceBatchManager to merge execution metadata when a batch is initialized multiple times. - Enhanced logging to reflect the merging of metadata during duplicate initialization. - Added a test case to verify that execution metadata is correctly merged when initializing a batch after a lazy action. * drop env events emitting from traces listener	2026-04-21 10:12:24 -07:00
Greyson LaLonde	ae242c507d	feat: add checkpoint and fork support to standalone agents Add fork classmethod, _restore_runtime, and _restore_event_scope to BaseAgent. Fix from_checkpoint to set runtime state on the event bus and restore event scopes. Store kickoff event ID across checkpoints to skip re-emission on resume. Handle agent entity type in checkpoint CLI and TUI.	2026-04-20 22:47:37 +08:00
alex-clawd	0b120fac90	fix: use future dates in checkpoint prune tests to prevent time-dependent failures (#5543 ) Some checks failed CodeQL Advanced / Analyze (actions) (push) Has been cancelled Details CodeQL Advanced / Analyze (python) (push) Has been cancelled Details Vulnerability Scan / pip-audit (push) Has been cancelled Details Mark stale issues and pull requests / stale (push) Has been cancelled Details The test_older_than tests in both JSON and SQLite prune suites used hardcoded 2026-04-17 timestamps for the 'new' checkpoint. Once that date passes, the checkpoint is older than 1 day and gets pruned along with the 'old' one, causing assert count >= 1 to fail (count=0). Use 2099-01-01 for the 'new' checkpoint so tests remain stable. Co-authored-by: Joao Moura <joaomdmoura@gmail.com>	2026-04-20 01:27:12 -03:00
Greyson LaLonde	c5192b970c	feat: add checkpoint resume, diff, prune commands and save discoverability Add three new CLI subcommands to improve checkpoint UX: - `crewai checkpoint resume [id]` skips the TUI and resumes from the latest or specified checkpoint directly - `crewai checkpoint diff <id1> <id2>` compares two checkpoints showing changes in metadata, inputs, task status, and outputs - `crewai checkpoint prune --keep N --older-than Xd` removes old checkpoints from JSON dirs or SQLite databases Also writes a resume hint to stderr after every checkpoint save so users discover the command without needing to know it exists.	2026-04-17 04:50:15 +08:00
Greyson LaLonde	54391fdbdf	feat: add from_checkpoint parameter to Agent.kickoff, kickoff_async, akickoff	2026-04-17 03:40:37 +08:00

1 2 3 4 5

245 Commits