crewAI/lib/crewai/tests at 83b07b9d232ea0cbfc157b95ce74754bd3ddf310 - crewAI - Git(ea) 4 Hoffmanns-family

andre/crewAI

mirror of https://github.com/crewAIInc/crewAI.git synced 2026-01-12 01:28:30 +00:00

Files

History

Devin AI 83b07b9d23 fix: prevent LLM observation hallucination by properly attributing tool results

Fixes #4181

The issue was that tool observations were being appended to the assistant
message in the conversation history, which caused the LLM to learn to
hallucinate fake observations during tool calls.

Changes:
- Add llm_response field to AgentAction to store the original LLM response
  before observation is appended
- Modify handle_agent_action_core to store llm_response before appending
  observation to text (text still contains observation for logging)
- Update CrewAgentExecutor._invoke_loop and _ainvoke_loop to:
  - Append LLM response as assistant message
  - Append observation as user message (not assistant)
- Apply same fix to LiteAgent._invoke_loop
- Apply same fix to CrewAgentExecutorFlow.execute_tool_action
- Fix add_image_tool special case in both executors to use same pattern
- Add comprehensive tests for proper message attribution

Co-Authored-By: João <joao@crewai.com>

2026-01-06 06:57:16 +00:00

..

fix: prevent LLM observation hallucination by properly attributing tool results

2026-01-06 06:57:16 +00:00

feat: add streaming tool call events; fix provider id tracking; add tests and cassettes

2026-01-05 14:33:36 -05:00

adjust aop to amp docs lang (#4179 )

2026-01-05 15:30:21 -08:00

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

feat: async crew support

2025-12-04 16:53:19 -05:00

fix: check platform compat for windows signals

2025-12-11 08:38:19 -05:00

fix: ensure otel span is closed

2025-12-05 13:23:26 -05:00

Lorenze/ensure hooks work with lite agents flows (#3981 )

2025-12-04 09:38:39 -08:00

feat: async knowledge support (#4023 )

2025-12-04 10:27:52 -08:00

feat: add streaming tool call events; fix provider id tracking; add tests and cassettes

2026-01-05 14:33:36 -05:00

fix: remove invalid param from sse client (#3980 )

2025-11-26 21:37:55 -08:00

feat: async memory support

2025-12-04 12:54:49 -05:00

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

fix: hash callback args correctly to ensure caching works

2025-11-05 07:19:09 -05:00

fix: use HuggingFaceEmbeddingFunction for embeddings, update keys and add tests (#4005 )

2025-12-04 15:05:50 -08:00

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

feat: async task support (#4024 )

2025-12-04 13:34:29 -08:00

fix: ensure otel span is closed

2025-12-05 13:23:26 -05:00

feat: use json schema for tool argument serialization

2025-12-11 15:50:19 -05:00

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

Improve EventListener and TraceCollectionListener for improved event… (#4160 )

2025-12-30 11:36:31 -08:00

__init__.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_async_human_feedback.py

Adding HITL for Flows (#4143 )

2025-12-25 21:04:10 -03:00

test_context.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_crew_thread_safety.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_crew.py

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

test_custom_llm.py

feat: async llm support

2025-12-01 18:56:56 -05:00

test_flow_default_override.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_flow_human_input_integration.py

Improve EventListener and TraceCollectionListener for improved event… (#4160 )

2025-12-30 11:36:31 -08:00

test_flow_persistence.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_flow_resumability_regression.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_flow_visualization.py

fix: ensure fuzzy returns are more strict, show type warning

2025-11-24 17:35:12 -05:00

test_flow.py

feat: async flow kickoff

2025-12-04 17:08:08 -05:00

test_hallucination_guardrail.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_human_feedback_decorator.py

Adding HITL for Flows (#4143 )

2025-12-25 21:04:10 -03:00

test_human_feedback_integration.py

Adding HITL for Flows (#4143 )

2025-12-25 21:04:10 -03:00

test_imports.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_llm.py

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

test_markdown_task.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_multimodal_validation.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00

test_project.py

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

test_streaming_integration.py

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

test_streaming.py

feat: add streaming result support to flows and crews

2025-11-24 15:43:48 -05:00

test_task_guardrails.py

chore: restructure test env, cassettes, and conftest; fix flaky tests

2025-11-29 16:55:24 -05:00

test_task.py

fix: gracefully terminate the future when executing a task async

2025-12-11 12:03:33 -05:00

utils.py

Release/v1.0.0 (#3618 )

2025-10-20 14:10:19 -07:00