crewAI/src/crewai
Lucas Gomide 08fa3797ca Introducing Agent evaluation (#3130)
* feat: add exchanged messages in LLMCallCompletedEvent

* feat: add GoalAlignment metric for Agent evaluation

* feat: add SemanticQuality metric for Agent evaluation

* feat: add Tool Metrics for Agent evaluation

* feat: add Reasoning Metrics for Agent evaluation, still in progress

* feat: add AgentEvaluator class

This class will evaluate an Agent's results and report them to the user

* fix: do not evaluate Agent by default

This is an experimental feature; we still need to refine it further

* test: add Agent eval tests

* fix: render all feedback per iteration

* style: resolve linter issues

* style: fix mypy issues

* fix: allow messages to be empty on LLMCallCompletedEvent
2025-07-11 13:18:03 -04:00