Mirror of https://github.com/crewAIInc/crewAI.git, synced 2026-05-02 15:52:34 +00:00
Introduce Evaluator Experiment (#3133)
* feat: add exchanged messages in LLMCallCompletedEvent
* feat: add GoalAlignment metric for Agent evaluation
* feat: add SemanticQuality metric for Agent evaluation
* feat: add Tool Metrics for Agent evaluation
* feat: add Reasoning Metrics for Agent evaluation, still in progress
* feat: add AgentEvaluator class. This class evaluates an Agent's results and reports them to the user.
* fix: do not evaluate Agents by default. This is an experimental feature that still needs further refinement.
* test: add Agent eval tests
* fix: render all feedback per iteration
* style: resolve linter issues
* style: fix mypy issues
* fix: allow messages to be empty on LLMCallCompletedEvent
* feat: add Experiment evaluation framework with baseline comparison
* fix: reset the evaluator for each experiment iteration
* fix: fix tracking of new test cases
* chore: split Experimental evaluation classes
* refactor: remove unused method
* refactor: isolate Console printing in a dedicated class
* fix: make a crew required to run an experiment
* fix: use a time-aware timestamp to define the experiment result
* test: add tests for the Evaluator Experiment
* style: fix linter issues
* fix: encode strings before hashing
* style: resolve linter issues
* feat: add experimental folder for beta features (#3141)
* test: move tests to experimental folder
@@ -0,0 +1,8 @@
+from crewai.experimental.evaluation.experiment.runner import ExperimentRunner
+from crewai.experimental.evaluation.experiment.result import ExperimentResults, ExperimentResult
+
+__all__ = [
+    "ExperimentRunner",
+    "ExperimentResults",
+    "ExperimentResult"
+]
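
The hunk above is the package's `__init__.py`, which only re-exports the experiment classes. As a rough sketch of how the pieces might fit together, the example below runs an experiment against a crew. The `dataset` schema and the `ExperimentRunner(dataset=...)` / `run(crew=...)` signatures are assumptions inferred from the commit messages ("make a crew required to run an experiment", "baseline comparison"), not confirmed by this diff.

```python
# Hypothetical usage sketch, not taken from the PR. Agent/Task/Crew are the
# standard crewAI API; the ExperimentRunner constructor and run() signature
# below are assumptions inferred from the commit messages.
from crewai import Agent, Crew, Task
from crewai.experimental.evaluation.experiment import ExperimentRunner

researcher = Agent(
    role="Researcher",
    goal="Answer questions accurately",
    backstory="An analyst who verifies claims before reporting them.",
)
answer_task = Task(
    description="Answer the question: {question}",
    expected_output="A short, factually correct answer.",
    agent=researcher,
)
crew = Crew(agents=[researcher], tasks=[answer_task])

# Assumed test-case shape: inputs fed to the crew plus the behavior we expect.
dataset = [
    {
        "inputs": {"question": "What year was Python 3.0 released?"},
        "expected_output": "2008",
    },
]

runner = ExperimentRunner(dataset=dataset)
results = runner.run(crew=crew)  # an ExperimentResults, per the exports above
```

The "baseline comparison" mentioned in the commit message presumably compares a run's `ExperimentResults` against a stored earlier run, but that part of the API is not visible in this hunk.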