Removing LangChain and Rebuilding Executor (#1322)

* rebuilding executor
* removing langchain
* Making all tests good
* fixing types and adding ability for not using system prompts
* improving types
* pleasing the types gods
* pleasing the types gods
* fixing parser, tools and executor
* making sure all tests pass
* final pass
* fixing type
* Updating Docs
* preparing to cut new version
2026-05-05 09:12:39 +00:00 · 2024-09-16 14:14:04 -03:00
parent 322780a5f3
commit e77442cf34
177 changed files with 27272 additions and 1618561 deletions
--- a/docs/core-concepts/Testing.md
+++ b/docs/core-concepts/Testing.md
@@ -9,7 +9,7 @@ Testing is a crucial part of the development process, and it is essential to ens

 ### Using the Testing Feature

-We added the CLI command `crewai test` to make it easy to test your crew. This command will run your crew for a specified number of iterations and provide detailed performance metrics. The parameters are `n_iterations` and `model` which are optional and default to 2 and `gpt-4o-mini` respectively. For now, the only provider available is OpenAI.
+We added the CLI command `crewai test` to make it easy to test your crew. This command will run your crew for a specified number of iterations and provide detailed performance metrics. The parameters are `n_iterations` and `model`, which are optional and default to 2 and `gpt-4o-mini` respectively. For now, the only provider available is OpenAI.

 ```bash
 crewai test
@@ -21,20 +21,36 @@ If you want to run more iterations or use a different model, you can specify the
 crewai test --n_iterations 5 --model gpt-4o
 ```

+or using the short forms:
+
+```bash
+crewai test -n 5 -m gpt-4o
+```
+
 When you run the `crewai test` command, the crew will be executed for the specified number of iterations, and the performance metrics will be displayed at the end of the run.

 A table of scores at the end will show the performance of the crew in terms of the following metrics:

 ```
-                Task Scores
-          (1-10 Higher is better)
-┏━━━━━━━━━━━━┳━━━━━━━┳━━━━━━━┳━━━━━━━━━━━━┓
-┃ Tasks/Crew ┃ Run 1 ┃ Run 2 ┃ Avg. Total ┃
-┡━━━━━━━━━━━━╇━━━━━━━╇━━━━━━━╇━━━━━━━━━━━━┩
-│ Task 1     │ 10.0  │ 9.0   │ 9.5        │
-│ Task 2     │ 9.0   │ 9.0   │ 9.0        │
-│ Crew       │ 9.5   │ 9.0   │ 9.2        │
-└────────────┴───────┴───────┴────────────┘
+                                                     Tasks Scores
+                                                (1-10 Higher is better)
+┏━━━━━━━━━━━━━━━━━━━━┯━━━━━━━┯━━━━━━━┯━━━━━━━━━━━━┯━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┯━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┓
+┃ Tasks/Crew/Agents  │ Run 1 │ Run 2 │ Avg. Total │ Agents                         │                                 ┃
+┠────────────────────┼───────┼───────┼────────────┼────────────────────────────────┼─────────────────────────────────┨
+┃ Task 1             │  9.0  │  9.5  │    9.2     │ - Professional Insights        │                                 ┃
+┃                    │       │       │            │ Researcher                     │                                 ┃
+┃                    │       │       │            │                                │                                 ┃
+┃ Task 2             │  9.0  │ 10.0  │    9.5     │ - Company Profile Investigator │                                 ┃
+┃                    │       │       │            │                                │                                 ┃
+┃ Task 3             │  9.0  │  9.0  │    9.0     │ - Automation Insights          │                                 ┃
+┃                    │       │       │            │ Specialist                     │                                 ┃
+┃                    │       │       │            │                                │                                 ┃
+┃ Task 4             │  9.0  │  9.0  │    9.0     │ - Final Report Compiler        │                                 ┃
+┃                    │       │       │            │                                │ - Automation Insights           ┃
+┃                    │       │       │            │                                │ Specialist                      ┃
+┃ Crew               │ 9.00  │ 9.38  │    9.2     │                                │                                 ┃
+┃ Execution Time (s) │  126  │  145  │    135     │                                │                                 ┃
+┗━━━━━━━━━━━━━━━━━━━━┷━━━━━━━┷━━━━━━━┷━━━━━━━━━━━━┷━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┷━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┛
 ```

 The example above shows the test results for two runs of the crew with two tasks, with the average total score for each task and the crew as a whole.
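
The docs do not say how the aggregate rows of the table are derived, so the following is only a sketch under an assumption: that the `Crew` row is the plain mean of the task scores for each run, and that `Avg. Total` is the plain mean across runs. The numbers in the new example table are consistent with that reading (9.00 and 9.38 for the two runs), but the function name `summarize` and the data layout are hypothetical and are not part of crewAI.

```python
from statistics import mean

# Per-task scores copied from the example table: one list per task, one entry per run.
task_scores = {
    "Task 1": [9.0, 9.5],
    "Task 2": [9.0, 10.0],
    "Task 3": [9.0, 9.0],
    "Task 4": [9.0, 9.0],
}
execution_times = [126, 145]  # seconds per run, copied from the table


def summarize(scores, times):
    """Hypothetical helper: recompute the aggregate cells of the table.

    Assumes simple arithmetic means; the real evaluator may aggregate differently.
    """
    n_runs = len(times)
    # "Avg. Total" column: mean of each task's scores across runs.
    task_avgs = {task: mean(runs) for task, runs in scores.items()}
    # "Crew" row: mean of all task scores within each run.
    crew_per_run = [mean(runs[i] for runs in scores.values()) for i in range(n_runs)]
    return {
        "task_avgs": task_avgs,
        "crew_per_run": crew_per_run,
        "crew_avg": mean(crew_per_run),
        "time_avg": mean(times),
    }


summary = summarize(task_scores, execution_times)
print(summary)
```

Under these assumptions the unrounded values are 9.25 for Task 1, 9.375 for the second crew run, and 135.5 seconds for the average execution time, so the table appears to display rounded or truncated figures.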