Add generated Flow Definition authoring skill (#6393)

* Add generated Flow Definition authoring skill Generate a portable skill from the Flow Definition schema so agents can author valid declarative flows with the same reference CrewAI uses to validate them. New declarative flow projects now write this skill. ```python from crewai.flow.flow_definition import FlowDefinition skill = FlowDefinition.skill(skips=(), examples_format="yaml") ``` * `examples_format` accepts `"yaml"` or `"json"`. * Supported skips: `conversational`, `non_linear_flows`, `each`, `hitl`, `persistence`, `config`, `expression_action`, `script_action`, `tool_action` The generated skill includes authoring rules, a routed crew example, and an API reference extracted from the Flow, action, state, agent, crew, and task Pydantic schemas. * Fix declarative flow scaffold without framework import * Fix skipped expression action guidance * Fix markdown links in skill
2026-07-01 05:08:12 +00:00 · 2026-06-30 09:13:33 -07:00
parent e8dced8a2d
commit 1556dbea3e
9 changed files with 1430 additions and 47 deletions
--- a/lib/cli/src/crewai_cli/create_flow.py
+++ b/lib/cli/src/crewai_cli/create_flow.py
@@ -126,10 +126,7 @@ def _create_declarative_flow(

    package_dir = Path(__file__).parent
    templates_dir = package_dir / "templates" / "declarative_flow"
-
-    agents_md_src = package_dir / "templates" / "AGENTS.md"
-    if agents_md_src.exists():
-        shutil.copy2(agents_md_src, project_root / "AGENTS.md")
+    root_template_files = {".gitignore", "AGENTS.md", "README.md", "pyproject.toml"}

    for src_file in templates_dir.rglob("*"):
        if not src_file.is_file():
@@ -138,7 +135,7 @@ def _create_declarative_flow(
        relative_path = src_file.relative_to(templates_dir)
        dst_file = (
            project_root / relative_path
-            if relative_path.name in {".gitignore", "README.md", "pyproject.toml"}
+            if relative_path.name in root_template_files
            else package_root / relative_path
        )
        dst_file.parent.mkdir(parents=True, exist_ok=True)
--- a/lib/cli/src/crewai_cli/templates/declarative_flow/AGENTS.md
+++ b/lib/cli/src/crewai_cli/templates/declarative_flow/AGENTS.md
@@ -0,0 +1,375 @@
+---
+name: flow-definition
+description: Create or edit CrewAI Flow declarations. Use when the user needs a YAML or JSON flow with methods, state, agents, crews, tools, outputs, or conditional branches.
+---
+
+# Flow Definition
+
+You are writing a CrewAI Flow declaration for the user.
+Use these instructions when the user asks you to create or edit a Flow.
+Return one valid `crewai.flow/v1` YAML or JSON document.
+
+Treat this document as instructions for you, not as text to show the user.
+Follow the examples for shape and formatting, then use the API reference to check exact fields.
+
+## Output Format
+
+Return one valid `crewai.flow/v1` Flow declaration.
+Do not include explanatory prose unless the user asks for it.
+
+## Build It In This Order
+
+1. Define `state` first. Use `type: json_schema` and put the JSON Schema inline.
+2. Put required input fields in `state.json_schema.required`. Do not rely on `state.default` to make fields required.
+3. Add at least one method with `start: true`.
+4. Add later methods with `listen`.
+5. Give each method exactly one `do` action object. Never make `do` a list.
+6. Pass data with `${...}` mappings from `state` and completed `outputs`.
+7. Before final output, check every `listen`, `emit`, and `outputs.some_method` reference.
+
+Method names must match `^[A-Za-z_][A-Za-z0-9_]*$`.
+
+## Choose One Action Per Method
+
+Pick the simplest action that does the job.
+
+- Use `call: expression` for simple reads, filters, computed values, and deterministic routing.
+- Use `call: tool` for packaged deterministic work: API calls, searches, lookups, scoring, file work, or custom CrewAI tools.
+- Use `call: agent` for one AI worker that classifies, decides, summarizes, writes, or drafts. Put `role`, `goal`, `backstory`, and `input` under `with`. Do not add an action-level `inputs` map to an agent.
+- Use `call: crew` for coordinated AI work with multiple agents or tasks. Define the crew under `with`. Pass runtime values with the action-level `inputs` map.
+- Use `call: each` when the same ordered mini-pipeline must run once per item. Give every step a `name`.
+- Use `human_feedback` when a method needs a human checkpoint.
+- Use `call: script` only for trusted inline Python. Scripts are not sandboxed.
+
+## Wire Methods Explicitly
+
+- `state` is the initial shared data shape. Action results do not automatically merge into `state`.
+- Read method results with `outputs.method_name` after that method can run.
+- `listen` targets a method name or a router-emitted event name.
+- Method names and emitted event names share one namespace. Avoid reusing the same string for both unless the user explicitly wants that.
+- Use `router: true` plus `emit` when one method chooses between named branches.
+- A router action must return exactly one emitted event string. It must not return JSON, a list, or an explanation.
+- Conditional `listen` values use `and` / `or`, for example `listen: {and: [validated, enriched]}`. They are not CEL.
+- Use `start: true` for normal entrypoints. Use conditional `start` only for advanced event-driven starts.
+
+If an agent is a router, make its goal say exactly what to return, for example:
+`Return exactly one bare value: approved, rejected, or needs_review. Do not include explanation.`
+Prefer `call: expression` when routing can be computed without an agent.
+
+## CEL And Dynamic Values
+
+CEL is the expression language for reading Flow data and making small decisions.
+Use tools, agents and crews, or trusted scripts for larger work or side effects.
+
+Use these expression forms correctly:
+
+- Raw CEL: use in `expr`, `in`, and `if`. Do not wrap raw CEL in `${...}`.
+- Action mappings: wrap the whole string in `${...}`.
+- Crew text: use `{name}` placeholders from crew inputs. Example: `Research {topic}`.
+- Crew inputs become prompt text only when agent or task text references matching `{name}` placeholders.
+- Passing an input that is not referenced by any `{name}` placeholder does not ground the crew. If the crew needs a field, put that placeholder in an agent `goal`, task `description`, or task `expected_output`.
+
+Available CEL variables:
+
+- `state`: initial input data, for example `state.ticket.subject`.
+- `outputs`: completed method outputs, for example `outputs.classify_ticket`.
+- `item`: the current item inside `each`, for example `item.company_domain`.
+- `outputs`: inside `each`, prior step outputs for the current item, for example `outputs.enrich`.
+
+Dynamic value rules:
+
+- A string is dynamic only when the whole trimmed string is wrapped in `${...}`.
+- `Ticket: ${state.ticket}` stays literal. For computed strings, build the string inside the CEL expression.
+- When an agent needs multiple fields, build one single-line CEL string with labels and separators. Example: `input: "${'Ticket ID: ' + state.ticket_id + '; Message: ' + state.message}"`.
+- In YAML, do not put `\n` escapes inside `${...}` strings. Prefer plain `${state.field}` values or the single-line labeled pattern above.
+- `${...}` keeps the value type. It does not always make text.
+- Crew action-level `inputs` are the actual Crew kickoff inputs. Use CEL-wrapped values there for runtime data from `state` or `outputs`.
+- Crew action-level `inputs` alone are not grounding. Include placeholders for the facts the model must use.
+- Crew outputs are objects. Use `${outputs.research_brief.raw}` for text.
+- For structured crew output, use fields like `${outputs.research_brief.json_dict.field}` or `${outputs.research_brief.pydantic.field}`.
+- Do not pass a whole crew output to an agent input, like `${outputs.research_brief}`.
+- Agent outputs may also be objects. Use fields like `${outputs.classify_ticket.raw}` or `${outputs.classify_ticket.pydantic.category}`.
+- Use `with.inputs` only for static Crew input defaults.
+- Agent action `with.input` is the agent's single input value.
+- Trusted `script` actions can mutate state explicitly. Use them only when the user asks for that behavior.
+
+## Do Not
+
+- Do not invent top-level keys outside the Flow declaration shape.
+- Do not use fields outside the declaration schema or tool refs shown here.
+- Do not put more than one action under a method's `do`.
+- Do not make `do` a list.
+- Do not reference `outputs.some_method` before `some_method` can run.
+- Do not use the same string for an emitted event and a method name unless the user asks for it.
+- Do not use `emit` without `router: true` or `human_feedback.emit`.
+- Do not rely on crew action-level `inputs` alone to ground agent behavior. Inputs that do not match placeholders are effectively unused by the prompt.
+- Do not ask agents to infer missing facts when accuracy matters. Tell them to mark missing dates, amounts, offers, logs, or constraints as unknown.
+- Do not set `config.stream: true` unless the caller is expected to consume a streaming result. For normal generated flows and CLI smoke tests, omit it.
+- Do not put conversational settings under `state`, `config`, or a method. Use top-level `conversational` only when the user asks for a chat or conversation flow.
+- Do not use `each` without at least one named step.
+- Do not use `script` for untrusted input or user-authored code.
+
+## Examples
+
+### Crew review with routed follow-up
+
+```yaml
+schema: crewai.flow/v1
+name: ResearchReviewFlow
+state:
+  type: json_schema
+  json_schema:
+    type: object
+    properties:
+      topic:
+        type: string
+      audience:
+        type: string
+    required:
+      - topic
+      - audience
+  default:
+    topic: AI agent orchestration
+    audience: platform engineering leaders
+methods:
+  research_brief:
+    start: true
+    do:
+      call: crew
+      with:
+        agents:
+          researcher:
+            role: Research analyst
+            goal: Research {topic} for {audience}
+            backstory: Expert at concise technical research.
+          reviewer:
+            role: Strategy reviewer
+            goal: Decide whether the research needs an executive follow-up
+            backstory: Experienced at reviewing technical briefs for leaders.
+        tasks:
+          - name: research_task
+            description: Research {topic} for {audience}.
+            expected_output: Key findings and tradeoffs.
+            agent: researcher
+          - name: review_task
+            description: Review the research and decide if an executive follow-up is needed.
+            expected_output: 'A brief review ending with `needs_followup: true` or `needs_followup: false`.'
+            agent: reviewer
+        inputs:
+          topic: Default topic
+          audience: Default audience
+      inputs:
+        topic: "${state.topic}"
+        audience: "${state.audience}"
+  route_followup:
+    listen: research_brief
+    router: true
+    emit:
+      - followup
+      - done
+    do:
+      call: agent
+      with:
+        role: Follow-up router
+        goal: 'Return exactly one bare value: followup or done. Do not include explanation.'
+        backstory: Skilled at routing reviewed research briefs.
+        input: "${outputs.research_brief.raw}"
+  write_followup:
+    listen: followup
+    do:
+      call: agent
+      with:
+        role: Executive communications specialist
+        goal: Draft a concise executive follow-up from the reviewed research
+        backstory: Writes crisp follow-ups for technical leaders.
+        input: "${outputs.research_brief.raw}"
+```
+
+## API Reference
+
+Use this appendix to check exact field names, required fields, linked object types, and allowed action/state shapes. Linked type names point to another section in this reference.
+
+### Flow Definition
+
+Fields:
+- `schema` (optional): must be `crewai.flow/v1`; default `crewai.flow/v1`. Declarative Flow schema identifier and version. Include it explicitly in authored declarations.
+- `name` (required): string. Unique flow name used in logs, events, and traces.
+- `description` (optional): string | null; default `null`. Human-readable summary of the flow.
+- `state` (required): [State](#json-schema-state-statetypejson_schema). State contract for the initial state and updates during execution.
+- `config` (optional): [Config (`config`)](#config-config); default generated default. Serializable flow-level execution configuration.
+- `persist` (optional): [Persistence (`persist`)](#persistence-persist) | null; default `null`. Flow-level persistence configuration.
+- `conversational` (optional): object | null; default `null`. Top-level conversational flow configuration, only when the flow supports chat.
+- `methods` (required): map of string to [Method](#method-methods). Mapping of method names to method definitions.
+
+### JSON Schema State (`state[type=json_schema]`)
+
+Shape:
+- `type: json_schema`
+
+Fields:
+- `type` (optional): must be `json_schema`; default `json_schema`. Inline JSON Schema used as the Flow state contract.
+- `json_schema` (required): map of string to any. JSON Schema used to validate and document flow state. Declare required fields with JSON Schema's `required` array.
+- `default` (optional): map of string to any | null; default `null`. Default values used to initialize Flow state. Defaults are not the same as schema-required fields.
+
+### Method (`methods.<name>`)
+
+Fields:
+- `description` (optional): string | null; default `null`. Human-readable summary of what this method does.
+- `do` (required): [Action](#action). Single action object executed when this method runs.
+- `start` (optional): boolean | string | map of string to any | null; default `null`. Marks a start method. Use `true` for the normal entrypoint. String or map conditions are advanced trigger conditions; use them only when the user asks for event/condition-based starts.
+- `listen` (optional): string | map of string to any | null; default `null`. Trigger condition that runs this method after upstream events. A string target can be a method name or a router-emitted event name, and both live in the same trigger namespace. Map conditions are for `and`/`or` trigger composition, for example `{"and": ["validated", "processed"]}`.
+- `router` (optional): boolean; default `false`. Whether the method output should be treated as the next event name. Router actions must return one event name string, with no surrounding explanation.
+- `emit` (optional): list[string] | null; default `null`. Declared router events this method may emit. Each emitted event name should be unique and should not collide with method names.
+- `human_feedback` (optional): [Human Feedback (`methods.<name>.human_feedback`)](#human-feedback-methodshuman_feedback) | null; default `null`. Optional human feedback step applied after the method action.
+- `persist` (optional): [Persistence (`persist`)](#persistence-persist) | null; default `null`. Method-level persistence override.
+
+### Action
+
+Discriminated union by `call`.
+
+Allowed shapes:
+- [`call: script`](#script-action-methodsdocallscript)
+- [`call: tool`](#tool-action-methodsdocalltool)
+- [`call: crew`](#crew-action-methodsdocallcrew)
+- [`call: agent`](#agent-action-methodsdocallagent)
+- [`call: expression`](#expression-action-methodsdocallexpression)
+- [`call: each`](#each-action-methodsdocalleach)
+
+### Script Action (`methods.<name>.do[call=script]`)
+
+Shape:
+- `call: script`
+
+Fields:
+- `call` (required): must be `script`. Action discriminator. Use script to execute trusted inline Python.
+- `code` (required): string. Trusted inline Python source. Values are available as state and outputs; they are not interpolated into the source. This is not sandboxed.
+- `language` (optional): must be `python`; default `python`. Script language. Only python is currently supported.
+
+### Tool Action (`methods.<name>.do[call=tool]`)
+
+Shape:
+- `call: tool`
+
+Fields:
+- `call` (required): must be `tool`. Action discriminator. Use tool to instantiate and run a CrewAI tool.
+- `ref` (required): string. Reference to the CrewAI tool to run.
+- `with` (optional): map of string to expression data | null; default `null`. Tool input arguments. String values are evaluated as CEL only when the trimmed value starts with ${ and ends with }; all other values are literal.
+
+### Crew Action (`methods.<name>.do[call=crew]`)
+
+Shape:
+- `call: crew`
+
+Fields:
+- `call` (required): must be `crew`. Action discriminator. Use crew to run an inline Crew definition. Example: `crew`
+- `with` (required): inline crew definition. Inline Crew definition to load and execute for this action. Example: `{"agents": {"researcher": {"backstory": "Knows the domain.", "goal": "Research {topic}", "role": "Researcher"}}, "name": "inline_research", "tasks": [{"agent": "researcher", "description": "Research {topic}", "expected_output": "Findings about {topic}", "name": "research_task"}]}`
+- `inputs` (optional): map of string to expression data | null; default `null`. Actual kickoff inputs passed to the Crew. Use CEL-wrapped values here, for example `${state.topic}` or `${outputs.research_brief}`. The evaluated values are available to crew agent and task interpolation as `{name}` placeholders; reference each input the crew needs in agent or task text. Example: `{"topic": "${state.topic}"}`
+
+#### Crew Definition (`methods.<name>.do[call=crew].with`)
+
+Fields:
+- `agents` (required): map of string to any | list[map of string to any]. Inline crew agents keyed by agent name. Example: `{"researcher": {"backstory": "Expert at concise technical research.", "goal": "Research {topic}", "role": "Research analyst"}}`
+- `tasks` (required): list[any]. Ordered crew tasks. Example: `[{"agent": "researcher", "description": "Research {topic}.", "expected_output": "Key findings about {topic}.", "name": "research_task"}]`
+- `inputs` (optional): map of string to any. Static default crew inputs. Values are available to crew agent and task interpolation as `{name}` placeholders, for example `{topic}`. Prefer action-level crew `inputs` for runtime values from `state` or `outputs`, and include placeholders for any inputs the crew must reason over. Example: `{"topic": "AI agents"}`
+
+#### Crew Agent Definition (`methods.<name>.do[call=crew].with.agents.<name>`)
+
+Fields:
+- `role` (required): string. Crew agent role. Crew inputs are interpolated with `{name}` placeholders such as `{topic}`; this is not CEL. Example: `Research analyst`
+- `goal` (required): string. Crew agent goal. Crew inputs are interpolated with `{name}` placeholders such as `{topic}`; this is not CEL. Example: `Research {topic}`
+- `backstory` (required): string. Crew agent backstory. Crew inputs are interpolated with `{name}` placeholders such as `{topic}`; this is not CEL. Example: `Expert at concise technical research.`
+- `settings` (optional): map of string to any. Additional agent settings passed to the loader. Example: `{"llm": "openai/gpt-4o-mini"}`
+
+#### Crew Task Definition (`methods.<name>.do[call=crew].with.tasks[]`)
+
+Fields:
+- `description` (required): string. Task instructions. Crew inputs are interpolated with `{name}` placeholders such as `{topic}`; this is not CEL. Example: `Research {topic}.`
+- `expected_output` (required): string. Expected task output. Crew inputs are interpolated with `{name}` placeholders such as `{topic}`; this is not CEL. Example: `Key findings about {topic}.`
+- `name` (optional): string | null; default `null`. Optional task name. Example: `research_task`
+- `agent` (optional): string | null; default `null`. Name of the crew agent assigned to this task. Example: `researcher`
+
+### Agent Action (`methods.<name>.do[call=agent]`)
+
+Shape:
+- `call: agent`
+
+Fields:
+- `call` (required): must be `agent`. Action discriminator. Use agent to run an individual inline Agent definition outside of a crew. Example: `agent`
+- `with` (required): any. Individual Agent definition to load and execute outside of a crew for this action. Put the agent input in `with.input`; agent actions do not support action-level `inputs`. Example: `{"backstory": "Precise and concise.", "goal": "Answer user questions", "input": "${state.question}", "role": "Analyst", "settings": {"llm": "openai/gpt-4o-mini"}}`
+
+#### Agent Definition (`methods.<name>.do[call=agent].with`)
+
+Fields:
+- `role` (required): string. Individual agent role used by a Flow agent action outside of a crew. Example: `Support specialist`
+- `goal` (required): string. Individual agent goal for the Flow agent action outside of a crew. Example: `Draft a concise customer reply`
+- `backstory` (required): string. Individual agent backstory used to shape behavior outside of a crew. Example: `Expert at resolving SaaS support questions.`
+- `settings` (optional): map of string to any. Additional agent settings passed to the loader. Example: `{"llm": "openai/gpt-4o-mini"}`
+- `input` (required): string. Input passed to the individual agent kickoff outside of a crew. Use a single string value, often a dynamic `${...}` expression. When an agent needs multiple fields, build one single-line CEL string with labels and separators, for example `${'Ticket ID: ' + state.ticket_id + '; Message: ' + state.message}`. In YAML, avoid `\n` escapes inside `${...}` strings. Example: `${state.ticket.body}`
+
+### Expression Action (`methods.<name>.do[call=expression]`)
+
+Shape:
+- `call: expression`
+
+Fields:
+- `call` (required): must be `expression`. Action discriminator. Use expression to evaluate a CEL expression.
+- `expr` (required): string. CEL expression evaluated against state, outputs, and local context.
+
+### Each Action (`methods.<name>.do[call=each]`)
+
+Shape:
+- `call: each`
+
+Fields:
+- `call` (required): must be `each`. Action discriminator. Use each to run a sequence of actions for every item in an input list.
+- `in` (required): string. CEL expression that must evaluate to the list to iterate.
+- `do` (required): list of [Each Step (`methods.<name>.do[call=each].do[]`)](#each-step-methodsdocalleachdo). Ordered steps to run for each item. Each step has a name, optional if expression, and atomic action.
+
+### Each Step (`methods.<name>.do[call=each].do[]`)
+
+Fields:
+- `name` (required): string. Step name used to reference this step's output.
+- `if` (optional): string | null; default `null`. Optional CEL expression evaluated against state, outputs, and local context. When present, the step runs only if the expression evaluates to true.
+- `action` (required): [Tool Action (`methods.<name>.do[call=tool]`)](#tool-action-methodsdocalltool) | [Crew Action (`methods.<name>.do[call=crew]`)](#crew-action-methodsdocallcrew) | [Agent Action (`methods.<name>.do[call=agent]`)](#agent-action-methodsdocallagent) | [Expression Action (`methods.<name>.do[call=expression]`)](#expression-action-methodsdocallexpression) | [Script Action (`methods.<name>.do[call=script]`)](#script-action-methodsdocallscript). Atomic action to run for this step.
+
+### Config (`config`)
+
+Fields:
+- `tracing` (optional): boolean | null; default `null`. Override for flow tracing; when omitted, execution defaults apply.
+- `stream` (optional): boolean; default `false`. Whether the flow should emit streaming events when supported.
+- `memory` (optional): map of string to any | null; default `null`. Serializable memory configuration passed to flow execution.
+- `input_provider` (optional): string | null; default `null`. Provider key used to supply initial state.
+- `suppress_flow_events` (optional): boolean; default `false`. Disable flow event emission for this definition.
+- `max_method_calls` (optional): integer; default `100`. Maximum number of method executions allowed during one kickoff.
+- `defer_trace_finalization` (optional): boolean; default `false`. Defer trace finalization so callers can complete tracing later.
+- `checkpoint` (optional): boolean | map of string to any | null; default `null`. Checkpointing configuration, or true to use default checkpointing.
+
+### Persistence (`persist`)
+
+Fields:
+- `enabled` (optional): boolean; default `false`. Whether persistence is enabled for this flow or method.
+- `verbose` (optional): boolean; default `false`. Whether persistence should emit verbose diagnostic output.
+- `persistence` (optional): any; default `null`. Persistence backend configuration or import reference.
+
+### Human Feedback (`methods.<name>.human_feedback`)
+
+Fields:
+- `message` (required): string. Prompt shown to the human reviewer when feedback is requested.
+- `emit` (optional): list[string] | null; default `null`. Allowed feedback outcomes. When set, the method routes like a router using the selected outcome.
+- `llm` (optional): any; default `gpt-4o-mini`. LLM configuration used to assist or process human feedback.
+- `default_outcome` (optional): string | null; default `null`. Outcome to use when feedback cannot be collected.
+- `metadata` (optional): map of string to any | null; default `null`. Serializable metadata attached to the feedback request.
+- `provider` (optional): any; default `null`. Feedback provider configuration or import reference.
+- `learn` (optional): boolean; default `false`. Whether feedback should be recorded for later learning workflows.
+- `learn_source` (optional): string; default `hitl`. Source label attached to learned feedback records.
+- `learn_strict` (optional): boolean; default `false`. Whether learning should enforce strict validation of feedback data.
+
+### Cross-Field Rules
+
+- A method has exactly one `do` action object with one `call` discriminator.
+- `listen` targets method names and router-emitted event names in one shared namespace.
+- A router method result must match one declared `emit` value.
+- Crew action-level `inputs` are the Crew kickoff inputs; use CEL-wrapped strings there for runtime values.
+- Crew agent/task interpolation uses `{name}` placeholders from evaluated crew inputs.
+- Agent `with.input` must be text. Use `${outputs.method_name.raw}` or a text field like `${outputs.method_name.json_dict.summary}`.
+- `each.do` must contain at least one named step.
--- a/lib/cli/tests/test_create_flow.py
+++ b/lib/cli/tests/test_create_flow.py
@@ -26,6 +26,9 @@ def test_create_flow_declarative_project_can_run(
    assert pyproject["project"]["requires-python"]
    assert pyproject["project"]["dependencies"]
    assert (project_root / pyproject["tool"]["crewai"]["definition"]).is_file()
+    agents_md = (project_root / "AGENTS.md").read_text(encoding="utf-8")
+    assert "CrewAI Flow declaration" in agents_md
+    assert "schema: crewai.flow/v1" in agents_md

    monkeypatch.chdir(project_root)
    result = CliRunner().invoke(crewai, ["run"], env={"UV_RUN_RECURSION_DEPTH": "1"})