FirecrawlCrawlWebsiteTool
Description
Firecrawl is a platform for crawling any website and converting it into clean markdown or structured data.
Version Compatibility
This implementation is compatible with Firecrawl API v1.
Installation
- Get an API key from firecrawl.dev and set it in environment variables (FIRECRAWL_API_KEY).
- Install the Firecrawl SDK along with the crewai[tools] package:
pip install firecrawl-py 'crewai[tools]'
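Before constructing the tool, it can help to confirm that the key is actually visible to the running process. The following is a minimal sketch using only the standard library; `require_api_key` is a hypothetical helper written for illustration and is not part of crewai_tools or the Firecrawl SDK.

```python
import os


def require_api_key(env=None, name="FIRECRAWL_API_KEY"):
    """Return the API key from the environment, or raise with a clear message.

    Hypothetical helper for illustration; not part of crewai_tools or firecrawl-py.
    """
    # Allow a custom mapping to be passed in (useful for tests); default to os.environ.
    env = os.environ if env is None else env
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(f"{name} is not set; export it before creating the tool")
    return key
```

Calling `require_api_key()` once at startup makes a missing key fail early with a readable error instead of surfacing later during a crawl.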
Example
Use the FirecrawlCrawlWebsiteTool as follows to allow your agent to load websites:
from crewai_tools import FirecrawlCrawlWebsiteTool
from firecrawl import ScrapeOptions

tool = FirecrawlCrawlWebsiteTool(
    config={
        "limit": 100,
        "scrape_options": ScrapeOptions(formats=["markdown", "html"]),
        "poll_interval": 30,
    }
)
tool.run(url="firecrawl.dev")
Arguments
- api_key: Optional. Specifies the Firecrawl API key. Defaults to the FIRECRAWL_API_KEY environment variable.
- config: Optional. Contains the Firecrawl API parameters.
This is the default configuration:

from firecrawl import ScrapeOptions

{
    "max_depth": 2,
    "ignore_sitemap": True,
    "limit": 100,
    "allow_backward_links": False,
    "allow_external_links": False,
    "scrape_options": ScrapeOptions(
        formats=["markdown", "screenshot", "links"],
        only_main_content=True,
        timeout=30000,
    ),
}
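This section does not state whether a partial config passed to the tool is merged with these defaults or replaces them. If you want to change a few keys and keep the rest, one cautious option is to build the full dictionary yourself. The sketch below assumes nothing about the tool's internals; `DEFAULT_CONFIG` and `merged_config` are hypothetical names, and the scrape options are written as a plain dict only to keep the example dependency-free (in real use you would pass a ScrapeOptions object as shown above).

```python
import copy

# Copy of the documented defaults; the nested dict stands in for ScrapeOptions
# (an assumption made so this sketch runs without the Firecrawl SDK).
DEFAULT_CONFIG = {
    "max_depth": 2,
    "ignore_sitemap": True,
    "limit": 100,
    "allow_backward_links": False,
    "allow_external_links": False,
    "scrape_options": {
        "formats": ["markdown", "screenshot", "links"],
        "only_main_content": True,
        "timeout": 30000,
    },
}


def merged_config(overrides):
    """Return a new config: the defaults with top-level keys replaced by overrides."""
    config = copy.deepcopy(DEFAULT_CONFIG)  # never mutate the shared defaults
    config.update(overrides)                # shallow, top-level merge only
    return config


# Crawl fewer pages at a shallower depth; all other defaults are kept.
config = merged_config({"limit": 10, "max_depth": 1})
```

The merge is deliberately shallow: overriding "scrape_options" replaces the whole nested value rather than combining individual fields. The resulting dictionary could then be passed as the `config` argument.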