* feat: add crewai-tools workspace structure * Squashed 'temp-crewai-tools/' content from commit 9bae5633 git-subtree-dir: temp-crewai-tools git-subtree-split: 9bae56339096cb70f03873e600192bd2cd207ac9 * feat: configure crewai-tools workspace package with dependencies * fix: apply ruff auto-formatting to crewai-tools code * chore: update lockfile * fix: don't allow tool tests yet * fix: comment out extra pytest flags for now * fix: remove conflicting conftest.py from crewai-tools tests * fix: resolve dependency conflicts and test issues - Pin vcrpy to 7.0.0 to fix pytest-recording compatibility - Comment out types-requests to resolve urllib3 conflict - Update requests requirement in crewai-tools to >=2.32.0
2.7 KiB
BrightData Tools Documentation
Description
A comprehensive suite of CrewAI tools that leverage Bright Data's powerful infrastructure for web scraping, data extraction, and search operations. These tools provide three distinct capabilities:
- BrightDataDatasetTool: Extract structured data from popular data feeds (Amazon, LinkedIn, Instagram, etc.) using pre-built datasets
- BrightDataSearchTool: Perform web searches across multiple search engines with geo-targeting and device simulation
- BrightDataWebUnlockerTool: Scrape any website content while bypassing bot protection mechanisms
Installation
To incorporate these tools into your project, follow the installation instructions below:
pip install crewai[tools] aiohttp requests
Examples
Dataset Tool - Extract Amazon Product Data
from crewai_tools import BrightDataDatasetTool
# Initialize with specific dataset and URL
tool = BrightDataDatasetTool(
dataset_type="amazon_product",
url="https://www.amazon.com/dp/B08QB1QMJ5/"
)
result = tool.run()
Search Tool - Perform Web Search
from crewai_tools import BrightDataSearchTool
# Initialize with search query
tool = BrightDataSearchTool(
query="latest AI trends 2025",
search_engine="google",
country="us"
)
result = tool.run()
Web Unlocker Tool - Scrape Website Content
from crewai_tools import BrightDataWebUnlockerTool
# Initialize with target URL
tool = BrightDataWebUnlockerTool(
url="https://example.com",
data_format="markdown"
)
result = tool.run()
Steps to Get Started
To effectively use the BrightData Tools, follow these steps:
-
Package Installation: Confirm that the
crewai[tools]package is installed in your Python environment. -
API Key Acquisition: Register for a Bright Data account at
https://brightdata.com/and obtain your API credentials from your account settings. -
Environment Configuration: Set up the required environment variables:
export BRIGHT_DATA_API_KEY="your_api_key_here" export BRIGHT_DATA_ZONE="your_zone_here" -
Tool Selection: Choose the appropriate tool based on your needs:
- Use DatasetTool for structured data from supported platforms
- Use SearchTool for web search operations
- Use WebUnlockerTool for general website scraping
Conclusion
By integrating BrightData Tools into your CrewAI agents, you gain access to enterprise-grade web scraping and data extraction capabilities. These tools handle complex challenges like bot protection, geo-restrictions, and data parsing, allowing you to focus on building your applications rather than managing scraping infrastructure.