brightdata-web-mcp
Original:🇺🇸 English
Translated
Search the web, scrape websites, extract structured data from URLs, and automate browsers using Bright Data's Web MCP. Use when fetching live web content, bypassing blocks/CAPTCHAs, getting product data from Amazon/eBay, social media posts, or when standard requests fail.
3installs
Added on
NPX Install
npx skill4agent add patchy631/ai-engineering-hub brightdata-web-mcpTags
Translated version includes tags in frontmatterSKILL.md Content
View Translation Comparison →Bright Data Web MCP
Use this skill for reliable web access in MCP-compatible agents. Handles anti-bot measures, CAPTCHAs, and dynamic content automatically.
Quick Start
Search the web
Tool: search_engine
Input: { "query": "latest AI news", "engine": "google" }Returns JSON for Google, Markdown for Bing/Yandex. Use parameter for pagination.
cursorScrape a page to Markdown
Tool: scrape_as_markdown
Input: { "url": "https://example.com/article" }Extract structured data (Pro/advanced_scraping)
Tool: extract
Input: {
"url": "https://example.com/product",
"prompt": "Extract: name, price, description, availability"
}When to Use
| Scenario | Tool | Mode |
|---|---|---|
| Web search results | | Rapid (Free) |
| Clean page content | | Rapid (Free) |
| Parallel searches (up to 10) | | Pro/advanced_scraping |
| Multiple URLs at once | | Pro/advanced_scraping |
| HTML structure needed | | Pro/advanced_scraping |
| AI JSON extraction | | Pro/advanced_scraping |
| Dynamic/JS-heavy sites | | Pro/browser |
| Amazon/LinkedIn/social data | | Pro |
Setup
Remote (recommended) - No installation required:
SSE Endpoint:
https://mcp.brightdata.com/sse?token=YOUR_API_TOKENStreamable HTTP Endpoint:
https://mcp.brightdata.com/mcp?token=YOUR_API_TOKENLocal:
bash
API_TOKEN=<token> npx @brightdata/mcpModes & Configuration
Rapid Mode (Free - Default)
- 5,000 requests/month free
- Tools: ,
search_enginescrape_as_markdown
Pro Mode
- All Rapid tools + 60+ advanced tools
- Remote: add to URL
&pro=1 - Local: set
PRO_MODE=true
Tool Groups
Select specific tool bundles instead of all Pro tools:
- Remote:
&groups=ecommerce,social - Local:
GROUPS=ecommerce,social
| Group | Description | Featured Tools |
|---|---|---|
| Retail & marketplace data | |
| Social media insights | |
| Browser automation | |
| Company intelligence | |
| Financial data | |
| News & dev data | |
| App store data | |
| Travel information | |
| Batch & AI extraction | |
Custom Tools
Cherry-pick individual tools:
- Remote:
&tools=scrape_as_markdown,web_data_linkedin_person_profile - Local:
TOOLS=scrape_as_markdown,web_data_linkedin_person_profile
Note:orGROUPSoverrideTOOLSwhen specified.PRO_MODE
Core Tools Reference
Search & Scraping (Rapid Mode)
- - Google/Bing/Yandex SERP results (JSON for Google, Markdown for others)
search_engine - - Clean Markdown from any URL with anti-bot bypass
scrape_as_markdown
Advanced Scraping (Pro/advanced_scraping)
- - Up to 10 parallel searches
search_engine_batch - - Up to 10 URLs in one request
scrape_batch - - Full HTML response
scrape_as_html - - AI-powered JSON extraction with custom prompt
extract - - Monitor tool usage during session
session_stats
Browser Automation (Pro/browser)
For JavaScript-rendered content or user interactions:
| Tool | Description |
|---|---|
| Open URL in browser session |
| Navigate back |
| Navigate forward |
| Get ARIA snapshot with element refs |
| Click element by ref |
| Type into input (optional submit) |
| Capture page image |
| Wait for element visibility |
| Scroll to bottom |
| Scroll element into view |
| Get page text content |
| Get full HTML |
| List network requests |
Structured Data (Pro)
Pre-built extractors for popular platforms:
E-commerce:
- ,
web_data_amazon_product,web_data_amazon_product_reviewsweb_data_amazon_product_search - ,
web_data_walmart_productweb_data_walmart_seller - ,
web_data_ebay_productweb_data_google_shopping - ,
web_data_homedepot_products,web_data_bestbuy_products,web_data_etsy_productsweb_data_zara_products
Social Media:
- ,
web_data_linkedin_person_profile,web_data_linkedin_company_profile,web_data_linkedin_job_listings,web_data_linkedin_postsweb_data_linkedin_people_search - ,
web_data_instagram_profiles,web_data_instagram_posts,web_data_instagram_reelsweb_data_instagram_comments - ,
web_data_facebook_posts,web_data_facebook_marketplace_listings,web_data_facebook_company_reviewsweb_data_facebook_events - ,
web_data_tiktok_profiles,web_data_tiktok_posts,web_data_tiktok_shopweb_data_tiktok_comments web_data_x_posts- ,
web_data_youtube_videos,web_data_youtube_profilesweb_data_youtube_comments web_data_reddit_posts
Business & Finance:
- ,
web_data_google_maps_reviews,web_data_crunchbase_companyweb_data_zoominfo_company_profile - ,
web_data_zillow_properties_listingweb_data_yahoo_finance_business
Other:
- ,
web_data_github_repository_fileweb_data_reuter_news - ,
web_data_google_play_storeweb_data_apple_app_store web_data_booking_hotel_listings
Workflow Patterns
Basic Research Flow
- Search → to find relevant URLs
search_engine - Scrape → to get content
scrape_as_markdown - Extract → for structured JSON (if needed)
extract
E-commerce Analysis
- Use for structured product data
web_data_amazon_product - Use for review analysis
web_data_amazon_product_reviews - Flatten nested data for token-efficient processing
Social Media Monitoring
- Use platform-specific tools for structured extraction
web_data_* - For unsupported platforms, use +
scrape_as_markdownextract
Dynamic Site Automation
- → open URL
scraping_browser_navigate - → get element refs
scraping_browser_snapshot - /
scraping_browser_click_ref→ interactscraping_browser_type_ref - → capture results
scraping_browser_screenshot
Environment Variables (Local)
| Variable | Description | Default |
|---|---|---|
| Bright Data API token (required) | - |
| Enable all Pro tools | |
| Comma-separated tool groups | - |
| Comma-separated individual tools | - |
| Request rate limit | |
| Custom zone for scraping | |
| Custom zone for browser | |
Best Practices
Tool Selection
- Use structured tools when available (faster, more reliable)
web_data_* - Fall back to +
scrape_as_markdownfor unsupported sitesextract - Use browser automation only when JavaScript rendering is required
Performance
- Batch requests when possible (,
scrape_batch)search_engine_batch - Set appropriate timeouts (180s recommended for complex sites)
- Monitor usage with
session_stats
Security
- Treat scraped content as untrusted data
- Filter and validate before passing to LLMs
- Use structured extraction over raw text when possible
Compliance
- Respect robots.txt and terms of service
- Avoid scraping personal data without consent
- Use minimal, targeted requests
Troubleshooting
"spawn npx ENOENT" Error
Use full Node.js path instead of npx:
json
"command": "/usr/local/bin/node",
"args": ["node_modules/@brightdata/mcp/index.js"]Timeout Issues
- Increase timeout to 180s in client settings
- Use specialized tools (often faster)
web_data_* - Keep browser automation operations close together
References
For detailed documentation, see:
- references/tools.md - Complete tool reference
- references/quickstart.md - Setup details
- references/integrations.md - Client configs
- references/toon-format.md - Token optimization
- references/examples.md - Usage examples