scrape

Original🇺🇸 English
Translated
1 scriptsChecked / no sensitive code detected

Scrape any webpage as clean markdown via Bright Data Web Unlocker API. Bypasses bot detection and CAPTCHA. Requires BRIGHTDATA_API_KEY and BRIGHTDATA_UNLOCKER_ZONE environment variables.

3installs
Added on

NPX Install

npx skill4agent add brightdata/skills scrape

Tags

Translated version includes tags in frontmatter

Bright Data - Web Scraper

Scrape any webpage and get clean markdown content using Bright Data's Web Unlocker API. Automatically bypasses bot detection and CAPTCHA.

Setup

1. Get your API Key: Get a key from Bright Data Dashboard.
2. Create a Web Unlocker zone: Create a zone at brightdata.com/cp by clicking "Add" (top-right), selecting "Unlocker zone".
3. Set environment variables:
bash
export BRIGHTDATA_API_KEY="your-api-key"
export BRIGHTDATA_UNLOCKER_ZONE="your-zone-name"

Usage

bash
bash scripts/scrape.sh "url"
Parameters:
  • url
    (required): The webpage URL to scrape
Examples:
bash
# Scrape a news article
bash scripts/scrape.sh "https://example.com/article"

# Scrape a product page
bash scripts/scrape.sh "https://shop.example.com/product/123"

Output Format

Returns clean markdown content extracted from the webpage:
markdown
# Page Title

Main content of the page converted to markdown format...

## Section Heading

More content...

Features

  • Bot Detection Bypass: Automatically handles anti-bot measures
  • CAPTCHA Solving: Bypasses CAPTCHA challenges
  • Clean Markdown: Returns well-formatted markdown content
  • JavaScript Rendering: Handles JavaScript-heavy pages

Dependencies

  • curl
    - For API requests