Search Results: web-crawling
Found 17 Skills
tavily-best-practices
Build production-ready Tavily integrations with best practices baked in. Reference documentation for developers using coding assistants (Claude Code, Cursor, etc.) to implement web search, content extraction, crawling, and research in agentic workflows, RAG systems, or autonomous agents.
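For orientation, a minimal sketch of the search-plus-extract flow these practices target, using the tavily-python client. It assumes a TAVILY_API_KEY environment variable; the query is a placeholder and the response field names follow the SDK's documented shape but should be checked against current docs:

```python
# Minimal Tavily search + extract flow. Assumes the tavily-python package
# and a TAVILY_API_KEY env var; the query is a placeholder.
import os
from tavily import TavilyClient

client = TavilyClient(api_key=os.environ["TAVILY_API_KEY"])

# Search returns ranked results with title, url, and content snippets.
results = client.search("retrieval-augmented generation best practices", max_results=5)

# Extract pulls full page content for the top hits, e.g. to feed a RAG index.
urls = [r["url"] for r in results["results"]]
extracted = client.extract(urls=urls)
for page in extracted["results"]:
    print(page["url"], len(page["raw_content"]))
```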
tavily-crawl
Crawl websites and extract content from multiple pages via the Tavily CLI. Use this skill when the user wants to crawl a site, download documentation, extract an entire docs section, bulk-extract pages, save a site as local markdown files, or says "crawl", "get all the pages", "download the docs", "extract everything under /docs", "bulk extract", or needs content from many pages on the same domain. Supports depth/breadth control, path filtering, semantic instructions, and saving each page as a local markdown file.
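The skill itself drives the Tavily CLI; as a rough analogue of what it does under the hood, here is a sketch against Tavily's crawl endpoint via the Python SDK. The parameter names (max_depth, max_breadth, select_paths, instructions) are assumptions based on Tavily's crawl API and should be verified against current docs:

```python
# Sketch of a depth/breadth-limited, path-filtered crawl saved as local
# markdown files, mirroring what the skill describes. Parameter names are
# assumptions from Tavily's crawl API; URL and paths are placeholders.
import os
from pathlib import Path
from tavily import TavilyClient

client = TavilyClient(api_key=os.environ["TAVILY_API_KEY"])
crawl = client.crawl(
    url="https://docs.example.com",
    max_depth=2,                  # link hops from the start URL
    max_breadth=20,               # max links followed per page
    select_paths=["/docs/.*"],    # regex filter: stay under /docs
    instructions="Only pages about the REST API",  # semantic filter
)

out = Path("crawled")
out.mkdir(exist_ok=True)
for i, page in enumerate(crawl["results"]):
    (out / f"page_{i}.md").write_text(page["raw_content"], encoding="utf-8")
```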
multi-search-engine
Multi-search-engine integration with 17 engines (8 Chinese + 9 global). Supports advanced search operators, time filters, site-restricted search, privacy-focused engines, and WolframAlpha knowledge queries. No API keys required.
crawl
Crawl any website and save pages as local markdown files. Use when you need to download documentation, knowledge bases, or web content for offline access or analysis. No code required: just provide a URL.
crawl4ai
This skill should be used when users need to scrape websites, extract structured data, handle JavaScript-heavy pages, crawl multiple URLs, or build automated web data pipelines. Includes optimized extraction patterns with schema generation for efficient, LLM-free extraction.
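A sketch of the schema-driven, LLM-free extraction pattern the description refers to: a CSS-selector schema is applied to every element matching a base selector, so no model call is needed per page. The URL and selectors are placeholders, and import paths may vary slightly between crawl4ai versions:

```python
# Schema-based extraction with crawl4ai: one JSON record per element
# matching baseSelector, no LLM involved. Selectors are placeholders.
import asyncio, json
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig
from crawl4ai.extraction_strategy import JsonCssExtractionStrategy

schema = {
    "name": "articles",
    "baseSelector": "article.post",   # one record per matching element
    "fields": [
        {"name": "title", "selector": "h2", "type": "text"},
        {"name": "link", "selector": "a", "type": "attribute", "attribute": "href"},
    ],
}

async def main():
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(
            url="https://example.com/blog",
            config=CrawlerRunConfig(extraction_strategy=JsonCssExtractionStrategy(schema)),
        )
        print(json.dumps(json.loads(result.extracted_content), indent=2))

asyncio.run(main())
```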
a-stock-daily-report
Automated daily-briefing generator for China's A-share market. Crawls real-time data from East Money and produces reports covering market indices, trending sectors, and capital flows.
site-crawler
Crawl and extract content from websites.
crawl4ai
Use when crawling web pages, extracting markdown content, or scraping website data with intelligent chunking and skeleton planning. Use when the user provides a URL or link to fetch or crawl.
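The chunking and skeleton-planning steps are skill-specific, but the core fetch-to-markdown path is plain crawl4ai; a minimal sketch, with the URL as a placeholder:

```python
# Minimal crawl4ai fetch: given a user-provided URL, return the page
# rendered as markdown. URL is a placeholder.
import asyncio
from crawl4ai import AsyncWebCrawler

async def fetch_markdown(url: str) -> str:
    async with AsyncWebCrawler() as crawler:
        result = await crawler.arun(url=url)
        return str(result.markdown)

print(asyncio.run(fetch_markdown("https://example.com")))
```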
jb-docs-scraper
Scrape documentation websites into local markdown files for AI context. Takes a base URL and crawls the documentation, storing results in ./docs (or custom path). Uses crawl4ai with BFS deep crawling.
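A rough sketch of the BFS deep-crawl pattern the skill builds on, using crawl4ai's deep-crawling support. Class and import names follow recent crawl4ai releases and should be verified against the installed version; the URL and output path are placeholders:

```python
# BFS deep crawl of a docs site into ./docs as markdown files.
# BFSDeepCrawlStrategy/CrawlerRunConfig names follow recent crawl4ai
# releases (an assumption to verify); URL and paths are placeholders.
import asyncio
from pathlib import Path
from crawl4ai import AsyncWebCrawler, CrawlerRunConfig
from crawl4ai.deep_crawling import BFSDeepCrawlStrategy

async def scrape_docs(base_url: str, out_dir: str = "./docs") -> None:
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    config = CrawlerRunConfig(
        deep_crawl_strategy=BFSDeepCrawlStrategy(max_depth=2, include_external=False),
    )
    async with AsyncWebCrawler() as crawler:
        # With a deep-crawl strategy, arun returns one result per page visited.
        results = await crawler.arun(base_url, config=config)
        for i, page in enumerate(results):
            (out / f"page_{i:03d}.md").write_text(str(page.markdown), encoding="utf-8")

asyncio.run(scrape_docs("https://docs.example.com"))
```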
spider
Web crawling and scraping with analysis. Use for crawling websites, security scanning, and extracting information from web pages.
content-source-aggregator
Unified hot-content aggregation across information sources. Fetches trending content for free from six platforms (X/Twitter, YouTube, Bilibili, GitHub, Reddit, LinuxDo) and outputs a standardized hot-topic pool for content-creation pipelines. Uses only free public APIs; no payment required.
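As one concrete slice of such a pipeline, a sketch that pulls trending items from one of the six sources (GitHub, via its public search API, which needs no key but is rate-limited) and normalizes them into a shared hot-pool record; the {source, title, heat, url} record schema is an illustrative assumption, not the skill's actual format:

```python
# Hypothetical hot-pool normalization for the GitHub source. The unified
# record schema {source, title, heat, url} is an assumption; the GitHub
# search API itself is real and keyless (but rate-limited).
from datetime import date, timedelta
import requests

def github_trending(n: int = 10) -> list[dict]:
    since = (date.today() - timedelta(days=7)).isoformat()
    resp = requests.get(
        "https://api.github.com/search/repositories",
        params={"q": f"created:>{since}", "sort": "stars", "order": "desc", "per_page": n},
        timeout=10,
    )
    resp.raise_for_status()
    return [
        {
            "source": "github",
            "title": repo["full_name"],
            "heat": repo["stargazers_count"],  # stars as a popularity proxy
            "url": repo["html_url"],
        }
        for repo in resp.json()["items"]
    ]

pool = sorted(github_trending(), key=lambda item: item["heat"], reverse=True)
```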
toutiao-news-trends
Fetches trending news and hot-search data from Toutiao (www.toutiao.com), covering popular Chinese news across current affairs, finance, society, international news, technology, and entertainment. Outputs hot-topic titles, popularity scores, and article links.
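A minimal sketch of fetching the hot board; the endpoint and response field names (data / Title / HotValue / Url) are assumptions drawn from the public JSON feed commonly used by trending aggregators, and may change without notice:

```python
# Sketch of fetching Toutiao's hot board. The endpoint and field names
# are ASSUMPTIONS based on the publicly observed JSON feed; verify them
# before relying on this in a pipeline.
import requests

resp = requests.get(
    "https://www.toutiao.com/hot-event/hot-board/",
    params={"origin": "toutiao_pc"},
    headers={"User-Agent": "Mozilla/5.0"},  # bare requests are often blocked
    timeout=10,
)
resp.raise_for_status()
for item in resp.json().get("data", []):
    print(item.get("Title"), item.get("HotValue"), item.get("Url"))
```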