Extract clean markdown or text content from specific URLs via the Tavily CLI. Use this skill when the user has one or more URLs and wants their content, says "extract", "grab the content from", "pull the text from", "get the page at", "read this webpage", or needs clean text from web pages. Handles JavaScript-rendered pages, returns LLM-optimized markdown, and supports query-focused chunking for targeted extraction. Can process up to 20 URLs in a single call.
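Since a single call accepts at most 20 URLs, a longer list has to be split across several calls. The following is a minimal sketch of that batching, assuming the URLs arrive one per line on standard input. The function name `tvly_batch_commands` is hypothetical, and it only prints the commands so they can be reviewed before anything is run.

```sh
# Hypothetical helper (not part of the Tavily CLI): print one
# `tvly extract` command per batch of URLs read from stdin.
# Optional first argument: batch size (defaults to the 20-URL limit).
tvly_batch_commands() {
  max="${1:-20}"
  set --  # reuse the positional parameters as the current batch
  while IFS= read -r url || [ -n "$url" ]; do
    [ -z "$url" ] && continue  # skip blank lines
    set -- "$@" "$url"
    if [ "$#" -eq "$max" ]; then
      printf 'tvly extract'
      printf ' "%s"' "$@"
      printf ' --json\n'
      set --
    fi
  done
  # Flush the final, partially filled batch.
  if [ "$#" -gt 0 ]; then
    printf 'tvly extract'
    printf ' "%s"' "$@"
    printf ' --json\n'
  fi
}
```

For example, `tvly_batch_commands < urls.txt` (where `urls.txt` is a hypothetical file) prints one command per group of 20. The sketch wraps each URL in plain double quotes, so URLs containing `"` or `$` would need stronger escaping before the output is executed.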
npx skill4agent add tavily-ai/skills tavily-extract

curl -fsSL https://cli.tavily.com/install.sh | bash && tvly login

# Single URL
tvly extract "https://example.com/article" --json
# Multiple URLs
tvly extract "https://example.com/page1" "https://example.com/page2" --json
# Query-focused extraction (returns relevant chunks only)
tvly extract "https://example.com/docs" --query "authentication API" --chunks-per-source 3 --json
# JS-heavy pages
tvly extract "https://app.example.com" --extract-depth advanced --json
# Save to file
tvly extract "https://example.com/article" -o article.md

| Option | Description |
|---|---|
| `--query` | Rerank chunks by relevance to this query |
| `--chunks-per-source` | Chunks per URL (1-5, requires `--query`) |
| `--extract-depth` | `basic` or `advanced` |
| | |
| | Include image URLs |
| `--timeout` | Max wait time (1-60 seconds) |
| `-o` | Save output to file |
| `--json` | Structured JSON output |
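
The documented ranges (1-5 chunks per URL, which only applies together with a query, and a 1-60 second wait) can be checked before the CLI is called. The sketch below does that and prints the resulting command. The function name `tvly_focused_cmd` is hypothetical, and passing the wait time as `--timeout <seconds>` is an assumption based on the option's description.

```sh
# Hypothetical wrapper (not part of the Tavily CLI): validate option
# ranges, then print a query-focused `tvly extract` command.
# Usage: tvly_focused_cmd URL QUERY [CHUNKS] [TIMEOUT_SECONDS]
tvly_focused_cmd() {
  url="${1:-}"; query="${2:-}"; chunks="${3:-3}"; timeout="${4:-}"
  if [ -z "$url" ] || [ -z "$query" ]; then
    echo "error: a URL and a query are both required" >&2
    return 2
  fi
  case "$chunks" in
    [1-5]) ;;  # a single digit in the documented 1-5 range
    *) echo "error: --chunks-per-source must be 1-5" >&2; return 2 ;;
  esac
  cmd="tvly extract \"$url\" --query \"$query\" --chunks-per-source $chunks"
  if [ -n "$timeout" ]; then
    case "$timeout" in
      *[!0-9]*) echo "error: --timeout must be a whole number" >&2; return 2 ;;
    esac
    if [ "$timeout" -lt 1 ] || [ "$timeout" -gt 60 ]; then
      echo "error: --timeout must be 1-60 seconds" >&2
      return 2
    fi
    # Assumption: the flag takes the number of seconds as its value.
    cmd="$cmd --timeout $timeout"
  fi
  printf '%s --json\n' "$cmd"
}
```

Calling `tvly_focused_cmd "https://example.com/docs" "authentication API" 3` prints the same query-focused command shown in the examples above, while an out-of-range value returns a non-zero status instead of reaching the CLI.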

| Depth | When to use |
|---|---|
| `basic` | Simple pages, fast; try this first |
| `advanced` | JS-rendered SPAs, dynamic content, tables |
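
The "try the fast depth first" advice can be scripted as a fallback. This is a minimal sketch, assuming that a failed call or empty output is a good enough signal that `basic` missed the content (in practice you might inspect the JSON instead). The function name `tvly_extract_with_fallback` is hypothetical.

```sh
# Hypothetical helper (not part of the Tavily CLI): try the `basic`
# depth first and retry with `advanced` if the call fails or prints
# nothing.
tvly_extract_with_fallback() {
  url="$1"
  for depth in basic advanced; do
    if out="$(tvly extract "$url" --extract-depth "$depth" --json)" && [ -n "$out" ]; then
      printf '%s\n' "$out"
      return 0
    fi
  done
  echo "error: extraction failed at both depths for $url" >&2
  return 1
}
```

Usage: `tvly_extract_with_fallback "https://app.example.com" > page.json`. Static pages cost only the one fast call, and JS-heavy pages cost one extra attempt.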

`--query`, `--chunks-per-source`, `basic`, `advanced`, `--timeout`, `--include-raw-content`