AI-powered browser automation toolset comprising agent-browser (accessibility tree extraction), actionbook (50+ website automation recipes), and browser-use (a Python automation library). Use cases: (1) scraping web content that requires JS rendering; (2) fetching data from platforms such as X/Twitter, GitHub, and Reddit; (3) taking web page screenshots; (4) automating browser operations; (5) retrieving the accessibility tree of a web page. Use this skill when you need to access dynamic web pages, bypass anti-scraping measures, or perform browser automation.
npx skill4agent add azure12355/weilan-skills browser-agent

User Request
│
├── Simple static content scraping?
│   └── Use curl / WebFetch (faster)
│
├── Need JS rendering / bypass anti-scraping?
│   ├── agent-browser ── Extract accessibility tree
│   │
│   ├── Screenshot? ── agent-browser -s
│   │
│   └── Target site in actionbook list?
│       └── actionbook get <site> ── Get dedicated recipe
│
└── Complex multi-step automation?
    └── browser-use (Python) ── AI-powered autonomous operation

# Extract web content (accessibility tree)
agent-browser <URL>
# Take screenshot
agent-browser -s <URL>
# Specify output format
agent-browser -f markdown <URL>
agent-browser -f html <URL>
agent-browser -f text <URL>
# Interactive mode (click, scroll available)
agent-browser -i <URL>
# Specify browser
agent-browser --browser chromium <URL>
agent-browser --browser firefox <URL>

# Get X/Twitter post content
agent-browser "https://x.com/username/status/123456"
# Get GitHub repository information
agent-browser "https://github.com/owner/repo"
# Get Reddit post
agent-browser "https://reddit.com/r/subreddit/comments/abc123"
# Get news article (JS-rendered)
agent-browser "https://example.com/article"

# List all supported websites
actionbook list
# Get recipe for a specific site
actionbook get <site>
# Examples
actionbook get github
actionbook get reddit
actionbook get amazon

pip install browser-use
playwright install chromium

import asyncio

from browser_use import Agent
from langchain_openai import ChatOpenAI


async def main():
    agent = Agent(
        task="Go to GitHub and find the trending Python repositories",
        llm=ChatOpenAI(model="gpt-4"),
    )
    result = await agent.run()
    print(result)


# Run the coroutine; without this call main() never executes
asyncio.run(main())

# Form filling
agent = Agent(
    task="Go to example.com and fill out the contact form with test data",
    llm=llm,
)
# Data scraping
agent = Agent(
    task="Go to Amazon, search for 'wireless headphones', and extract the top 5 products with prices",
    llm=llm,
)
# Multi-step operations
agent = Agent(
    task="Log into Twitter, navigate to settings, and enable two-factor authentication",
    llm=llm,
)

| Task Type | Recommended Tool | Reason |
|---|---|---|
| Quick single-page scraping | agent-browser | Simple and straightforward, accessibility tree output |
| Need page screenshots | agent-browser -s | Built-in screenshot functionality |
| Target site is in actionbook | actionbook + browser-use | Ready-made best practices available |
| Complex multi-step operations | browser-use | Autonomous AI decision-making and execution |
| Sites requiring login | browser-use | Can handle login flows |
| Batch data collection | browser-use | Supports loops and conditional logic |
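
The selection rules in the table can be sketched as a small Python helper. This is only an illustration: the `Task` class, its field names, and the `recommend_tool` function are hypothetical and not part of any of the three tools, and the priority order among overlapping rows (login and multi-step first, then actionbook, then screenshots) is an assumption, since the table does not rank them.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Hypothetical description of a browsing task (not part of any tool's API)."""
    needs_js: bool = False            # page needs JS rendering or blocks plain HTTP clients
    wants_screenshot: bool = False    # a page screenshot is the desired output
    site_in_actionbook: bool = False  # site appears in the output of `actionbook list`
    multi_step: bool = False          # several dependent steps, loops, or batch collection
    needs_login: bool = False         # the flow starts with a login


def recommend_tool(task: Task) -> str:
    """Map a task to the tool named in the table (priority order is an assumption)."""
    if task.multi_step or task.needs_login:
        return "browser-use"
    if task.site_in_actionbook:
        return "actionbook + browser-use"
    if task.wants_screenshot:
        return "agent-browser -s"
    if task.needs_js:
        return "agent-browser"
    # Static content: a plain HTTP fetch is faster than starting a browser
    return "curl / WebFetch"
```

For example, `recommend_tool(Task(needs_js=True, wants_screenshot=True))` returns `"agent-browser -s"`, while a default `Task()` falls through to `"curl / WebFetch"`.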
# Method 1: Directly use agent-browser (recommended)
agent-browser "https://x.com/username/status/123456"
# Method 2: Use browser-use for more complex operations
# Write a Python script

# Method 1: agent-browser
agent-browser "https://github.com/trending"
# Method 2: Use actionbook to get GitHub recipe
actionbook get github
# Then write a script based on the recipe

| Issue | Solution |
|---|---|
| Page loading timeout | Use |
| Content not rendered | Use interactive mode (agent-browser -i <URL>) |
| Anti-scraping block | Try different user-agents or use browser-use |
| Blank screenshot | Ensure the page is fully loaded before taking the screenshot |
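
For timeouts and intermittent blocks, one option is to wrap the CLI call in a timeout-and-retry loop. The following is a minimal sketch, not a documented interface: `build_command` and `run_with_retry` are hypothetical helper names, the flags (`-s`, `-f`, `--browser`) are the ones listed in the commands above, and it is an assumption that agent-browser accepts them in combination, writes its result to stdout, and exits non-zero on failure.

```python
import subprocess
import time


def build_command(url, fmt=None, screenshot=False, browser=None):
    """Assemble an agent-browser argument list from the flags shown in this document."""
    cmd = ["agent-browser"]
    if screenshot:
        cmd.append("-s")
    if fmt:
        cmd += ["-f", fmt]
    if browser:
        cmd += ["--browser", browser]
    cmd.append(url)
    return cmd


def run_with_retry(cmd, timeout_s=60.0, attempts=3, delay_s=2.0):
    """Run cmd; retry on timeout or non-zero exit. Return stdout or raise RuntimeError."""
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            done = subprocess.run(
                cmd, capture_output=True, text=True, timeout=timeout_s
            )
            if done.returncode == 0:
                return done.stdout
            last_error = RuntimeError(
                f"exit code {done.returncode}: {done.stderr.strip()}"
            )
        except subprocess.TimeoutExpired as exc:
            # The child process is killed by subprocess.run before this is raised
            last_error = exc
        if attempt < attempts:
            time.sleep(delay_s)  # brief pause before the next attempt
    raise RuntimeError(f"command failed after {attempts} attempts") from last_error
```

A call such as `run_with_retry(build_command("https://example.com/article", fmt="markdown"))` would then retry up to three times. If the agent-browser executable is not installed, `subprocess.run` raises `FileNotFoundError`, which this sketch deliberately does not catch.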