Loading...
Loading...
Browser automation for AI agents. Use when the user needs to interact with websites, navigate pages, fill forms, click buttons, take screenshots, extract data, test web apps, or automate any browser task. Triggers include "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data", "test this web app", "login to a site", or any task requiring programmatic web interaction.
npx skill4agent add nikhilmaddirala/gtd-cc web-browser@e1@e2opensnapshot -isub-skills/| Sub-skill | When to use | Triggers |
|---|---|---|
| agent-browser.md | Interactive exploration, AI-driven navigation, unknown page structure | "navigate to", "fill this form", "click the button", "scrape this page", "explore the site" |
| playwright.md | Scripted automation, testing, batch screenshots, codegen | "write a script", "generate test", "batch screenshot", "record my actions", "create automation script" |
sub-skills/agent-browser.mdplaywright.md# Session isolation (generate random slug like bright-falcon)
npx agent-browser --session <slug> open https://example.com
npx agent-browser --session <slug> snapshot -i
npx agent-browser --session <slug> click @e1
npx agent-browser --session <slug> fill @e2 "text"# Quick screenshot
npx playwright screenshot https://example.com output.png
# Record interactions as code
npx playwright codegen https://example.com
# PDF generation
npx playwright pdf https://example.com output.pdf