Found 24 Skills
Run tests from skill examples and generate a report (project)
Testing framework for evaluating Databricks skills. Use when building test cases for skills, running skill evaluations, comparing skill versions, or creating ground truth datasets with the Generate-Review-Promote (GRP) pipeline. Triggers include "test skill", "evaluate skill", "skill regression", "ground truth", "GRP pipeline", "skill quality", and "skill metrics".
Deep test, analyze, and audit Claude skills. Use this skill whenever the user wants to test a skill's behavior, analyze how it uses the Claude API, inspect inputs/outputs from scripts, or run security and code review audits against skill scripts. Trigger on: "test my skill", "analyze this skill", "audit skill scripts", "review skill for security issues", "what does this skill actually do when it runs", "inspect API calls from skill", "run a skill through its paces", "check my skill for bugs or vulnerabilities". Also trigger when the user shows you a SKILL.md and asks you to evaluate, critique, or stress-test it.
Minimal validation for crawl-and-skill workflow readiness.
AI-powered E2E testing for any app. Test 8 platforms with natural language — no test code needed.
Sample skill for testing the skill-tester validation pipeline. Demonstrates proper skill structure with scripts, references, and assets.
For trying out a Skill on Day 1. Use for "/day1-test-skill" or "테스트 스킬" ("test skill") requests.
Use when creating or editing skills, before deployment, to verify that they work under pressure and resist rationalization. Applies the RED-GREEN-REFACTOR cycle to process documentation: run a baseline without the skill, write the skill to address the failures, then iterate to close loopholes.
Minimal smoke test for product API discovery skill. Validate product pull, merge, and one metadata fetch.
Smoke test for alicloud-ai-content-aicontent. Validate minimal authentication, API reachability, and one read-only query path.
Day 1 test skill. Runs when "/day1-test-skill" is entered. Intended for hands-on experience of how a Skill works.
Smoke test for alicloud-compute-fc-agentrun. Validate minimal authentication, API reachability, and one read-only query path.