antithesis-triage
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseAntithesis Bug Triage
Antithesis Bug分类处理
Use the skill to read and triage Antithesis test reports.
agent-browser使用技能读取并分类处理Antithesis测试报告。
agent-browserGathering user input
收集用户输入
Before starting, collect the following from the user:
- Report URL or Tenant ID (required) — A full triage report URL like or just the tenant name. If neither is provided, ask the user.
https://TENANT.antithesis.com/... - What they want to know — Are they investigating a specific failure? Getting a general overview? Comparing runs? This determines which workflow to follow.
开始前,请从用户处收集以下信息:
- 报告URL或租户ID(必填)——完整的分类报告URL,例如,或仅租户名称。如果两者都未提供,请询问用户。
https://TENANT.antithesis.com/... - 用户需求——他们是在调查特定失败情况?还是需要整体概览?或是比较不同运行结果?这将决定采用哪种工作流程。
Reference files
参考文件
Each reference file contains the selectors and query file paths for a specific
task. Read the relevant file before performing that task.
| Page | When to read |
|---|---|
| Always — read first to set up the browser session |
| User wants to find or browse recent runs (no specific URL) |
| Need run title, date, or the Explore Logs link |
| Checking property pass/fail status, filtering properties |
| Checking which Docker images were used |
| Viewing behavioral diffs between runs |
| Checking test hours or behavior discovery rate |
| Investigating logs for a specific property example |
每个参考文件包含特定任务的选择器和查询文件路径。执行任务前请先阅读相关文件。
| 页面文件 | 阅读时机 |
|---|---|
| 必须优先阅读——用于设置浏览器会话 |
| 用户需要查找或浏览近期运行结果(无特定URL时) |
| 需要获取运行标题、日期或“浏览日志”链接时 |
| 检查属性的通过/失败状态、筛选属性时 |
| 检查使用了哪些Docker镜像时 |
| 查看不同运行结果间的行为差异时 |
| 检查测试时长或行为发现率时 |
| 调查特定属性示例的日志时 |
Query files
查询文件
Reference files contain references to queries. Queries are stored in the assets directory, organized by the page they are relevant to.
Use the following command pattern to evaluate a query file from the skill root:
cat assets/report/run-metadata.js \
| agent-browser eval --session-name "$SESSION" --stdinDo not run report queries in parallel with , hash-route
navigation, or any other command that can replace the page. Wait until the
target page is settled before starting calls. On a single browser
session, run report queries sequentially; property scripts mutate tab and
expansion state and will interfere with each other if you launch them in
parallel.
agent-browser openeval参考文件包含对查询的引用。查询文件存储在assets目录中,按相关页面分类组织。
使用以下命令格式从技能根目录执行查询文件:
cat assets/report/run-metadata.js \
| agent-browser eval --session-name "$SESSION" --stdin请勿将报告查询与、哈希路由导航或其他可能替换页面的命令并行运行。请等待目标页面加载完成后再启动调用。在单个浏览器会话中,请按顺序运行报告查询;属性脚本会修改标签页和展开状态,如果并行运行会互相干扰。
agent-browser openevalPage Loading Checks
页面加载检查
Each page type has a dedicated query. Use the matching
one before running page-specific queries.
loading-finished.jsCommand pattern:
for _ in $(seq 1 60); do
if [[ "$(
cat <loading-query-file> \
| agent-browser eval --session-name "$SESSION" --stdin
)" == "true" ]]; then
break
fi
sleep 1
doneIf the report page still is not ready after about 60 seconds, inspect the
current state before retrying:
cat assets/report/loading-status.js \
| agent-browser eval --session-name "$SESSION" --stdinUse these loading checks:
- Report page:
assets/report/loading-finished.js - Logs page:
assets/logs/loading-finished.js - Runs page:
assets/runs/loading-finished.js
The report-page loading check returns only when the main report
sections have finished loading, including findings, properties, environment,
and utilization. The report hydrates asynchronously after the browser
event, and findings are often the last section to settle.
trueloadReport queries are only valid on the main report view. If you navigate to an
internal hash route such as , reopen the original report
URL and rerun before using report queries
again.
#/run/.../finding/...assets/report/loading-finished.js每种页面类型都有对应的查询。在运行页面特定查询前,请先使用对应的加载检查查询。
loading-finished.js命令格式:
for _ in $(seq 1 60); do
if [[ "$(
cat <loading-query-file> \
| agent-browser eval --session-name "$SESSION" --stdin
)" == "true" ]]; then
break
fi
sleep 1
done如果报告页面在约60秒后仍未就绪,请先检查当前状态再重试:
cat assets/report/loading-status.js \
| agent-browser eval --session-name "$SESSION" --stdin使用以下加载检查:
- 报告页面:
assets/report/loading-finished.js - 日志页面:
assets/logs/loading-finished.js - 运行结果页面:
assets/runs/loading-finished.js
报告页面的加载检查仅在主报告区域(包括问题发现、属性、环境和利用率)全部加载完成后才会返回。报告在浏览器事件后异步加载内容,其中问题发现区域通常是最后加载完成的部分。
trueload报告查询仅在主报告视图中有效。如果您导航到内部哈希路由(如),请重新打开原始报告URL并重新运行,然后再使用报告查询。
#/run/.../finding/...assets/report/loading-finished.jsRecommended workflows
推荐工作流程
Quick overview of a run
运行结果快速概览
- Read — authenticate and open the report
references/setup-auth.md - Read — get the run title and date
references/run-metadata.md - Read — use the all-properties query for totals, then failed/passed/unfound queries only if you need filtered subsets
references/properties.md - Summarize: total properties, how many passed/failed/unfound, and flag any failures
- 阅读— 完成认证并打开报告
references/setup-auth.md - 阅读— 获取运行标题和日期
references/run-metadata.md - 阅读— 使用全属性查询获取总数,仅在需要筛选子集时使用失败/通过/未找到的查询
references/properties.md - 总结:属性总数、通过/失败/未找到的数量,并标记所有失败项
Investigate a failing property
失败属性调查
- Read — authenticate and open the report
references/setup-auth.md - Read — list properties, filter to failed
references/properties.md - Read — expand failed-property example tables, get log URLs, navigate to logs, find the highlighted assertion event and surrounding context
references/logs.md - Report the failure with: property name, assertion text, relevant log lines, and the timeline context
- 阅读— 完成认证并打开报告
references/setup-auth.md - 阅读— 列出所有属性,筛选出失败的属性
references/properties.md - 阅读— 展开失败属性示例表格,获取日志URL,导航到日志页面,找到高亮的断言事件及其上下文
references/logs.md - 报告失败信息:属性名称、断言文本、相关日志行和时间线上下文
Find a specific run
查找特定运行结果
- Read — authenticate
references/setup-auth.md - Read — browse the runs page to find the target run
references/run-discovery.md - Continue with any of the above workflows once on the report
- 阅读— 完成认证
references/setup-auth.md - 阅读— 浏览运行结果页面找到目标运行结果
references/run-discovery.md - 进入报告后,可继续执行上述任一工作流程
General guidance
通用指南
- Always authenticate first. Every session starts with setup-auth.
- Don't fabricate selectors. The triage report uses custom web components and non-obvious class names. Always consult the resource page for the correct queries.
- Keep report queries on the main report view. If you click into a finding-focused hash route, reopen the original report URL before using report queries again.
- Do not overlap navigation with queries. calls can fail with an execution-context-destroyed error if the report is still navigating or hydrating.
agent-browser eval - Logs require full auth. The report page may load with just an token in the URL, but navigating to log pages requires a fully authenticated session.
auth - Logs use virtual scrolling. Only ~50-70 rows render at a time. You may need to scroll to find specific entries.
- Present results clearly. When reporting property statuses, use a table or list. When reporting log findings, include the virtual timestamp, source, and log text.
- 始终先完成认证。每个会话都从setup-auth开始。
- 不要自行创建选择器。分类报告使用自定义Web组件和不明显的类名。请始终参考资源页面获取正确的查询。
- 在主报告视图中运行报告查询。如果您点击进入以问题发现为中心的哈希路由,请重新打开原始报告URL后再使用报告查询。
- 不要同时进行导航和查询。如果报告仍在导航或加载内容,调用可能会因执行上下文销毁而失败。
agent-browser eval - 日志需要完整认证。报告页面可能仅通过URL中的令牌加载,但导航到日志页面需要完全认证的会话。
auth - 日志使用虚拟滚动。一次仅渲染约50-70行。您可能需要滚动才能找到特定条目。
- 清晰呈现结果。报告属性状态时,请使用表格或列表。报告日志发现时,请包含虚拟时间戳、来源和日志文本。