eval-guidance-actionability

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Eval Guidance & Actionability

评估指导性与可操作性

Use this skill to evaluate whether an assistant response provides clear, usable guidance the user can act on.
使用此skill来评估助手回复是否为用户提供了清晰、可用且可执行的指导。

Inputs

输入要求

Require:
  • The assistant response text to evaluate.
  • (Optional) The user’s request or goal (helps judge whether guidance matches what’s needed).
必填:
  • 待评估的助手回复文本。 -(可选)用户的请求或目标(有助于判断指导内容是否符合需求)。

Internal Rubric (1–5)

内部评分标准(1–5分)

5 = Provides concrete, actionable steps; prioritized; includes key details/constraints; user could execute without guessing
4 = Mostly actionable; minor missing details or ordering, but still usable
3 = Some guidance, but generic; missing important steps/details; requires user to infer next actions
2 = Largely non-actionable; mostly high-level advice; lacks steps or specifics
1 = No usable guidance; purely vague, deflective, or irrelevant to “what to do next”
5分 = 提供具体、可执行的步骤;有优先级;包含关键细节/约束条件;用户无需猜测即可执行
4分 = 基本可执行;仅缺少少量细节或步骤顺序略有问题,但仍可使用
3分 = 有一定指导内容,但较为笼统;缺少重要步骤/细节;需要用户自行推断后续操作
2分 = 基本不可执行;多为高层次建议;缺乏具体步骤或细节
1分 = 无可用指导内容;完全模糊、回避问题或与“下一步该做什么”无关

Workflow

工作流程

  1. Check whether the response includes specific next actions (steps, checklist, examples, decision points).
  2. Check completeness (missing prerequisites, constraints, caveats).
  3. Score on a 1-5 integer scale using the rubric only.
  4. Write concise rationale tied directly to rubric criteria.
  5. Produce actionable suggestions that improve actionability.
  1. 检查回复是否包含具体的后续操作(步骤、清单、示例、决策点)。
  2. 检查内容完整性(是否缺少前提条件、约束条件、注意事项)。
  3. 仅依据上述评分标准给出1-5之间的整数分数。
  4. 撰写与评分标准直接相关的简洁理由。
  5. 提出可提升可操作性的具体改进建议。

Output Contract

输出规范

Return JSON only. Do not include markdown, backticks, prose, or extra keys.
Use exactly this schema:
{ "dimension": "guidance_actionability", "score": 1, "rationale": "...", "improvement_suggestions": [ "..." ] }
仅返回JSON格式内容。不得包含markdown、反引号、散文式文本或额外的键值对。
必须使用以下精确格式:
{ "dimension": "guidance_actionability", "score": 1, "rationale": "...", "improvement_suggestions": [ "..." ] }

Hard Rules

硬性规则

  • dimension
    must always equal
    "guidance_actionability"
    .
  • score
    must be an integer from 1 to 5.
  • rationale
    must be concise (max 3 sentences).
  • Do not include step-by-step reasoning.
  • improvement_suggestions
    must be a non-empty array of concrete edits.
  • Never output text outside the JSON object.
  • dimension
    的值必须始终为
    "guidance_actionability"
  • score
    必须是1到5之间的整数。
  • rationale
    必须简洁(最多3句话)。
  • 不得包含逐步推理的内容。
  • improvement_suggestions
    必须是包含具体修改建议的非空数组。
  • 绝对不能在JSON对象外输出任何文本。