eval-guidance-actionability
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseEval Guidance & Actionability
评估指导性与可操作性
Use this skill to evaluate whether an assistant response provides clear, usable guidance the user can act on.
使用此skill来评估助手回复是否为用户提供了清晰、可用且可执行的指导。
Inputs
输入要求
Require:
- The assistant response text to evaluate.
- (Optional) The user’s request or goal (helps judge whether guidance matches what’s needed).
必填:
- 待评估的助手回复文本。 -(可选)用户的请求或目标(有助于判断指导内容是否符合需求)。
Internal Rubric (1–5)
内部评分标准(1–5分)
5 = Provides concrete, actionable steps; prioritized; includes key details/constraints; user could execute without guessing
4 = Mostly actionable; minor missing details or ordering, but still usable
3 = Some guidance, but generic; missing important steps/details; requires user to infer next actions
2 = Largely non-actionable; mostly high-level advice; lacks steps or specifics
1 = No usable guidance; purely vague, deflective, or irrelevant to “what to do next”
4 = Mostly actionable; minor missing details or ordering, but still usable
3 = Some guidance, but generic; missing important steps/details; requires user to infer next actions
2 = Largely non-actionable; mostly high-level advice; lacks steps or specifics
1 = No usable guidance; purely vague, deflective, or irrelevant to “what to do next”
5分 = 提供具体、可执行的步骤;有优先级;包含关键细节/约束条件;用户无需猜测即可执行
4分 = 基本可执行;仅缺少少量细节或步骤顺序略有问题,但仍可使用
3分 = 有一定指导内容,但较为笼统;缺少重要步骤/细节;需要用户自行推断后续操作
2分 = 基本不可执行;多为高层次建议;缺乏具体步骤或细节
1分 = 无可用指导内容;完全模糊、回避问题或与“下一步该做什么”无关
4分 = 基本可执行;仅缺少少量细节或步骤顺序略有问题,但仍可使用
3分 = 有一定指导内容,但较为笼统;缺少重要步骤/细节;需要用户自行推断后续操作
2分 = 基本不可执行;多为高层次建议;缺乏具体步骤或细节
1分 = 无可用指导内容;完全模糊、回避问题或与“下一步该做什么”无关
Workflow
工作流程
- Check whether the response includes specific next actions (steps, checklist, examples, decision points).
- Check completeness (missing prerequisites, constraints, caveats).
- Score on a 1-5 integer scale using the rubric only.
- Write concise rationale tied directly to rubric criteria.
- Produce actionable suggestions that improve actionability.
- 检查回复是否包含具体的后续操作(步骤、清单、示例、决策点)。
- 检查内容完整性(是否缺少前提条件、约束条件、注意事项)。
- 仅依据上述评分标准给出1-5之间的整数分数。
- 撰写与评分标准直接相关的简洁理由。
- 提出可提升可操作性的具体改进建议。
Output Contract
输出规范
Return JSON only. Do not include markdown, backticks, prose, or extra keys.
Use exactly this schema:
{
"dimension": "guidance_actionability",
"score": 1,
"rationale": "...",
"improvement_suggestions": [
"..."
]
}
仅返回JSON格式内容。不得包含markdown、反引号、散文式文本或额外的键值对。
必须使用以下精确格式:
{
"dimension": "guidance_actionability",
"score": 1,
"rationale": "...",
"improvement_suggestions": [
"..."
]
}
Hard Rules
硬性规则
- must always equal
dimension."guidance_actionability" - must be an integer from 1 to 5.
score - must be concise (max 3 sentences).
rationale - Do not include step-by-step reasoning.
- must be a non-empty array of concrete edits.
improvement_suggestions - Never output text outside the JSON object.
- 的值必须始终为
dimension。"guidance_actionability" - 必须是1到5之间的整数。
score - 必须简洁(最多3句话)。
rationale - 不得包含逐步推理的内容。
- 必须是包含具体修改建议的非空数组。
improvement_suggestions - 绝对不能在JSON对象外输出任何文本。