discernment
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseDiscernment Framework
判断洞察框架
Good judgment is not about confidence. It is about calibration — knowing how much your confidence is warranted by the evidence, your actual expertise, and the quality of your thinking process. This framework draws on four bodies of work to help you assess that.
Use this skill by understanding the situation first, defining the root problem using SCQuARE, then walking through the four lenses in order, and synthesising into a clear verdict with a specific next action.
良好的判断与自信无关,而在于校准——即了解你的自信程度是否有证据、实际专业能力以及思考过程质量的支撑。本框架借鉴四大研究成果,帮助你进行此类评估。
使用本技能时,需先理解情境,用SCQuARE定义核心问题,然后依次梳理四大视角,最后整合为清晰的结论并给出具体的下一步行动。
Step 0: Understand the situation
步骤0:理解情境
Before applying any lens, establish:
- What exactly is being assessed? A decision? A recommendation? A piece of AI-generated text? A course of action someone is about to commit to?
- What are the stakes? Reversible or irreversible? Who is affected? What is the cost of being wrong?
- What has already been decided vs. what is still open? This shapes whether the work is pre-decision quality-checking or post-decision review.
Ask the user for this context if it isn't clear. Don't start the framework on thin air.
在应用任何视角之前,需明确:
- 具体评估对象是什么? 是一项决策?一条建议?一段AI-generated text?还是某人即将付诸行动的方案?
- 风险等级如何? 决策可逆还是不可逆?会影响哪些人?决策失误的成本是什么?
- 已确定事项与待确定事项分别是什么? 这决定了当前工作是决策前的质量检查还是决策后的复盘。
如果上下文信息不明确,请向用户询问相关内容。不要在信息不足的情况下启动框架。
Step 1: Root problem — SCQuARE
步骤1:核心问题——SCQuARE框架
Before applying any judgment lens, establish what problem is actually being solved. This step is easy to skip and almost always worth doing. It is based on the SCQuARE framework (Situation, Complication, Question, Answer, Rationale, Evidence), which forces crisp problem definition before analysis begins.
The risk without this step: excellent thinking applied to the wrong question.
Work through these five elements:
-
Situation — What is the stable, agreed-upon context? What do we know to be true and uncontroversial? (2–3 sentences max. If this is hard to state, the problem isn't well-formed yet.)
-
Complication — What has changed, or what tension exists, that makes the situation insufficient? This is the reason the decision or question has arisen at all. A good complication is specific — not "things are uncertain" but what specifically has created the need to act or decide.
-
Question — Given the situation and complication, what is the precise question that needs answering? This should be a single, crisp question. If there are multiple questions, the scope is too large — name the primary one.
-
Answer — What is the hypothesis or proposed answer to that question? This is what is about to be assessed, acted on, or committed to. Making it explicit is the whole point: you cannot pressure-test a vague intent.
-
Rationale and Evidence — Why do we believe this answer? What evidence supports it? And what evidence against it has been considered? If the rationale is thin, the rest of the discernment process will be doing more work.
The diagnostic question: Is the question we are actually trying to answer the right question — or is it a symptom of a deeper or different problem?
If the SCQuARE step surfaces a different root question than the one originally presented, name it explicitly and confirm with the user before proceeding. Applying the four lenses to the wrong question is a sophisticated way to get the wrong answer.
在应用任何判断视角之前,需明确实际要解决的问题。这一步很容易被跳过,但几乎总是值得去做。它基于SCQuARE框架(Situation、Complication、Question、Answer、Rationale、Evidence),要求在分析前先精准定义问题。
跳过这一步的风险:在错误的问题上投入优质思考。
梳理以下五大要素:
-
Situation(情境)——稳定且已达成共识的背景是什么?我们已知的真实且无争议的信息有哪些?(最多2-3句话。如果难以表述,说明问题尚未明确。)
-
Complication(矛盾点)——出现了哪些变化或矛盾,导致当前情境不再适用?这正是决策或问题产生的根本原因。一个好的矛盾点应具体明确——不是“情况不确定”,而是具体是什么引发了行动或决策的需求。
-
Question(问题)——结合情境与矛盾点,需要回答的精准问题是什么?这应该是一个单一、清晰的问题。如果存在多个问题,说明范围过大——请明确核心问题。
-
Answer(答案)——针对该问题的假设或提议答案是什么?这正是即将被评估、执行或确认的内容。将其明确化是关键:你无法检验模糊的意图。
-
Rationale and Evidence(依据与证据)——我们为何相信这个答案?有哪些证据支持它?又考虑过哪些反对它的证据?如果依据不足,后续的洞察过程将承担更多工作。
诊断问题:我们实际要回答的问题是否正确——还是它只是更深层或不同问题的表象?
如果SCQuARE步骤揭示的核心问题与最初提出的问题不同,请明确指出并与用户确认后再继续。将四大视角应用于错误的问题,只会巧妙地得到错误的答案。
The Four Lenses
四大视角
Lens 1 — Dunning-Kruger: Do you know what you don't know?
视角1——Dunning-Kruger:你是否知道自己不知道什么?
The Dunning-Kruger effect is not just about being wrong. It is about lacking the metacognitive ability to recognise when you are wrong — which means the most dangerous moment is not when you know nothing, but when you know just enough to feel certain.
Apply this lens by exploring:
- What is the person's actual depth of expertise in this specific domain? Not adjacent experience — this domain.
- Is their confidence rising with familiarity, or with evidence? Familiarity-driven confidence is a warning sign.
- Are there signals of "peak of Mount Stupid" thinking? Easy answers, low uncertainty, dismissal of nuance.
- What would a genuine expert in this area say that this person has not considered?
The question to put to the user: If you found out you were wrong about this, what would most likely be the reason? If they struggle to answer that, the Dunning-Kruger risk is real.
Dunning-Kruger效应不仅仅是犯错,更在于缺乏元认知能力来识别自己何时犯错——这意味着最危险的时刻不是你一无所知的时候,而是你懂一点就自以为是的时候。
应用此视角时,需探索:
- 此人在该特定领域的实际专业深度如何? 不是相关经验——而是该领域的专业能力。
- 他们的自信是源于熟悉感还是证据? 基于熟悉感的自信是一个警示信号。
- 是否存在“愚昧山峰”思维的迹象? 比如简单化答案、低不确定性、忽视细微差别。
- 该领域的真正专家会提出哪些此人未考虑到的观点?
向用户提出的问题:如果你发现自己在这件事上错了,最可能的原因是什么? 如果他们难以回答,说明Dunning-Kruger风险确实存在。
Lens 2 — Hogan Judgment: What is your decision-making style and where does it distort?
视角2——Hogan Judgment:你的决策风格是什么,它会在何处产生偏差?
Hogan's Judgment model identifies three personality dimensions that shape how people approach decisions — and each creates predictable blind spots under pressure:
| Dimension | Poles | Blind spot risk |
|---|---|---|
| Motivation | Threat-avoidant ↔ Reward-seeking | Reward-seekers underweight downside risk. Threat-avoiders underweight opportunity. |
| Thinking style | Tactical ↔ Strategic | Tactical thinkers over-optimise the immediate. Strategic thinkers lose the operational detail. |
| Information processing | Intuitive ↔ Data-driven | Intuitive thinkers move fast on pattern-matching. Data-driven thinkers can stall or over-engineer. |
Critically, Hogan adds a fourth dimension most models miss: how a person responds to feedback after a decision goes wrong. Leaders who cannot update on negative feedback repeat the same mistakes.
Apply this lens by asking:
- Which pole of each dimension does this person (or this reasoning) tend toward?
- Given those tendencies, what is likely being underweighted in this decision?
- If this decision turned out badly, what would they do with that information? Would they update, or explain it away?
Hogan的判断模型识别出三种影响人们决策方式的人格维度——每种维度在压力下都会产生可预见的盲点:
| 维度 | 两极 | 盲点风险 |
|---|---|---|
| 动机 | 规避威胁 ↔ 追求回报 | 追求回报者低估下行风险。规避威胁者低估机遇。 |
| 思维风格 | 战术性 ↔ 战略性 | 战术性思考者过度优化眼前事务。战略性思考者忽视操作细节。 |
| 信息处理 | 直觉型 ↔ 数据驱动型 | 直觉型思考者基于模式匹配快速行动。数据驱动型思考者可能停滞不前或过度设计。 |
关键的是,Hogan增加了大多数模型遗漏的第四个维度:决策失误后此人如何回应反馈。无法根据负面反馈调整的领导者会重复相同的错误。
应用此视角时,需询问:
- 此人(或其推理)倾向于每个维度的哪一极?
- 基于这些倾向,该决策中可能低估了什么?
- 如果该决策结果不佳,他们会如何处理相关信息?是调整决策,还是找借口掩饰?
Lens 3 — Mark Egan / Behavioural Insights Team: Are you well-calibrated?
视角3——Mark Egan / Behavioural Insights Team:你的判断是否校准得当?
Dr. Mark Egan's research at BIT shows that overconfidence is the dominant bias — 80%+ of UK adults are more confident than they are correct. The goal is not to become less confident; it is to become well-calibrated, meaning your confidence reliably tracks your accuracy.
The BIT framework distinguishes three states:
- Overconfident: confidence exceeds correctness — the most common and dangerous state
- Underconfident: correctness exceeds confidence — often in experienced professionals with impostor syndrome
- Well-calibrated: confidence and accuracy track each other — the goal
Key debiasing tools from Egan's "good form for thinking" toolkit:
Premortem: Before committing, imagine it is 12 months later and the decision has failed. What went wrong? This surfaces risks that forward-looking optimism hides.
Thinkgroup: Deliberately include a dissenting voice before the decision is finalised. Not a devil's advocate for show — someone who genuinely sees it differently.
Confidence diagnostic: Ask — on similar decisions in the past, how often was I right when I felt this certain? If you don't track that, your confidence numbers are noise.
"Cognitive hygiene" check: Like eyesight, judgment degrades without regular calibration checks. Ask: when did I last get feedback that updated my thinking in this domain?
Apply this lens by asking:
- How confident is the person, on a scale of 0–100%?
- What is their actual track record on similar judgments?
- Has a premortem been run? Has a thinkgroup been consulted?
- Is there evidence of "loud certainty" overriding calibrated thinking?
Mark Egan博士在BIT的研究表明,过度自信是主导性偏差——超过80%的英国成年人的自信程度高于其实际正确率。目标不是降低自信,而是实现良好校准,即自信程度能可靠地反映准确率。
BIT框架区分了三种状态:
- 过度自信:自信程度超过正确率——最常见且最危险的状态
- 信心不足:正确率超过自信程度——常见于患有冒名顶替综合征的资深专业人士
- 校准得当:自信程度与准确率匹配——目标状态
Egan的“良好思考规范”工具包中的关键去偏工具:
Premortem:在做出承诺前,设想12个月后决策失败了,是什么导致了失败?这能揭示前瞻性乐观主义掩盖的风险。
Thinkgroup:在决策最终确定前,故意纳入持不同意见的声音。不是装样子的魔鬼代言人——而是真正有不同看法的人。
信心诊断:问问自己——在过去类似的决策中,当我有这样的自信程度时,我正确的频率有多高?如果你没有记录过,你的自信数值就是无效的。
“认知卫生”检查:就像视力一样,判断能力会因缺乏定期校准而退化。问问自己:我上次在该领域获得能更新我思考的反馈是什么时候?
应用此视角时,需询问:
- 此人的自信程度如何,按0-100%评分?
- 他们在类似判断中的实际记录如何?
- 是否进行过premortem?是否咨询过thinkgroup?
- 是否存在“绝对肯定”压倒校准思考的迹象?
Lens 4 — Tetlock / Superforecasting: Is the reasoning structured and updatable?
视角4——Tetlock / Superforecasting:推理是否结构化且可调整?
Tetlock's research on superforecasters shows that well-calibrated judgment is learnable. Superforecasters are not smarter — they have better process. The key habits:
Outside view first: Before reasoning from inside the specific situation, ask — what is the base rate? Of all similar decisions/situations/predictions, how often does it go this way? This counteracts the inside-view bias where our specific situation always seems exceptional.
Precise probabilities, not vague language: "It's likely" is not a forecast. "I'd put this at 70%" is. Precision forces honesty and makes errors detectable.
What would change your mind? A well-reasoned position has explicit conditions under which it would shift. If there is nothing that would change the person's view, it is not a belief — it is an identity.
Update proportionally to evidence: New information should move the needle in proportion to its actual strength. Overreacting and underreacting are both errors.
Track your predictions: Confidence calibration only improves if you score yourself. The question is not just "was I right?" but "how right was I relative to how sure I was?"
Apply this lens by asking:
- What is the outside-view base rate for this situation?
- What specific evidence would raise or lower confidence in this view?
- Has the person stated a precise confidence level, or used vague language?
- Is there a mechanism to check back and score this decision against the outcome?
Tetlock关于超级预测者的研究表明,校准得当的判断是可以学习的。超级预测者并非更聪明——他们拥有更好的流程。核心习惯包括:
先采用外部视角:在从具体情境内部推理之前,先问问——基准概率是多少?在所有类似的决策/情境/预测中,出现这种结果的频率有多高?这能抵消内部视角偏差,即我们总觉得自己的具体情况与众不同。
精确概率,而非模糊表述:“很可能”不是预测。“我认为概率是70%”才是。精确性迫使诚实,也让错误可被检测。
什么会改变你的想法? 合理的观点有明确的调整条件。如果没有任何东西能改变此人的观点,那这不是信念——而是身份认同。
根据证据按比例调整:新信息应根据其实际强度相应地改变判断。反应过度和反应不足都是错误。
记录你的预测:只有进行自我评分,信心校准才能提升。问题不仅是“我是否正确?”,还有“相对于我的自信程度,我有多正确?”
应用此视角时,需询问:
- 该情境的外部视角基准概率是多少?
- 哪些具体证据会提升或降低对此观点的信心?
- 此人是否给出了精确的自信程度,还是使用了模糊表述?
- 是否有机制可以事后对照结果检验该决策?
Step 5: Synthesise
步骤5:整合
After walking through the four lenses, produce a judgment verdict that covers:
1. Calibration verdict: Is the confidence level warranted? State whether the reasoning is well-calibrated, overconfident, or underconfident — and which lens most clearly reveals the risk.
2. The dominant bias risk: Name the one cognitive pattern most likely to undermine this decision. Be specific — not "confirmation bias" as a catch-all, but why this specific bias matters in this specific context.
3. The quality of the reasoning process: Not whether the conclusion is right — that's often unknowable — but whether the process is sound. Has the outside view been taken? Has a dissenting voice been heard? Has the premortem been run?
4. One concrete next action: Before the person commits, what is the single most valuable thing they should do? Options include: run a premortem, seek a thinkgroup, assign a specific probability and commit to tracking it, identify what would change their mind, or simply wait 48 hours.
梳理完四大视角后,生成一份判断结论,涵盖以下内容:
1. 校准结论:自信程度是否合理?说明推理是校准得当、过度自信还是信心不足——以及哪个视角最清晰地揭示了风险。
2. 主导偏差风险:指出最可能破坏该决策的一种认知模式。要具体——不要笼统地说“确认偏差”,而是说明为何这种特定偏差在该特定情境中至关重要。
3. 推理过程质量:不是判断结论是否正确——这往往无法预知——而是判断过程是否合理。是否采用了外部视角?是否听取了不同意见?是否进行了premortem?
4. 一个具体的下一步行动:在此人做出承诺前,他们最应该做的一件有价值的事是什么?选项包括:进行premortem、咨询thinkgroup、给出具体概率并承诺跟踪、明确能改变想法的条件,或是简单等待48小时。
Tone and approach
语气与方法
This framework is not a checklist to be pedantically completed. It is a thinking tool. Use judgment about which lenses are most relevant to the specific situation — sometimes one lens is doing all the work, sometimes all four.
The goal is not to make the person less confident. It is to make them appropriately confident — someone who acts decisively when the evidence warrants it and holds back when it doesn't.
Avoid abstract diagnosis. Name the specific thing. Be direct about the risk. The value of good discernment is that it can be spoken plainly.
本框架不是一份需要机械完成的清单,而是一种思考工具。根据具体情境判断哪些视角最相关——有时一个视角就足够,有时则需要四个全部用上。
目标不是降低此人的自信,而是让他们适度自信——即在证据支持时果断行动,在证据不足时保持克制。
避免抽象诊断。明确指出具体问题。直接说明风险。良好洞察的价值在于可以直白地表达。
Quick reference: Five questions
快速参考:五大问题
If the context is time-pressured, cut to these five questions:
- SCQuARE: What is the precise question we are trying to answer — and is that actually the right question?
- Dunning-Kruger: If you turned out to be wrong, what would be the most likely reason?
- Hogan: What does your natural decision style tend to underweight — and is that happening here?
- Egan / BIT: How often have you been right when you felt this certain? Have you run a premortem?
- Tetlock: What is the base rate? What specific evidence would change your view?
如果时间紧迫,可直接使用以下五个问题:
- SCQuARE:我们要回答的精确问题是什么——这真的是正确的问题吗?
- Dunning-Kruger:如果你被证明错了,最可能的原因是什么?
- Hogan:你的自然决策风格倾向于低估什么——这种情况是否正在发生?
- Egan / BIT:当你有这样的自信程度时,过去正确的频率有多高?你是否进行过premortem?
- Tetlock:基准概率是多少?哪些具体证据会改变你的观点?
Reference material
参考资料
See for deeper notes on each of the four source frameworks, including key publications and direct quotes.
references/frameworks.md详见,获取四大源框架的更深入说明,包括关键出版物和直接引用。
references/frameworks.md