reference-class-forecasting

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Reference Class Forecasting

What is Reference Class Forecasting?

什么是参考类预测？

Reference class forecasting is the practice of anchoring predictions in historical reality by identifying a class of similar past events and using their statistical frequency as a starting point. This is the "Outside View" - looking at what usually happens to things like this, before getting distracted by the specific details of "this case."

Core Principle: Assume this event is average until you have specific evidence proving otherwise.

Why It Matters:

Defeats "inside view" bias (thinking your case is unique)
Prevents base rate neglect (ignoring statistical baselines)
Provides objective anchor before subjective analysis
Forces humility and statistical thinking

参考类预测是一种将预测锚定在历史事实中的实践方法，通过识别一类相似的历史事件，将其统计频率作为预测的起点。这就是“外部视角”——在被“本次案例”的具体细节干扰之前，先看同类事件通常会出现什么结果。

核心原则： 在你拿到能证明特殊性的明确证据之前，默认当前事件属于平均水平。

价值：

破除“内部视角”偏见（总觉得自己的案例独一无二）
避免基准比率忽视问题（忽略统计基线）
在主观分析前提供客观锚点
倒逼谦逊态度和统计思维

When to Use This Skill

何时使用该技能

Use this skill when:

Starting any forecast - Establish base rate FIRST
Someone says "this time is different" - Test if it really is
Making predictions about success/failure - Find historical frequencies
Evaluating startup/project outcomes - Anchor in class statistics
Challenged by confident predictions - Ground in reality
Before detailed analysis - Get outside view baseline

Do NOT use when:

Event has literally never happened (novel situation)
Working with deterministic physical laws
Pure chaos with no patterns

适用场景：

启动任何预测工作时——首先建立基准比率
有人说“这次不一样”时——验证这一说法是否成立
做成功/失败相关预测时——查找历史发生频率
评估创业/项目结果时——以同类统计数据为锚点
质疑过于笃定的预测时——用现实情况做依据
开展详细分析前——获取外部视角基线

不适用场景：

完全没有先例的事件（全新情况）
遵循确定性物理定律的事件
无规律可循的纯混沌事件

Interactive Menu

交互菜单

What would you like to do?

你想要做什么？

Core Workflows

核心工作流

1. Find My Base Rate - Identify reference class and get statistical baseline

Guided process to select correct reference class
Search strategies for finding historical frequencies
Validation that you have the right anchor

2. Test "This Time Is Different" - Challenge uniqueness claims

Reversal test for uniqueness bias
Similarity matching framework
Burden of proof calculator

3. Calculate Funnel Base Rates - Multi-stage probability chains

When no single base rate exists
Sequential probability modeling
Product rule for compound events

4. Validate My Reference Class - Ensure you chose the right comparison set

Too broad vs too narrow test
Homogeneity check
Sample size evaluation

5. Learn the Framework - Deep dive into methodology

Read Outside View Principles
Read Reference Class Selection Guide
Read Common Pitfalls

6. Exit - Return to main forecasting workflow

1. 查找我的基准比率——识别参考类并获取统计基线

引导你选择正确参考类的流程
查找历史频率的搜索策略
验证你选的锚点是否准确

2. 验证“这次不一样”——挑战特殊性主张

特殊性偏见的反转测试
相似度匹配框架
举证责任计算器

3. 计算漏斗基准比率——多阶段概率链

无单一基准比率时的解决方案
序列概率建模
复合事件的乘积法则

4. 验证我的参考类——确保你选的对比集正确

过宽/过窄测试
同质性检查
样本量评估

5. 学习框架——深入了解方法论

阅读外部视角原则
阅读参考类选择指南
阅读常见陷阱

6. 退出——返回预测主工作流

1. Find My Base Rate

1. 查找我的基准比率

Let's establish your statistical baseline.

我们来建立你的统计基线。

Step 1: What are you forecasting?

步骤1：你要预测什么？

Tell me the specific event or outcome you're predicting.

Example prompts:

"Will this startup succeed?"
"Will this bill pass Congress?"
"Will this project launch on time?"

告诉我你要预测的具体事件或结果。

示例提示：

"这家创业公司会成功吗？"
"这项法案会在国会通过吗？"
"这个项目会按时上线吗？"

Step 2: Identify the Reference Class

步骤2：识别参考类

I'll help you identify what bucket this belongs to.

Framework:

Too broad: "All companies" → meaningless
Just right: "Seed-stage B2B SaaS startups in fintech"
Too narrow: "Companies founded by people named Steve in 2024" → no data

Key Questions:

What type of entity is this? (company, bill, project, person, etc.)
What stage/size/category?
What industry/domain?
What time period is relevant?

I'll work with you to refine this until we have a specific, searchable class.

我会帮你确定该事件属于哪个分类。

框架：

过宽： "所有公司" → 无意义
合适： "金融科技领域的种子阶段B2B SaaS创业公司"
过窄： "2024年由名叫Steve的人创立的公司" → 无数据

核心问题：

该事件属于什么类型的主体？（公司、法案、项目、人物等）
属于什么阶段/规模/类别？
属于什么行业/领域？
相关的时间范围是什么？

我会和你一起优化分类，直到得到一个明确的、可搜索的参考类。

Step 3: Search for Historical Data

步骤3：搜索历史数据

I'll help you find the base rate using:

Web search for published statistics
Academic studies on success rates
Government/industry reports
Proxy metrics if direct data unavailable

Search Strategy:

"historical success rate of [reference class]"
"[reference class] failure statistics"
"[reference class] survival rate"
"what percentage of [reference class]"

我会通过以下渠道帮你查找基准比率：

网页搜索公开的统计数据
学术研究中的成功率数据
政府/行业报告
代理指标（如果没有直接数据）

搜索策略：

"historical success rate of [reference class]"
"[reference class] failure statistics"
"[reference class] survival rate"
"what percentage of [reference class]"

Step 4: Set Your Anchor

步骤4：设定你的锚点

Once we find the base rate, that becomes your starting probability.

The Rule:

You are NOT allowed to move from this base rate until you have specific, evidence-based reasons in your "inside view" analysis.

Default anchors if no data found:

Novel innovation: 10-20% (most innovations fail)
Established industry: 50% (uncertain)
Regulated/proven process: 70-80% (systems work)

Next: Return to menu or proceed to inside view analysis.

我们找到基准比率后，它就会成为你的起始概率。

规则：

在你的“内部视角”分析中拿到有证据支撑的明确理由之前，你不允许偏离这个基准比率。

找不到数据时的默认锚点：

全新创新：10-20%（绝大多数创新都会失败）
成熟行业：50%（不确定性高）
受监管/已验证的流程：70-80%（体系可保障落地）

下一步： 返回菜单或继续开展内部视角分析。

2. Test "This Time Is Different"

2. 验证“这次不一样”

Challenge uniqueness bias.

When someone (including yourself) believes "this case is special," we need to stress-test that belief.

挑战特殊性偏见。

当有人（包括你自己）认为“这个案例很特殊”时，我们需要对这个观点做压力测试。

The Uniqueness Audit

特殊性审计

Question 1: Similarity Matching

What are 5 historical cases that are most similar to this one?
For each, what was the outcome?
How is your case materially different from these?

Question 2: The Reversal Test

If someone claimed a different case was "unique" for the same reasons you're claiming, would you accept it?
Are you applying special pleading?

Question 3: Burden of Proof The base rate says [X]%. You claim it should be [Y]%.

Calculate the gap:

|Y - X|

Required evidence strength:

Gap < 10%: Minimal evidence needed
Gap 10-30%: Moderate evidence needed (2-3 specific factors)
Gap > 30%: Extraordinary evidence needed (multiple independent strong signals)

问题1：相似度匹配

和该案例最相似的5个历史案例是什么？
每个案例的结果是什么？
你的案例和这些案例有什么实质性差异？

问题2：反转测试

如果其他人用和你一样的理由宣称另一个案例“独一无二”，你会认可吗？
你是不是在使用双重标准？

问题3：举证责任 基准比率显示概率为[X]%，你认为应该是[Y]%。

计算差值：

|Y - X|

所需证据强度：

差值<10%：需要极少证据
差值10-30%：需要中等强度证据（2-3个明确因素）
差值>30%：需要极强证据（多个独立的强信号）

Output

输出

I'll tell you:

Whether "this time is different" is justified
How much you can reasonably adjust from the base rate
What evidence would be needed to justify larger moves

Next: Return to menu

我会告诉你：

“这次不一样”的主张是否合理
你可以合理偏离基准比率的幅度
要做更大幅度调整需要什么证据

下一步： 返回菜单

3. Calculate Funnel Base Rates

3. 计算漏斗基准比率

For multi-stage processes without a single base rate.

适用于没有单一基准比率的多阶段流程。

When to Use

适用场景

No direct statistic exists (e.g., "success rate of X")
Event requires multiple sequential steps
Each stage has independent probabilities

没有直接统计数据（比如“X的成功率”）
事件需要多个连续步骤才能完成
每个阶段的概率相互独立

The Funnel Method

漏斗方法

Example: "Will Bill X become law?"

No direct data on "Bill X success rate," but we can model the funnel:

Stage 1: Bills introduced → Bills that reach committee
- P(committee | introduced) = ?
Stage 2: Bills in committee → Bills that reach floor vote
- P(floor | committee) = ?
Stage 3: Bills voted on → Bills that pass
- P(pass | floor vote) = ?

Final Base Rate:

P(law) = P(committee) × P(floor) × P(pass)

示例：“X法案会成为法律吗？”

没有“X法案成功率”的直接数据，但我们可以搭建漏斗模型：

阶段1： 提交的法案 → 进入委员会的法案
- P(进入委员会 | 已提交) = ?
阶段2： 委员会中的法案 → 进入全院投票的法案
- P(进入全院投票 | 在委员会中) = ?
阶段3： 参与投票的法案 → 获得通过的法案
- P(通过 | 全院投票) = ?

最终基准比率：

P(成为法律) = P(进入委员会) × P(进入全院投票) × P(通过)

Process

流程

I'll help you:

Decompose the event into sequential stages
Search for statistics on each stage
Multiply probabilities using the product rule
Validate the model (are stages truly independent?)

我会帮你：

拆解事件为多个连续阶段
搜索每个阶段的统计数据
用乘积法则计算总概率
验证模型（各阶段是否真的独立？）

Common Funnels

常见漏斗场景

Startup success: Seed → Series A → Profitability → Exit
Drug approval: Discovery → Trials → FDA → Market
Project delivery: Planning → Development → Testing → Launch

Next: Return to menu

创业成功：种子轮 → A轮 → 盈利 → 退出
药品获批：研发 → 临床试验 → FDA审批 → 上市
项目交付：规划 → 开发 → 测试 → 上线

下一步： 返回菜单

4. Validate My Reference Class

4. 验证我的参考类

Ensure you chose the right comparison set.

确保你选择了正确的对比集。

The Three Tests

三项测试

Test 1: Homogeneity

Are the members of this class actually similar enough?
Is there high variance in outcomes?
Should you subdivide further?

Example: "Tech startups" is too broad (consumer vs B2B vs hardware are very different). Subdivide.

Test 2: Sample Size

Do you have enough historical cases?
Minimum: 20-30 cases for meaningful statistics
If N < 20: Widen the class or acknowledge high uncertainty

Test 3: Relevance

Have conditions changed since the historical data?
Are there structural differences (regulation, technology, market)?
Time decay: Data from >10 years ago may be stale

测试1：同质性

该类别的成员相似度足够高吗？
结果方差大吗？
你需要进一步拆分分类吗？

示例：“科技创业公司”分类过宽（消费级、B2B、硬件类差异极大），需要拆分。

测试2：样本量

你有足够多的历史案例吗？
最低要求：20-30个案例才能得到有意义的统计结果
如果样本量<20：拓宽分类范围或承认不确定性很高

测试3：相关性

历史数据对应的环境和现在相比有没有变化？
是否存在结构性差异（监管、技术、市场）？
时间衰减：超过10年的旧数据可能已经失效

Validation Checklist

验证 checklist

5. Learn the Framework

5. 学习框架

Deep dive into the methodology.

深入了解方法论。

Resource Files

资源文件

📄 Outside View Principles

Statistical thinking vs narrative thinking
Why the outside view beats experts
Kahneman's planning fallacy research
When outside view fails

📄 Reference Class Selection Guide

Systematic method for choosing comparison sets
Balancing specificity vs data availability
Similarity metrics and matching
Edge cases and judgment calls

📄 Common Pitfalls

Base rate neglect examples
"This time is different" bias
Overfitting to small samples
Ignoring regression to the mean
Availability bias in class selection

Next: Return to menu

📄 外部视角原则

统计思维 vs 叙事思维
为什么外部视角比专家判断更准确
卡尼曼的规划谬误研究
外部视角失效的场景

📄 参考类选择指南

选择对比集的系统方法
平衡 specificity 和数据可用性
相似度指标和匹配方法
边缘场景和判断调用

📄 常见陷阱

基准比率忽视案例
“这次不一样”偏见
小样本过拟合
忽略回归均值
参考类选择中的可得性偏见

下一步： 返回菜单

Quick Reference

快速参考

The Outside View Commandments

外部视角戒律

Base Rate First: Establish statistical baseline BEFORE analyzing specifics
Assume Average: Treat case as typical until proven otherwise
Burden of Proof: Large deviations from base rate require strong evidence
Class Precision: Reference class should be specific but data-rich
No Narratives: Resist compelling stories; trust frequencies

基准比率优先： 在分析具体细节前先建立统计基线
默认平均水平： 在得到反证前，默认当前案例属于典型水平
举证责任： 大幅偏离基准比率需要强证据支撑
分类精度： 参考类需要足够明确，同时有充足数据
拒绝叙事： 不要被吸引人的故事干扰，信任频率数据

One-Sentence Summary

一句话总结

Find what usually happens to things like this, start there, and only move with evidence.

找到同类事件的通常结果，以此为起点，只有拿到证据时再调整预测。

Integration with Other Skills

和其他技能的集成

Before: Use
```
estimation-fermi
```
if you need to calculate base rate from components
After: Use
```
bayesian-reasoning-calibration
```
to update from base rate with new evidence
Companion: Use
```
scout-mindset-bias-check
```
to validate you're not cherry-picking the reference class

前置技能： 如果你需要从组件计算基准比率，可以使用

estimation-fermi

后置技能： 拿到新证据后更新基准比率，可以使用

bayesian-reasoning-calibration

配套技能： 验证你没有 cherry-pick 参考类，可以使用

scout-mindset-bias-check

Resource Files

资源文件

📁 resources/

outside-view-principles.md - Theory and research
reference-class-selection.md - Systematic selection method
common-pitfalls.md - What to avoid

Ready to start? Choose a number from the menu above.

📁 resources/

outside-view-principles.md - 理论和研究
reference-class-selection.md - 系统选择方法
common-pitfalls.md - 避坑指南

reference-class-forecasting

Original

Translation

Reference Class Forecasting

Reference Class Forecasting

Table of Contents

目录

What is Reference Class Forecasting?

什么是参考类预测？

When to Use This Skill

何时使用该技能

Interactive Menu

交互菜单

Core Workflows

核心工作流

1. Find My Base Rate

1. 查找我的基准比率

Step 1: What are you forecasting?

步骤1：你要预测什么？

Step 2: Identify the Reference Class

步骤2：识别参考类

Step 3: Search for Historical Data

步骤3：搜索历史数据

Step 4: Set Your Anchor

步骤4：设定你的锚点

2. Test "This Time Is Different"

2. 验证“这次不一样”

The Uniqueness Audit

特殊性审计

Output

输出

3. Calculate Funnel Base Rates

3. 计算漏斗基准比率

When to Use

适用场景

The Funnel Method

漏斗方法

Process

流程

Common Funnels

常见漏斗场景

4. Validate My Reference Class

4. 验证我的参考类

The Three Tests

三项测试

Validation Checklist

验证 checklist

5. Learn the Framework

5. 学习框架

Resource Files

资源文件

Quick Reference

快速参考

The Outside View Commandments

外部视角戒律

One-Sentence Summary

一句话总结

Integration with Other Skills

和其他技能的集成

Resource Files

资源文件