reference-class-forecasting
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseReference Class Forecasting
Reference Class Forecasting
Table of Contents
目录
What is Reference Class Forecasting?
什么是参考类预测?
Reference class forecasting is the practice of anchoring predictions in historical reality by identifying a class of similar past events and using their statistical frequency as a starting point. This is the "Outside View" - looking at what usually happens to things like this, before getting distracted by the specific details of "this case."
Core Principle: Assume this event is average until you have specific evidence proving otherwise.
Why It Matters:
- Defeats "inside view" bias (thinking your case is unique)
- Prevents base rate neglect (ignoring statistical baselines)
- Provides objective anchor before subjective analysis
- Forces humility and statistical thinking
参考类预测是一种将预测锚定在历史事实中的实践方法,通过识别一类相似的历史事件,将其统计频率作为预测的起点。这就是“外部视角”——在被“本次案例”的具体细节干扰之前,先看同类事件通常会出现什么结果。
核心原则: 在你拿到能证明特殊性的明确证据之前,默认当前事件属于平均水平。
价值:
- 破除“内部视角”偏见(总觉得自己的案例独一无二)
- 避免基准比率忽视问题(忽略统计基线)
- 在主观分析前提供客观锚点
- 倒逼谦逊态度和统计思维
When to Use This Skill
何时使用该技能
Use this skill when:
- Starting any forecast - Establish base rate FIRST
- Someone says "this time is different" - Test if it really is
- Making predictions about success/failure - Find historical frequencies
- Evaluating startup/project outcomes - Anchor in class statistics
- Challenged by confident predictions - Ground in reality
- Before detailed analysis - Get outside view baseline
Do NOT use when:
- Event has literally never happened (novel situation)
- Working with deterministic physical laws
- Pure chaos with no patterns
适用场景:
- 启动任何预测工作时——首先建立基准比率
- 有人说“这次不一样”时——验证这一说法是否成立
- 做成功/失败相关预测时——查找历史发生频率
- 评估创业/项目结果时——以同类统计数据为锚点
- 质疑过于笃定的预测时——用现实情况做依据
- 开展详细分析前——获取外部视角基线
不适用场景:
- 完全没有先例的事件(全新情况)
- 遵循确定性物理定律的事件
- 无规律可循的纯混沌事件
Interactive Menu
交互菜单
What would you like to do?
你想要做什么?
Core Workflows
核心工作流
1. Find My Base Rate - Identify reference class and get statistical baseline
- Guided process to select correct reference class
- Search strategies for finding historical frequencies
- Validation that you have the right anchor
2. Test "This Time Is Different" - Challenge uniqueness claims
- Reversal test for uniqueness bias
- Similarity matching framework
- Burden of proof calculator
3. Calculate Funnel Base Rates - Multi-stage probability chains
- When no single base rate exists
- Sequential probability modeling
- Product rule for compound events
4. Validate My Reference Class - Ensure you chose the right comparison set
- Too broad vs too narrow test
- Homogeneity check
- Sample size evaluation
5. Learn the Framework - Deep dive into methodology
- Read Outside View Principles
- Read Reference Class Selection Guide
- Read Common Pitfalls
6. Exit - Return to main forecasting workflow
1. Find My Base Rate
1. 查找我的基准比率
Let's establish your statistical baseline.
我们来建立你的统计基线。
Step 1: What are you forecasting?
步骤1:你要预测什么?
Tell me the specific event or outcome you're predicting.
Example prompts:
- "Will this startup succeed?"
- "Will this bill pass Congress?"
- "Will this project launch on time?"
告诉我你要预测的具体事件或结果。
示例提示:
- "这家创业公司会成功吗?"
- "这项法案会在国会通过吗?"
- "这个项目会按时上线吗?"
Step 2: Identify the Reference Class
步骤2:识别参考类
I'll help you identify what bucket this belongs to.
Framework:
- Too broad: "All companies" → meaningless
- Just right: "Seed-stage B2B SaaS startups in fintech"
- Too narrow: "Companies founded by people named Steve in 2024" → no data
Key Questions:
- What type of entity is this? (company, bill, project, person, etc.)
- What stage/size/category?
- What industry/domain?
- What time period is relevant?
I'll work with you to refine this until we have a specific, searchable class.
我会帮你确定该事件属于哪个分类。
框架:
- 过宽: "所有公司" → 无意义
- 合适: "金融科技领域的种子阶段B2B SaaS创业公司"
- 过窄: "2024年由名叫Steve的人创立的公司" → 无数据
核心问题:
- 该事件属于什么类型的主体?(公司、法案、项目、人物等)
- 属于什么阶段/规模/类别?
- 属于什么行业/领域?
- 相关的时间范围是什么?
我会和你一起优化分类,直到得到一个明确的、可搜索的参考类。
Step 3: Search for Historical Data
步骤3:搜索历史数据
I'll help you find the base rate using:
- Web search for published statistics
- Academic studies on success rates
- Government/industry reports
- Proxy metrics if direct data unavailable
Search Strategy:
"historical success rate of [reference class]"
"[reference class] failure statistics"
"[reference class] survival rate"
"what percentage of [reference class]"我会通过以下渠道帮你查找基准比率:
- 网页搜索公开的统计数据
- 学术研究中的成功率数据
- 政府/行业报告
- 代理指标(如果没有直接数据)
搜索策略:
"historical success rate of [reference class]"
"[reference class] failure statistics"
"[reference class] survival rate"
"what percentage of [reference class]"Step 4: Set Your Anchor
步骤4:设定你的锚点
Once we find the base rate, that becomes your starting probability.
The Rule:
You are NOT allowed to move from this base rate until you have specific, evidence-based reasons in your "inside view" analysis.
Default anchors if no data found:
- Novel innovation: 10-20% (most innovations fail)
- Established industry: 50% (uncertain)
- Regulated/proven process: 70-80% (systems work)
Next: Return to menu or proceed to inside view analysis.
我们找到基准比率后,它就会成为你的起始概率。
规则:
在你的“内部视角”分析中拿到有证据支撑的明确理由之前,你不允许偏离这个基准比率。
找不到数据时的默认锚点:
- 全新创新:10-20%(绝大多数创新都会失败)
- 成熟行业:50%(不确定性高)
- 受监管/已验证的流程:70-80%(体系可保障落地)
下一步: 返回菜单或继续开展内部视角分析。
2. Test "This Time Is Different"
2. 验证“这次不一样”
Challenge uniqueness bias.
When someone (including yourself) believes "this case is special," we need to stress-test that belief.
挑战特殊性偏见。
当有人(包括你自己)认为“这个案例很特殊”时,我们需要对这个观点做压力测试。
The Uniqueness Audit
特殊性审计
Question 1: Similarity Matching
- What are 5 historical cases that are most similar to this one?
- For each, what was the outcome?
- How is your case materially different from these?
Question 2: The Reversal Test
- If someone claimed a different case was "unique" for the same reasons you're claiming, would you accept it?
- Are you applying special pleading?
Question 3: Burden of Proof
The base rate says [X]%. You claim it should be [Y]%.
Calculate the gap:
|Y - X|Required evidence strength:
- Gap < 10%: Minimal evidence needed
- Gap 10-30%: Moderate evidence needed (2-3 specific factors)
- Gap > 30%: Extraordinary evidence needed (multiple independent strong signals)
问题1:相似度匹配
- 和该案例最相似的5个历史案例是什么?
- 每个案例的结果是什么?
- 你的案例和这些案例有什么实质性差异?
问题2:反转测试
- 如果其他人用和你一样的理由宣称另一个案例“独一无二”,你会认可吗?
- 你是不是在使用双重标准?
问题3:举证责任
基准比率显示概率为[X]%,你认为应该是[Y]%。
计算差值:
|Y - X|所需证据强度:
- 差值<10%:需要极少证据
- 差值10-30%:需要中等强度证据(2-3个明确因素)
- 差值>30%:需要极强证据(多个独立的强信号)
Output
输出
I'll tell you:
- Whether "this time is different" is justified
- How much you can reasonably adjust from the base rate
- What evidence would be needed to justify larger moves
Next: Return to menu
3. Calculate Funnel Base Rates
3. 计算漏斗基准比率
For multi-stage processes without a single base rate.
适用于没有单一基准比率的多阶段流程。
When to Use
适用场景
- No direct statistic exists (e.g., "success rate of X")
- Event requires multiple sequential steps
- Each stage has independent probabilities
- 没有直接统计数据(比如“X的成功率”)
- 事件需要多个连续步骤才能完成
- 每个阶段的概率相互独立
The Funnel Method
漏斗方法
Example: "Will Bill X become law?"
No direct data on "Bill X success rate," but we can model the funnel:
-
Stage 1: Bills introduced → Bills that reach committee
- P(committee | introduced) = ?
-
Stage 2: Bills in committee → Bills that reach floor vote
- P(floor | committee) = ?
-
Stage 3: Bills voted on → Bills that pass
- P(pass | floor vote) = ?
Final Base Rate:
P(law) = P(committee) × P(floor) × P(pass)示例:“X法案会成为法律吗?”
没有“X法案成功率”的直接数据,但我们可以搭建漏斗模型:
-
阶段1: 提交的法案 → 进入委员会的法案
- P(进入委员会 | 已提交) = ?
-
阶段2: 委员会中的法案 → 进入全院投票的法案
- P(进入全院投票 | 在委员会中) = ?
-
阶段3: 参与投票的法案 → 获得通过的法案
- P(通过 | 全院投票) = ?
最终基准比率:
P(成为法律) = P(进入委员会) × P(进入全院投票) × P(通过)Process
流程
I'll help you:
- Decompose the event into sequential stages
- Search for statistics on each stage
- Multiply probabilities using the product rule
- Validate the model (are stages truly independent?)
我会帮你:
- 拆解事件为多个连续阶段
- 搜索每个阶段的统计数据
- 用乘积法则计算总概率
- 验证模型(各阶段是否真的独立?)
Common Funnels
常见漏斗场景
- Startup success: Seed → Series A → Profitability → Exit
- Drug approval: Discovery → Trials → FDA → Market
- Project delivery: Planning → Development → Testing → Launch
Next: Return to menu
4. Validate My Reference Class
4. 验证我的参考类
Ensure you chose the right comparison set.
确保你选择了正确的对比集。
The Three Tests
三项测试
Test 1: Homogeneity
- Are the members of this class actually similar enough?
- Is there high variance in outcomes?
- Should you subdivide further?
Example: "Tech startups" is too broad (consumer vs B2B vs hardware are very different). Subdivide.
Test 2: Sample Size
- Do you have enough historical cases?
- Minimum: 20-30 cases for meaningful statistics
- If N < 20: Widen the class or acknowledge high uncertainty
Test 3: Relevance
- Have conditions changed since the historical data?
- Are there structural differences (regulation, technology, market)?
- Time decay: Data from >10 years ago may be stale
测试1:同质性
- 该类别的成员相似度足够高吗?
- 结果方差大吗?
- 你需要进一步拆分分类吗?
示例:“科技创业公司”分类过宽(消费级、B2B、硬件类差异极大),需要拆分。
测试2:样本量
- 你有足够多的历史案例吗?
- 最低要求:20-30个案例才能得到有意义的统计结果
- 如果样本量<20:拓宽分类范围或承认不确定性很高
测试3:相关性
- 历史数据对应的环境和现在相比有没有变化?
- 是否存在结构性差异(监管、技术、市场)?
- 时间衰减:超过10年的旧数据可能已经失效
Validation Checklist
验证 checklist
I'll walk you through:
- Class has 20+ historical examples
- Members are reasonably homogeneous
- Data is from relevant time period
- No major structural changes since data collection
- Class is specific enough to be meaningful
- Class is broad enough to have data
Output: Confidence level in your reference class (High/Medium/Low)
Next: Return to menu
我会带你逐一确认:
- 分类有20个以上的历史案例
- 分类成员的同质性达标
- 数据来自相关的时间范围
- 数据收集后没有发生重大结构性变化
- 分类足够明确,有实际意义
- 分类足够宽泛,有对应数据
输出: 你的参考类的可信度(高/中/低)
下一步: 返回菜单
5. Learn the Framework
5. 学习框架
Deep dive into the methodology.
深入了解方法论。
Resource Files
资源文件
📄 Outside View Principles
- Statistical thinking vs narrative thinking
- Why the outside view beats experts
- Kahneman's planning fallacy research
- When outside view fails
📄 Reference Class Selection Guide
- Systematic method for choosing comparison sets
- Balancing specificity vs data availability
- Similarity metrics and matching
- Edge cases and judgment calls
📄 Common Pitfalls
- Base rate neglect examples
- "This time is different" bias
- Overfitting to small samples
- Ignoring regression to the mean
- Availability bias in class selection
Next: Return to menu
📄 外部视角原则
- 统计思维 vs 叙事思维
- 为什么外部视角比专家判断更准确
- 卡尼曼的规划谬误研究
- 外部视角失效的场景
📄 参考类选择指南
- 选择对比集的系统方法
- 平衡 specificity 和数据可用性
- 相似度指标和匹配方法
- 边缘场景和判断调用
📄 常见陷阱
- 基准比率忽视案例
- “这次不一样”偏见
- 小样本过拟合
- 忽略回归均值
- 参考类选择中的可得性偏见
下一步: 返回菜单
Quick Reference
快速参考
The Outside View Commandments
外部视角戒律
- Base Rate First: Establish statistical baseline BEFORE analyzing specifics
- Assume Average: Treat case as typical until proven otherwise
- Burden of Proof: Large deviations from base rate require strong evidence
- Class Precision: Reference class should be specific but data-rich
- No Narratives: Resist compelling stories; trust frequencies
- 基准比率优先: 在分析具体细节前先建立统计基线
- 默认平均水平: 在得到反证前,默认当前案例属于典型水平
- 举证责任: 大幅偏离基准比率需要强证据支撑
- 分类精度: 参考类需要足够明确,同时有充足数据
- 拒绝叙事: 不要被吸引人的故事干扰,信任频率数据
One-Sentence Summary
一句话总结
Find what usually happens to things like this, start there, and only move with evidence.
找到同类事件的通常结果,以此为起点,只有拿到证据时再调整预测。
Integration with Other Skills
和其他技能的集成
- Before: Use if you need to calculate base rate from components
estimation-fermi - After: Use to update from base rate with new evidence
bayesian-reasoning-calibration - Companion: Use to validate you're not cherry-picking the reference class
scout-mindset-bias-check
前置技能: 如果你需要从组件计算基准比率,可以使用
后置技能: 拿到新证据后更新基准比率,可以使用
配套技能: 验证你没有 cherry-pick 参考类,可以使用
estimation-fermibayesian-reasoning-calibrationscout-mindset-bias-checkResource Files
资源文件
📁 resources/
- outside-view-principles.md - Theory and research
- reference-class-selection.md - Systematic selection method
- common-pitfalls.md - What to avoid
Ready to start? Choose a number from the menu above.
📁 resources/
- outside-view-principles.md - 理论和研究
- reference-class-selection.md - 系统选择方法
- common-pitfalls.md - 避坑指南
准备好开始了吗?从上方菜单选择一个选项。