proof-architect
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseProof Architect
证明架构师
Structured methodology for constructing and verifying mathematical proofs in statistical research
Use this skill when working on: mathematical proofs, theorem development, derivations, consistency proofs, asymptotic arguments, identification proofs, or verifying proof correctness.
统计研究中构建与验证数学证明的结构化方法
在以下场景中使用本方法:数学证明、定理推导、公式推演、一致性证明、渐近分析、识别证明或验证证明正确性。
Proof Structure Framework
证明结构框架
Standard Proof Components
标准证明组件
Every rigorous statistical proof should contain:
- Claim Statement - Precise mathematical statement of what is being proved
- Assumptions - All conditions required (clearly enumerated A1, A2, ...)
- Notation - Define all symbols before use
- Proof Body - Logical sequence of justified steps
- Conclusion - Explicit statement that claim is established
严谨的统计证明应包含以下部分:
- 声明陈述 - 明确表述待证明的数学命题
- 假设条件 - 清晰列举所有所需条件(标记为A1、A2……)
- 符号定义 - 在使用前定义所有符号
- 证明主体 - 逻辑连贯且每步有依据的推导过程
- 结论 - 明确声明命题已得证
Proof Skeleton Template
证明框架模板
latex
\begin{theorem}[Name]
\label{thm:name}
Under Assumptions \ref{A1}--\ref{An}, [precise claim].
\end{theorem}
\begin{proof}
The proof proceeds in [n] steps.
\textbf{Step 1: [Description]}
[Content with justification for each transition]
\textbf{Step 2: [Description]}
[Content]
\vdots
\textbf{Step n: Conclusion}
Combining Steps 1--[n-1], we obtain [result], completing the proof.
\end{proof}latex
\begin{theorem}[Name]
\label{thm:name}
Under Assumptions \ref{A1}--\ref{An}, [precise claim].
\end{theorem}
\begin{proof}
The proof proceeds in [n] steps.
\textbf{Step 1: [Description]}
[Content with justification for each transition]
\textbf{Step 2: [Description]}
[Content]
\vdots
\textbf{Step n: Conclusion}
Combining Steps 1--[n-1], we obtain [result], completing the proof.
\end{proof}Proof Types in Statistical Methodology
统计方法论中的证明类型
1. Identification Proofs
1. 识别证明
Goal: Show that a causal/statistical quantity is uniquely determined from observed data distribution.
Standard Structure:
- Define target estimand (e.g., $\psi = E[Y(a)]$)
- State identifying assumptions (consistency, positivity, exchangeability)
- Apply identification formula derivation
- Show formula depends only on observable quantities
Template:
latex
\begin{theorem}[Identification of $\psi$]
Under Assumptions \ref{A:consistency}--\ref{A:positivity}, the causal effect
$\psi = E[Y(a)]$ is identified by
\[
\psi = \int E[Y \mid A=a, X=x] \, dP(x).
\]
\end{theorem}
\begin{proof}
\begin{align}
E[Y(a)] &= E[E[Y(a) \mid X]] && \text{(law of iterated expectations)} \\
&= E[E[Y(a) \mid A=a, X]] && \text{(A\ref{A:exchangeability}: $Y(a) \indep A \mid X$)} \\
&= E[E[Y \mid A=a, X]] && \text{(A\ref{A:consistency}: $Y = Y(A)$)} \\
&= \int E[Y \mid A=a, X=x] \, dP(x) && \text{(definition)}
\end{align}
which depends only on the observed data distribution.
\end{proof}目标:证明因果/统计量可由观测数据分布唯一确定。
标准结构:
- 定义目标估计量(如:$\psi = E[Y(a)]$)
- 陈述识别假设(一致性、 positivity、可交换性)
- 推导识别公式
- 证明公式仅依赖于可观测变量
模板:
latex
\begin{theorem}[Identification of $\psi$]
Under Assumptions \ref{A:consistency}--\ref{A:positivity}, the causal effect
$\psi = E[Y(a)]$ is identified by
\[
\psi = \int E[Y \mid A=a, X=x] \, dP(x).
\]
\end{theorem}
\begin{proof}
\begin{align}
E[Y(a)] &= E[E[Y(a) \mid X]] && \text{(law of iterated expectations)} \\
&= E[E[Y(a) \mid A=a, X]] && \text{(A\ref{A:exchangeability}: $Y(a) \indep A \mid X$)} \\
&= E[E[Y \mid A=a, X]] && \text{(A\ref{A:consistency}: $Y = Y(A)$)} \\
&= \int E[Y \mid A=a, X=x] \, dP(x) && \text{(definition)}
\end{align}
which depends only on the observed data distribution.
\end{proof}2. Consistency Proofs
2. 一致性证明
Goal: Show that an estimator converges to the true parameter value.
Standard Structure:
- Define estimator $\hat{\theta}_n$
- Define target parameter $\theta_0$
- Establish convergence: $\hat{\theta}_n \xrightarrow{p} \theta_0$
Key Tools:
- Law of Large Numbers (LLN)
- Continuous Mapping Theorem
- Slutsky's Theorem
- M-estimation theory
Template:
latex
\begin{theorem}[Consistency]
Under Assumptions \ref{A1}--\ref{An}, $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{theorem}
\begin{proof}
Define $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$ and
$M(\theta) = E[m(O; \theta)]$.
\textbf{Step 1: Uniform convergence}
By [ULLN conditions], $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$.
\textbf{Step 2: Unique maximum}
$M(\theta)$ is uniquely maximized at $\theta_0$ (by identifiability).
\textbf{Step 3: Conclusion}
By standard M-estimation theory, Steps 1--2 imply $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{proof}目标:证明估计量收敛于真实参数值。
标准结构:
- 定义估计量 $\hat{\theta}_n$
- 定义目标参数 $\theta_0$
- 证明收敛性:$\hat{\theta}_n \xrightarrow{p} \theta_0$
核心工具:
- 大数定律(LLN)
- 连续映射定理
- Slutsky定理
- M估计理论
模板:
latex
\begin{theorem}[Consistency]
Under Assumptions \ref{A1}--\ref{An}, $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{theorem}
\begin{proof}
Define $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$ and
$M(\theta) = E[m(O; \theta)]$.
\textbf{Step 1: Uniform convergence}
By [ULLN conditions], $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$.
\textbf{Step 2: Unique maximum}
$M(\theta)$ is uniquely maximized at $\theta_0$ (by identifiability).
\textbf{Step 3: Conclusion}
By standard M-estimation theory, Steps 1--2 imply $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{proof}3. Asymptotic Normality Proofs
3. 渐近正态性证明
Goal: Establish $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$.
Standard Structure:
- Taylor expansion around true value
- Apply CLT to score/influence function
- Invert Hessian/information matrix
- State limiting distribution
Key Tools:
- Central Limit Theorem (CLT)
- Delta Method
- Influence Function Theory
- Semiparametric Efficiency Theory
Template:
latex
\begin{theorem}[Asymptotic Normality]
Under Assumptions \ref{A1}--\ref{An},
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)
\]
where $V = E[\phi(O)\phi(O)^\top]$ and $\phi$ is the influence function.
\end{theorem}
\begin{proof}
\textbf{Step 1: Score equation}
$\hat{\theta}_n$ solves $\mathbb{P}_n[\psi(O; \theta)] = 0$ where $\psi = \partial_\theta m$.
\textbf{Step 2: Taylor expansion}
\[
0 = \mathbb{P}_n[\psi(O; \hat{\theta}_n)] = \mathbb{P}_n[\psi(O; \theta_0)]
+ \mathbb{P}_n[\dot{\psi}(O; \tilde{\theta})](\hat{\theta}_n - \theta_0)
\]
\textbf{Step 3: Rearrangement}
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) = -\left(\mathbb{P}_n[\dot{\psi}]\right)^{-1}
\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)]
\]
\textbf{Step 4: Apply CLT}
$\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)] \xrightarrow{d} N(0, \text{Var}(\psi))$ by CLT.
\textbf{Step 5: Slutsky}
$\mathbb{P}_n[\dot{\psi}] \xrightarrow{p} E[\dot{\psi}]$ by WLLN. Apply Slutsky's theorem.
\end{proof}目标:证明 $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$。
标准结构:
- 在真实值附近进行泰勒展开
- 对得分/影响函数应用中心极限定理
- 反转海森矩阵/信息矩阵
- 陈述极限分布
核心工具:
- 中心极限定理(CLT)
- Delta方法
- 影响函数理论
- 半参数效率理论
模板:
latex
\begin{theorem}[Asymptotic Normality]
Under Assumptions \ref{A1}--\ref{An},
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)
\]
where $V = E[\phi(O)\phi(O)^\top]$ and $\phi$ is the influence function.
\end{theorem}
\begin{proof}
\textbf{Step 1: Score equation}
$\hat{\theta}_n$ solves $\mathbb{P}_n[\psi(O; \theta)] = 0$ where $\psi = \partial_\theta m$.
\textbf{Step 2: Taylor expansion}
\[
0 = \mathbb{P}_n[\psi(O; \hat{\theta}_n)] = \mathbb{P}_n[\psi(O; \theta_0)]
+ \mathbb{P}_n[\dot{\psi}(O; \tilde{\theta})](\hat{\theta}_n - \theta_0)
\]
\textbf{Step 3: Rearrangement}
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) = -\left(\mathbb{P}_n[\dot{\psi}]\right)^{-1}
\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)]
\]
\textbf{Step 4: Apply CLT}
$\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)] \xrightarrow{d} N(0, \text{Var}(\psi))$ by CLT.
\textbf{Step 5: Slutsky}
$\mathbb{P}_n[\dot{\psi}] \xrightarrow{p} E[\dot{\psi}]$ by WLLN. Apply Slutsky's theorem.
\end{proof}4. Efficiency Proofs
4. 效率证明
Goal: Show estimator achieves semiparametric efficiency bound.
Standard Structure:
- Characterize the tangent space
- Derive efficient influence function (EIF)
- Show estimator's influence function equals EIF
- Conclude variance achieves bound
Template:
latex
\begin{theorem}[Semiparametric Efficiency]
$\hat{\theta}_n$ is semiparametrically efficient with influence function
\[
\phi(O) = [optimal formula]
\]
achieving the efficiency bound $V_{\text{eff}} = E[\phi(O)^2]$.
\end{theorem}目标:证明估计量达到半参数效率边界。
标准结构:
- 刻画切空间
- 推导有效影响函数(EIF)
- 证明估计量的影响函数等于EIF
- 得出方差达到边界的结论
模板:
latex
\begin{theorem}[Semiparametric Efficiency]
$\hat{\theta}_n$ is semiparametrically efficient with influence function
\[
\phi(O) = [optimal formula]
\]
achieving the efficiency bound $V_{\text{eff}} = E[\phi(O)^2]$.
\end{theorem}5. Double Robustness Proofs
5. 双重稳健性证明
Goal: Show estimator is consistent if either nuisance model is correctly specified.
Standard Structure:
- Write estimating equation with both nuisance functions
- Show bias term is product of two errors
- Conclude: if either error is zero, estimator is consistent
Template:
latex
\begin{theorem}[Double Robustness]
The estimator $\hat{\psi}_{DR}$ is consistent if either:
\begin{enumerate}
\item The outcome model $\mu(a,x) = E[Y \mid A=a, X=x]$ is correctly specified, or
\item The propensity score $\pi(x) = P(A=1 \mid X=x)$ is correctly specified.
\end{enumerate}
\end{theorem}
\begin{proof}
The estimating equation has the form:
\[
\psi - \hat{\psi}_{DR} = E\left[\frac{(A-\pi)(Y-\mu)}{\pi(1-\pi)}\right] + o_p(1)
\]
The bias term $(A-\pi)(Y-\mu)$ is zero in expectation if either:
\begin{itemize}
\item $E[A-\pi \mid X] = 0$ (propensity correctly specified), or
\item $E[Y-\mu \mid A, X] = 0$ (outcome correctly specified).
\end{itemize}
\end{proof}目标:证明只要任一干扰模型设定正确,估计量就具有一致性。
标准结构:
- 写出包含两个干扰函数的估计方程
- 证明偏差项是两个误差的乘积
- 得出结论:若任一误差为零,估计量即一致
模板:
latex
\begin{theorem}[Double Robustness]
The estimator $\hat{\psi}_{DR}$ is consistent if either:
\begin{enumerate}
\item The outcome model $\mu(a,x) = E[Y \mid A=a, X=x]$ is correctly specified, or
\item The propensity score $\pi(x) = P(A=1 \mid X=x)$ is correctly specified.
\end{enumerate}
\end{theorem}
\begin{proof}
The estimating equation has the form:
\[
\psi - \hat{\psi}_{DR} = E\left[\frac{(A-\pi)(Y-\mu)}{\pi(1-\pi)}\right] + o_p(1)
\]
The bias term $(A-\pi)(Y-\mu)$ is zero in expectation if either:
\begin{itemize}
\item $E[A-\pi \mid X] = 0$ (propensity correctly specified), or
\item $E[Y-\mu \mid A, X] = 0$ (outcome correctly specified).
\end{itemize}
\end{proof}Proof Verification Checklist
证明验证检查清单
Level 1: Structure Check
第一级:结构检查
- Claim clearly stated with all conditions
- All notation defined before use
- Logical flow apparent (steps labeled)
- Each step has explicit justification
- Conclusion explicitly stated
- 声明清晰且包含所有条件
- 所有符号在使用前已定义
- 逻辑流程明确(步骤已标记)
- 每一步都有明确依据
- 结论已明确陈述
Level 2: Step Validation
第二级:步骤验证
For each step, verify:
- Mathematical operation is valid
- Cited results apply (check conditions)
- Inequalities have correct direction
- Limits/integrals converge
- Dimensions/types match
对每个步骤,验证:
- 数学操作合法
- 引用的定理适用(检查条件)
- 不等式方向正确
- 极限/积分收敛
- 维度/类型匹配
Level 3: Edge Cases
第三级:边缘情况
- Boundary cases handled (n=1, p=0, etc.)
- Degenerate cases addressed
- Assumptions actually used (not vacuous)
- What happens at assumption boundaries?
- 边界情况已处理(如n=1、p=0等)
- 退化情况已说明
- 假设确实被使用(非冗余)
- 假设边界处的情况已考虑
Level 4: Consistency
第四级:一致性
- Result matches intuition
- Special cases recover known results
- Numerical verification possible?
- Consistent with simulation evidence?
- 结果符合直觉
- 特殊情况可还原已知结论
- 可进行数值验证
- 与模拟结果一致
Common Proof Errors
常见证明错误
Technical Errors
技术错误
| Error | Example | Fix |
|---|---|---|
| Interchanging limits | $\lim \sum \neq \sum \lim$ | Verify DCT/MCT conditions |
| Division by zero | $1/\pi(x)$ when $\pi(x)=0$ | State positivity assumption |
| Incorrect conditioning | $E[Y \mid A,X] \neq E[Y \mid X]$ | Check independence structure |
| Wrong norm | $|f|2$ vs $|f|\infty$ | Verify which space |
| Missing measurability | Random variable not measurable | State measurability |
| 错误类型 | 示例 | 修正方法 |
|---|---|---|
| 极限交换顺序 | $\lim \sum \neq \sum \lim$ | 验证控制收敛定理/单调收敛定理条件 |
| 除以零 | $1/\pi(x)$ 当 $\pi(x)=0$ | 明确positivity假设 |
| 条件错误 | $E[Y \mid A,X] \neq E[Y \mid X]$ | 检查独立结构 |
| 范数错误 | $|f|2$ vs $|f|\infty$ | 确认使用的空间 |
| 可测性缺失 | 随机变量不可测 | 明确可测性假设 |
Logical Errors
逻辑错误
| Error | Example | Fix |
|---|---|---|
| Circular reasoning | Using result to prove itself | Check logical dependency |
| Unstated assumption | "Clearly, X holds" | Make all assumptions explicit |
| Incorrect quantifier | $\exists$ vs $\forall$ | Be precise about scope |
| Missing case | Not handling $\theta = 0$ | Enumerate all cases |
| 错误类型 | 示例 | 修正方法 |
|---|---|---|
| 循环论证 | 用结论证明结论 | 检查逻辑依赖关系 |
| 未声明假设 | "显然,X成立" | 明确所有假设 |
| 量词错误 | $\exists$ vs $\forall$ | 精确表述范围 |
| 遗漏情况 | 未处理$\theta = 0$ | 枚举所有情况 |
Statistical Errors
统计错误
| Error | Example | Fix |
|---|---|---|
| Confusing $\xrightarrow{p}$ and $\xrightarrow{d}$ | Different convergence modes | State which mode |
| Ignoring dependence | Applying iid CLT to dependent data | Check independence |
| Wrong variance | Using population variance for sample | Distinguish estimator/parameter |
| 错误类型 | 示例 | 修正方法 |
|---|---|---|
| 混淆收敛模式 | $\xrightarrow{p}$ 和 $\xrightarrow{d}$ 混淆 | 明确收敛模式 |
| 忽略相关性 | 对相依数据应用独立同分布中心极限定理 | 检查独立性 |
| 方差错误 | 用总体方差代替样本方差 | 区分估计量与参数 |
Notation Standards (VanderWeele Convention)
符号标准(VanderWeele约定)
Causal Quantities
因果量
| Symbol | Meaning |
|---|---|
| $Y(a)$ | Potential outcome under treatment $a$ |
| $Y(a,m)$ | Potential outcome under $A=a$, $M=m$ |
| $M(a)$ | Potential mediator under treatment $a$ |
| $NDE$ | Natural Direct Effect: $E[Y(1,M(0)) - Y(0,M(0))]$ |
| $NIE$ | Natural Indirect Effect: $E[Y(1,M(1)) - Y(1,M(0))]$ |
| $TE$ | Total Effect: $E[Y(1) - Y(0)] = NDE + NIE$ |
| $P_M$ | Proportion Mediated: $NIE/TE$ |
| 符号 | 含义 |
|---|---|
| $Y(a)$ | 处理a下的潜在结果 |
| $Y(a,m)$ | 处理A=a、中介M=m下的潜在结果 |
| $M(a)$ | 处理a下的潜在中介 |
| $NDE$ | 自然直接效应:$E[Y(1,M(0)) - Y(0,M(0))]$ |
| $NIE$ | 自然间接效应:$E[Y(1,M(1)) - Y(1,M(0))]$ |
| $TE$ | 总效应:$E[Y(1) - Y(0)] = NDE + NIE$ |
| $P_M$ | 中介比例:$NIE/TE$ |
Statistical Quantities
统计量
| Symbol | Meaning |
|---|---|
| $\theta_0$ | True parameter value |
| $\hat{\theta}_n$ | Estimator based on $n$ observations |
| $\phi(O)$ | Influence function |
| $\mathbb{P}_n$ | Empirical measure |
| $\mathbb{G}_n$ | Empirical process: $\sqrt{n}(\mathbb{P}_n - P)$ |
| 符号 | 含义 |
|---|---|
| $\theta_0$ | 真实参数值 |
| $\hat{\theta}_n$ | 基于n个观测的估计量 |
| $\phi(O)$ | 影响函数 |
| $\mathbb{P}_n$ | 经验测度 |
| $\mathbb{G}_n$ | 经验过程:$\sqrt{n}(\mathbb{P}_n - P)$ |
Convergence
收敛性
| Symbol | Meaning |
|---|---|
| $\xrightarrow{p}$ | Convergence in probability |
| $\xrightarrow{d}$ | Convergence in distribution |
| $\xrightarrow{a.s.}$ | Almost sure convergence |
| $O_p(1)$ | Bounded in probability |
| $o_p(1)$ | Converges to zero in probability |
| 符号 | 含义 |
|---|---|
| $\xrightarrow{p}$ | 依概率收敛 |
| $\xrightarrow{d}$ | 依分布收敛 |
| $\xrightarrow{a.s.}$ | 几乎必然收敛 |
| $O_p(1)$ | 依概率有界 |
| $o_p(1)$ | 依概率收敛于0 |
Proof Construction Workflow
证明构建工作流
Step 1: Understand the Goal
步骤1:明确目标
- What exactly needs to be proved?
- What type of proof is this? (identification, consistency, etc.)
- What are the key challenges?
- 具体需要证明什么?
- 这属于哪种类型的证明?(识别、一致性等)
- 核心挑战是什么?
Step 2: Gather Tools
步骤2:准备工具
- What theorems/lemmas are available?
- What regularity conditions will be needed?
- Are there similar proofs to reference?
- 有哪些可用的定理/引理?
- 需要哪些正则条件?
- 有没有可参考的类似证明?
Step 3: Outline Structure
步骤3:构建大纲
- Break into logical steps
- Identify the key technical step
- Plan how to handle edge cases
- 拆分为逻辑步骤
- 确定关键技术步骤
- 规划边缘情况的处理
Step 4: Write First Draft
步骤4:撰写初稿
- Fill in details for each step
- Be explicit about every transition
- Note where conditions are used
- 填充每个步骤的细节
- 明确每一步的推导依据
- 标记假设的使用位置
Step 5: Verify
步骤5:验证
- Run through verification checklist
- Check each step independently
- Test special cases
- 对照检查清单逐一验证
- 独立检查每个步骤
- 测试特殊情况
Step 6: Polish
步骤6:优化
- Improve notation consistency
- Add intuitive explanations
- Ensure assumptions are minimal
- 提升符号一致性
- 添加直观解释
- 确保假设尽可能少
Integration with Other Skills
与其他方法的集成
This skill works with:
- identification-theory - For causal identification proofs
- asymptotic-theory - For inference proofs
- methods-paper-writer - For presenting proofs in manuscripts
- proof-verifier - For systematic verification
Version: 1.0
Created: 2025-12-08
Domain: Mathematical Statistics, Causal Inference
本方法可与以下方法结合使用:
- identification-theory - 用于因果识别证明
- asymptotic-theory - 用于推断证明
- methods-paper-writer - 用于在论文中呈现证明
- proof-verifier - 用于系统验证
Version: 1.0
Created: 2025-12-08
领域: 数理统计、因果推断
Key References
核心参考文献
- van der Vaart
- Lehmann
- Casella
- Bickel
- Serfling
- van der Vaart
- Lehmann
- Casella
- Bickel
- Serfling