proof-architect

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Proof Architect

证明架构师

Structured methodology for constructing and verifying mathematical proofs in statistical research
Use this skill when working on: mathematical proofs, theorem development, derivations, consistency proofs, asymptotic arguments, identification proofs, or verifying proof correctness.

统计研究中构建与验证数学证明的结构化方法
在以下场景中使用本方法:数学证明、定理推导、公式推演、一致性证明、渐近分析、识别证明或验证证明正确性。

Proof Structure Framework

证明结构框架

Standard Proof Components

标准证明组件

Every rigorous statistical proof should contain:
  1. Claim Statement - Precise mathematical statement of what is being proved
  2. Assumptions - All conditions required (clearly enumerated A1, A2, ...)
  3. Notation - Define all symbols before use
  4. Proof Body - Logical sequence of justified steps
  5. Conclusion - Explicit statement that claim is established
严谨的统计证明应包含以下部分:
  1. 声明陈述 - 明确表述待证明的数学命题
  2. 假设条件 - 清晰列举所有所需条件(标记为A1、A2……)
  3. 符号定义 - 在使用前定义所有符号
  4. 证明主体 - 逻辑连贯且每步有依据的推导过程
  5. 结论 - 明确声明命题已得证

Proof Skeleton Template

证明框架模板

latex
\begin{theorem}[Name]
\label{thm:name}
Under Assumptions \ref{A1}--\ref{An}, [precise claim].
\end{theorem}

\begin{proof}
The proof proceeds in [n] steps.

\textbf{Step 1: [Description]}
[Content with justification for each transition]

\textbf{Step 2: [Description]}
[Content]

\vdots

\textbf{Step n: Conclusion}
Combining Steps 1--[n-1], we obtain [result], completing the proof.
\end{proof}

latex
\begin{theorem}[Name]
\label{thm:name}
Under Assumptions \ref{A1}--\ref{An}, [precise claim].
\end{theorem}

\begin{proof}
The proof proceeds in [n] steps.

\textbf{Step 1: [Description]}
[Content with justification for each transition]

\textbf{Step 2: [Description]}
[Content]

\vdots

\textbf{Step n: Conclusion}
Combining Steps 1--[n-1], we obtain [result], completing the proof.
\end{proof}

Proof Types in Statistical Methodology

统计方法论中的证明类型

1. Identification Proofs

1. 识别证明

Goal: Show that a causal/statistical quantity is uniquely determined from observed data distribution.
Standard Structure:
  1. Define target estimand (e.g., $\psi = E[Y(a)]$)
  2. State identifying assumptions (consistency, positivity, exchangeability)
  3. Apply identification formula derivation
  4. Show formula depends only on observable quantities
Template:
latex
\begin{theorem}[Identification of $\psi$]
Under Assumptions \ref{A:consistency}--\ref{A:positivity}, the causal effect
$\psi = E[Y(a)]$ is identified by
\[
\psi = \int E[Y \mid A=a, X=x] \, dP(x).
\]
\end{theorem}

\begin{proof}
\begin{align}
E[Y(a)] &= E[E[Y(a) \mid X]] && \text{(law of iterated expectations)} \\
        &= E[E[Y(a) \mid A=a, X]] && \text{(A\ref{A:exchangeability}: $Y(a) \indep A \mid X$)} \\
        &= E[E[Y \mid A=a, X]] && \text{(A\ref{A:consistency}: $Y = Y(A)$)} \\
        &= \int E[Y \mid A=a, X=x] \, dP(x) && \text{(definition)}
\end{align}
which depends only on the observed data distribution.
\end{proof}
目标:证明因果/统计量可由观测数据分布唯一确定。
标准结构:
  1. 定义目标估计量(如:$\psi = E[Y(a)]$)
  2. 陈述识别假设(一致性、 positivity、可交换性)
  3. 推导识别公式
  4. 证明公式仅依赖于可观测变量
模板:
latex
\begin{theorem}[Identification of $\psi$]
Under Assumptions \ref{A:consistency}--\ref{A:positivity}, the causal effect
$\psi = E[Y(a)]$ is identified by
\[
\psi = \int E[Y \mid A=a, X=x] \, dP(x).
\]
\end{theorem}

\begin{proof}
\begin{align}
E[Y(a)] &= E[E[Y(a) \mid X]] && \text{(law of iterated expectations)} \\
        &= E[E[Y(a) \mid A=a, X]] && \text{(A\ref{A:exchangeability}: $Y(a) \indep A \mid X$)} \\
        &= E[E[Y \mid A=a, X]] && \text{(A\ref{A:consistency}: $Y = Y(A)$)} \\
        &= \int E[Y \mid A=a, X=x] \, dP(x) && \text{(definition)}
\end{align}
which depends only on the observed data distribution.
\end{proof}

2. Consistency Proofs

2. 一致性证明

Goal: Show that an estimator converges to the true parameter value.
Standard Structure:
  1. Define estimator $\hat{\theta}_n$
  2. Define target parameter $\theta_0$
  3. Establish convergence: $\hat{\theta}_n \xrightarrow{p} \theta_0$
Key Tools:
  • Law of Large Numbers (LLN)
  • Continuous Mapping Theorem
  • Slutsky's Theorem
  • M-estimation theory
Template:
latex
\begin{theorem}[Consistency]
Under Assumptions \ref{A1}--\ref{An}, $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{theorem}

\begin{proof}
Define $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$ and
$M(\theta) = E[m(O; \theta)]$.

\textbf{Step 1: Uniform convergence}
By [ULLN conditions], $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$.

\textbf{Step 2: Unique maximum}
$M(\theta)$ is uniquely maximized at $\theta_0$ (by identifiability).

\textbf{Step 3: Conclusion}
By standard M-estimation theory, Steps 1--2 imply $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{proof}
目标:证明估计量收敛于真实参数值。
标准结构:
  1. 定义估计量 $\hat{\theta}_n$
  2. 定义目标参数 $\theta_0$
  3. 证明收敛性:$\hat{\theta}_n \xrightarrow{p} \theta_0$
核心工具:
  • 大数定律(LLN)
  • 连续映射定理
  • Slutsky定理
  • M估计理论
模板:
latex
\begin{theorem}[Consistency]
Under Assumptions \ref{A1}--\ref{An}, $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{theorem}

\begin{proof}
Define $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$ and
$M(\theta) = E[m(O; \theta)]$.

\textbf{Step 1: Uniform convergence}
By [ULLN conditions], $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$.

\textbf{Step 2: Unique maximum}
$M(\theta)$ is uniquely maximized at $\theta_0$ (by identifiability).

\textbf{Step 3: Conclusion}
By standard M-estimation theory, Steps 1--2 imply $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{proof}

3. Asymptotic Normality Proofs

3. 渐近正态性证明

Goal: Establish $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$.
Standard Structure:
  1. Taylor expansion around true value
  2. Apply CLT to score/influence function
  3. Invert Hessian/information matrix
  4. State limiting distribution
Key Tools:
  • Central Limit Theorem (CLT)
  • Delta Method
  • Influence Function Theory
  • Semiparametric Efficiency Theory
Template:
latex
\begin{theorem}[Asymptotic Normality]
Under Assumptions \ref{A1}--\ref{An},
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)
\]
where $V = E[\phi(O)\phi(O)^\top]$ and $\phi$ is the influence function.
\end{theorem}

\begin{proof}
\textbf{Step 1: Score equation}
$\hat{\theta}_n$ solves $\mathbb{P}_n[\psi(O; \theta)] = 0$ where $\psi = \partial_\theta m$.

\textbf{Step 2: Taylor expansion}
\[
0 = \mathbb{P}_n[\psi(O; \hat{\theta}_n)] = \mathbb{P}_n[\psi(O; \theta_0)]
    + \mathbb{P}_n[\dot{\psi}(O; \tilde{\theta})](\hat{\theta}_n - \theta_0)
\]

\textbf{Step 3: Rearrangement}
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) = -\left(\mathbb{P}_n[\dot{\psi}]\right)^{-1}
    \sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)]
\]

\textbf{Step 4: Apply CLT}
$\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)] \xrightarrow{d} N(0, \text{Var}(\psi))$ by CLT.

\textbf{Step 5: Slutsky}
$\mathbb{P}_n[\dot{\psi}] \xrightarrow{p} E[\dot{\psi}]$ by WLLN. Apply Slutsky's theorem.
\end{proof}
目标:证明 $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$。
标准结构:
  1. 在真实值附近进行泰勒展开
  2. 对得分/影响函数应用中心极限定理
  3. 反转海森矩阵/信息矩阵
  4. 陈述极限分布
核心工具:
  • 中心极限定理(CLT)
  • Delta方法
  • 影响函数理论
  • 半参数效率理论
模板:
latex
\begin{theorem}[Asymptotic Normality]
Under Assumptions \ref{A1}--\ref{An},
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)
\]
where $V = E[\phi(O)\phi(O)^\top]$ and $\phi$ is the influence function.
\end{theorem}

\begin{proof}
\textbf{Step 1: Score equation}
$\hat{\theta}_n$ solves $\mathbb{P}_n[\psi(O; \theta)] = 0$ where $\psi = \partial_\theta m$.

\textbf{Step 2: Taylor expansion}
\[
0 = \mathbb{P}_n[\psi(O; \hat{\theta}_n)] = \mathbb{P}_n[\psi(O; \theta_0)]
    + \mathbb{P}_n[\dot{\psi}(O; \tilde{\theta})](\hat{\theta}_n - \theta_0)
\]

\textbf{Step 3: Rearrangement}
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) = -\left(\mathbb{P}_n[\dot{\psi}]\right)^{-1}
    \sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)]
\]

\textbf{Step 4: Apply CLT}
$\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)] \xrightarrow{d} N(0, \text{Var}(\psi))$ by CLT.

\textbf{Step 5: Slutsky}
$\mathbb{P}_n[\dot{\psi}] \xrightarrow{p} E[\dot{\psi}]$ by WLLN. Apply Slutsky's theorem.
\end{proof}

4. Efficiency Proofs

4. 效率证明

Goal: Show estimator achieves semiparametric efficiency bound.
Standard Structure:
  1. Characterize the tangent space
  2. Derive efficient influence function (EIF)
  3. Show estimator's influence function equals EIF
  4. Conclude variance achieves bound
Template:
latex
\begin{theorem}[Semiparametric Efficiency]
$\hat{\theta}_n$ is semiparametrically efficient with influence function
\[
\phi(O) = [optimal formula]
\]
achieving the efficiency bound $V_{\text{eff}} = E[\phi(O)^2]$.
\end{theorem}
目标:证明估计量达到半参数效率边界。
标准结构:
  1. 刻画切空间
  2. 推导有效影响函数(EIF)
  3. 证明估计量的影响函数等于EIF
  4. 得出方差达到边界的结论
模板:
latex
\begin{theorem}[Semiparametric Efficiency]
$\hat{\theta}_n$ is semiparametrically efficient with influence function
\[
\phi(O) = [optimal formula]
\]
achieving the efficiency bound $V_{\text{eff}} = E[\phi(O)^2]$.
\end{theorem}

5. Double Robustness Proofs

5. 双重稳健性证明

Goal: Show estimator is consistent if either nuisance model is correctly specified.
Standard Structure:
  1. Write estimating equation with both nuisance functions
  2. Show bias term is product of two errors
  3. Conclude: if either error is zero, estimator is consistent
Template:
latex
\begin{theorem}[Double Robustness]
The estimator $\hat{\psi}_{DR}$ is consistent if either:
\begin{enumerate}
\item The outcome model $\mu(a,x) = E[Y \mid A=a, X=x]$ is correctly specified, or
\item The propensity score $\pi(x) = P(A=1 \mid X=x)$ is correctly specified.
\end{enumerate}
\end{theorem}

\begin{proof}
The estimating equation has the form:
\[
\psi - \hat{\psi}_{DR} = E\left[\frac{(A-\pi)(Y-\mu)}{\pi(1-\pi)}\right] + o_p(1)
\]
The bias term $(A-\pi)(Y-\mu)$ is zero in expectation if either:
\begin{itemize}
\item $E[A-\pi \mid X] = 0$ (propensity correctly specified), or
\item $E[Y-\mu \mid A, X] = 0$ (outcome correctly specified).
\end{itemize}
\end{proof}

目标:证明只要任一干扰模型设定正确,估计量就具有一致性。
标准结构:
  1. 写出包含两个干扰函数的估计方程
  2. 证明偏差项是两个误差的乘积
  3. 得出结论:若任一误差为零,估计量即一致
模板:
latex
\begin{theorem}[Double Robustness]
The estimator $\hat{\psi}_{DR}$ is consistent if either:
\begin{enumerate}
\item The outcome model $\mu(a,x) = E[Y \mid A=a, X=x]$ is correctly specified, or
\item The propensity score $\pi(x) = P(A=1 \mid X=x)$ is correctly specified.
\end{enumerate}
\end{theorem}

\begin{proof}
The estimating equation has the form:
\[
\psi - \hat{\psi}_{DR} = E\left[\frac{(A-\pi)(Y-\mu)}{\pi(1-\pi)}\right] + o_p(1)
\]
The bias term $(A-\pi)(Y-\mu)$ is zero in expectation if either:
\begin{itemize}
\item $E[A-\pi \mid X] = 0$ (propensity correctly specified), or
\item $E[Y-\mu \mid A, X] = 0$ (outcome correctly specified).
\end{itemize}
\end{proof}

Proof Verification Checklist

证明验证检查清单

Level 1: Structure Check

第一级:结构检查

  • Claim clearly stated with all conditions
  • All notation defined before use
  • Logical flow apparent (steps labeled)
  • Each step has explicit justification
  • Conclusion explicitly stated
  • 声明清晰且包含所有条件
  • 所有符号在使用前已定义
  • 逻辑流程明确(步骤已标记)
  • 每一步都有明确依据
  • 结论已明确陈述

Level 2: Step Validation

第二级:步骤验证

For each step, verify:
  • Mathematical operation is valid
  • Cited results apply (check conditions)
  • Inequalities have correct direction
  • Limits/integrals converge
  • Dimensions/types match
对每个步骤,验证:
  • 数学操作合法
  • 引用的定理适用(检查条件)
  • 不等式方向正确
  • 极限/积分收敛
  • 维度/类型匹配

Level 3: Edge Cases

第三级:边缘情况

  • Boundary cases handled (n=1, p=0, etc.)
  • Degenerate cases addressed
  • Assumptions actually used (not vacuous)
  • What happens at assumption boundaries?
  • 边界情况已处理(如n=1、p=0等)
  • 退化情况已说明
  • 假设确实被使用(非冗余)
  • 假设边界处的情况已考虑

Level 4: Consistency

第四级:一致性

  • Result matches intuition
  • Special cases recover known results
  • Numerical verification possible?
  • Consistent with simulation evidence?

  • 结果符合直觉
  • 特殊情况可还原已知结论
  • 可进行数值验证
  • 与模拟结果一致

Common Proof Errors

常见证明错误

Technical Errors

技术错误

ErrorExampleFix
Interchanging limits$\lim \sum \neq \sum \lim$Verify DCT/MCT conditions
Division by zero$1/\pi(x)$ when $\pi(x)=0$State positivity assumption
Incorrect conditioning$E[Y \mid A,X] \neq E[Y \mid X]$Check independence structure
Wrong norm$|f|2$ vs $|f|\infty$Verify which space
Missing measurabilityRandom variable not measurableState measurability
错误类型示例修正方法
极限交换顺序$\lim \sum \neq \sum \lim$验证控制收敛定理/单调收敛定理条件
除以零$1/\pi(x)$ 当 $\pi(x)=0$明确positivity假设
条件错误$E[Y \mid A,X] \neq E[Y \mid X]$检查独立结构
范数错误$|f|2$ vs $|f|\infty$确认使用的空间
可测性缺失随机变量不可测明确可测性假设

Logical Errors

逻辑错误

ErrorExampleFix
Circular reasoningUsing result to prove itselfCheck logical dependency
Unstated assumption"Clearly, X holds"Make all assumptions explicit
Incorrect quantifier$\exists$ vs $\forall$Be precise about scope
Missing caseNot handling $\theta = 0$Enumerate all cases
错误类型示例修正方法
循环论证用结论证明结论检查逻辑依赖关系
未声明假设"显然,X成立"明确所有假设
量词错误$\exists$ vs $\forall$精确表述范围
遗漏情况未处理$\theta = 0$枚举所有情况

Statistical Errors

统计错误

ErrorExampleFix
Confusing $\xrightarrow{p}$ and $\xrightarrow{d}$Different convergence modesState which mode
Ignoring dependenceApplying iid CLT to dependent dataCheck independence
Wrong varianceUsing population variance for sampleDistinguish estimator/parameter

错误类型示例修正方法
混淆收敛模式$\xrightarrow{p}$ 和 $\xrightarrow{d}$ 混淆明确收敛模式
忽略相关性对相依数据应用独立同分布中心极限定理检查独立性
方差错误用总体方差代替样本方差区分估计量与参数

Notation Standards (VanderWeele Convention)

符号标准(VanderWeele约定)

Causal Quantities

因果量

SymbolMeaning
$Y(a)$Potential outcome under treatment $a$
$Y(a,m)$Potential outcome under $A=a$, $M=m$
$M(a)$Potential mediator under treatment $a$
$NDE$Natural Direct Effect: $E[Y(1,M(0)) - Y(0,M(0))]$
$NIE$Natural Indirect Effect: $E[Y(1,M(1)) - Y(1,M(0))]$
$TE$Total Effect: $E[Y(1) - Y(0)] = NDE + NIE$
$P_M$Proportion Mediated: $NIE/TE$
符号含义
$Y(a)$处理a下的潜在结果
$Y(a,m)$处理A=a、中介M=m下的潜在结果
$M(a)$处理a下的潜在中介
$NDE$自然直接效应:$E[Y(1,M(0)) - Y(0,M(0))]$
$NIE$自然间接效应:$E[Y(1,M(1)) - Y(1,M(0))]$
$TE$总效应:$E[Y(1) - Y(0)] = NDE + NIE$
$P_M$中介比例:$NIE/TE$

Statistical Quantities

统计量

SymbolMeaning
$\theta_0$True parameter value
$\hat{\theta}_n$Estimator based on $n$ observations
$\phi(O)$Influence function
$\mathbb{P}_n$Empirical measure
$\mathbb{G}_n$Empirical process: $\sqrt{n}(\mathbb{P}_n - P)$
符号含义
$\theta_0$真实参数值
$\hat{\theta}_n$基于n个观测的估计量
$\phi(O)$影响函数
$\mathbb{P}_n$经验测度
$\mathbb{G}_n$经验过程:$\sqrt{n}(\mathbb{P}_n - P)$

Convergence

收敛性

SymbolMeaning
$\xrightarrow{p}$Convergence in probability
$\xrightarrow{d}$Convergence in distribution
$\xrightarrow{a.s.}$Almost sure convergence
$O_p(1)$Bounded in probability
$o_p(1)$Converges to zero in probability

符号含义
$\xrightarrow{p}$依概率收敛
$\xrightarrow{d}$依分布收敛
$\xrightarrow{a.s.}$几乎必然收敛
$O_p(1)$依概率有界
$o_p(1)$依概率收敛于0

Proof Construction Workflow

证明构建工作流

Step 1: Understand the Goal

步骤1:明确目标

  • What exactly needs to be proved?
  • What type of proof is this? (identification, consistency, etc.)
  • What are the key challenges?
  • 具体需要证明什么?
  • 这属于哪种类型的证明?(识别、一致性等)
  • 核心挑战是什么?

Step 2: Gather Tools

步骤2:准备工具

  • What theorems/lemmas are available?
  • What regularity conditions will be needed?
  • Are there similar proofs to reference?
  • 有哪些可用的定理/引理?
  • 需要哪些正则条件?
  • 有没有可参考的类似证明?

Step 3: Outline Structure

步骤3:构建大纲

  • Break into logical steps
  • Identify the key technical step
  • Plan how to handle edge cases
  • 拆分为逻辑步骤
  • 确定关键技术步骤
  • 规划边缘情况的处理

Step 4: Write First Draft

步骤4:撰写初稿

  • Fill in details for each step
  • Be explicit about every transition
  • Note where conditions are used
  • 填充每个步骤的细节
  • 明确每一步的推导依据
  • 标记假设的使用位置

Step 5: Verify

步骤5:验证

  • Run through verification checklist
  • Check each step independently
  • Test special cases
  • 对照检查清单逐一验证
  • 独立检查每个步骤
  • 测试特殊情况

Step 6: Polish

步骤6:优化

  • Improve notation consistency
  • Add intuitive explanations
  • Ensure assumptions are minimal

  • 提升符号一致性
  • 添加直观解释
  • 确保假设尽可能少

Integration with Other Skills

与其他方法的集成

This skill works with:
  • identification-theory - For causal identification proofs
  • asymptotic-theory - For inference proofs
  • methods-paper-writer - For presenting proofs in manuscripts
  • proof-verifier - For systematic verification

Version: 1.0 Created: 2025-12-08 Domain: Mathematical Statistics, Causal Inference
本方法可与以下方法结合使用:
  • identification-theory - 用于因果识别证明
  • asymptotic-theory - 用于推断证明
  • methods-paper-writer - 用于在论文中呈现证明
  • proof-verifier - 用于系统验证

Version: 1.0 Created: 2025-12-08 领域: 数理统计、因果推断

Key References

核心参考文献

  • van der Vaart
  • Lehmann
  • Casella
  • Bickel
  • Serfling
  • van der Vaart
  • Lehmann
  • Casella
  • Bickel
  • Serfling