proof-architect

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Proof Architect

证明架构师

Structured methodology for constructing and verifying mathematical proofs in statistical research

Use this skill when working on: mathematical proofs, theorem development, derivations, consistency proofs, asymptotic arguments, identification proofs, or verifying proof correctness.

统计研究中构建与验证数学证明的结构化方法

在以下场景中使用本方法：数学证明、定理推导、公式推演、一致性证明、渐近分析、识别证明或验证证明正确性。

Proof Structure Framework

证明结构框架

Standard Proof Components

标准证明组件

Every rigorous statistical proof should contain:

Claim Statement - Precise mathematical statement of what is being proved
Assumptions - All conditions required (clearly enumerated A1, A2, ...)
Notation - Define all symbols before use
Proof Body - Logical sequence of justified steps
Conclusion - Explicit statement that claim is established

严谨的统计证明应包含以下部分：

声明陈述 - 明确表述待证明的数学命题
假设条件 - 清晰列举所有所需条件（标记为A1、A2……）
符号定义 - 在使用前定义所有符号
证明主体 - 逻辑连贯且每步有依据的推导过程
结论 - 明确声明命题已得证

Proof Skeleton Template

证明框架模板

latex

\begin{theorem}[Name]
\label{thm:name}
Under Assumptions \ref{A1}--\ref{An}, [precise claim].
\end{theorem}

\begin{proof}
The proof proceeds in [n] steps.

\textbf{Step 1: [Description]}
[Content with justification for each transition]

\textbf{Step 2: [Description]}
[Content]

\vdots

\textbf{Step n: Conclusion}
Combining Steps 1--[n-1], we obtain [result], completing the proof.
\end{proof}

latex

\begin{theorem}[Name]
\label{thm:name}
Under Assumptions \ref{A1}--\ref{An}, [precise claim].
\end{theorem}

\begin{proof}
The proof proceeds in [n] steps.

\textbf{Step 1: [Description]}
[Content with justification for each transition]

\textbf{Step 2: [Description]}
[Content]

\vdots

\textbf{Step n: Conclusion}
Combining Steps 1--[n-1], we obtain [result], completing the proof.
\end{proof}

Proof Types in Statistical Methodology

统计方法论中的证明类型

1. Identification Proofs

1. 识别证明

Goal: Show that a causal/statistical quantity is uniquely determined from observed data distribution.

Standard Structure:

Define target estimand (e.g., $\psi = E[Y(a)]$)
State identifying assumptions (consistency, positivity, exchangeability)
Apply identification formula derivation
Show formula depends only on observable quantities

Template:

latex

\begin{theorem}[Identification of $\psi$]
Under Assumptions \ref{A:consistency}--\ref{A:positivity}, the causal effect
$\psi = E[Y(a)]$ is identified by
\[
\psi = \int E[Y \mid A=a, X=x] \, dP(x).
\]
\end{theorem}

\begin{proof}
\begin{align}
E[Y(a)] &= E[E[Y(a) \mid X]] && \text{(law of iterated expectations)} \\
        &= E[E[Y(a) \mid A=a, X]] && \text{(A\ref{A:exchangeability}: $Y(a) \indep A \mid X$)} \\
        &= E[E[Y \mid A=a, X]] && \text{(A\ref{A:consistency}: $Y = Y(A)$)} \\
        &= \int E[Y \mid A=a, X=x] \, dP(x) && \text{(definition)}
\end{align}
which depends only on the observed data distribution.
\end{proof}

目标：证明因果/统计量可由观测数据分布唯一确定。

标准结构:

定义目标估计量（如：$\psi = E[Y(a)]$）
陈述识别假设（一致性、 positivity、可交换性）
推导识别公式
证明公式仅依赖于可观测变量

模板:

latex

\begin{theorem}[Identification of $\psi$]
Under Assumptions \ref{A:consistency}--\ref{A:positivity}, the causal effect
$\psi = E[Y(a)]$ is identified by
\[
\psi = \int E[Y \mid A=a, X=x] \, dP(x).
\]
\end{theorem}

\begin{proof}
\begin{align}
E[Y(a)] &= E[E[Y(a) \mid X]] && \text{(law of iterated expectations)} \\
        &= E[E[Y(a) \mid A=a, X]] && \text{(A\ref{A:exchangeability}: $Y(a) \indep A \mid X$)} \\
        &= E[E[Y \mid A=a, X]] && \text{(A\ref{A:consistency}: $Y = Y(A)$)} \\
        &= \int E[Y \mid A=a, X=x] \, dP(x) && \text{(definition)}
\end{align}
which depends only on the observed data distribution.
\end{proof}

2. Consistency Proofs

2. 一致性证明

Goal: Show that an estimator converges to the true parameter value.

Standard Structure:

Define estimator $\hat{\theta}_n$
Define target parameter $\theta_0$
Establish convergence: $\hat{\theta}_n \xrightarrow{p} \theta_0$

Key Tools:

Law of Large Numbers (LLN)
Continuous Mapping Theorem
Slutsky's Theorem
M-estimation theory

Template:

latex

\begin{theorem}[Consistency]
Under Assumptions \ref{A1}--\ref{An}, $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{theorem}

\begin{proof}
Define $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$ and
$M(\theta) = E[m(O; \theta)]$.

\textbf{Step 1: Uniform convergence}
By [ULLN conditions], $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$.

\textbf{Step 2: Unique maximum}
$M(\theta)$ is uniquely maximized at $\theta_0$ (by identifiability).

\textbf{Step 3: Conclusion}
By standard M-estimation theory, Steps 1--2 imply $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{proof}

目标：证明估计量收敛于真实参数值。

标准结构:

定义估计量 $\hat{\theta}_n$
定义目标参数 $\theta_0$
证明收敛性：$\hat{\theta}_n \xrightarrow{p} \theta_0$

核心工具:

大数定律（LLN）
连续映射定理
Slutsky定理
M估计理论

模板:

latex

\begin{theorem}[Consistency]
Under Assumptions \ref{A1}--\ref{An}, $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{theorem}

\begin{proof}
Define $M_n(\theta) = n^{-1} \sum_{i=1}^n m(O_i; \theta)$ and
$M(\theta) = E[m(O; \theta)]$.

\textbf{Step 1: Uniform convergence}
By [ULLN conditions], $\sup_{\theta \in \Theta} |M_n(\theta) - M(\theta)| \xrightarrow{p} 0$.

\textbf{Step 2: Unique maximum}
$M(\theta)$ is uniquely maximized at $\theta_0$ (by identifiability).

\textbf{Step 3: Conclusion}
By standard M-estimation theory, Steps 1--2 imply $\hat{\theta}_n \xrightarrow{p} \theta_0$.
\end{proof}

3. Asymptotic Normality Proofs

3. 渐近正态性证明

Goal: Establish $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$.

Standard Structure:

Taylor expansion around true value
Apply CLT to score/influence function
Invert Hessian/information matrix
State limiting distribution

Key Tools:

Central Limit Theorem (CLT)
Delta Method
Influence Function Theory
Semiparametric Efficiency Theory

Template:

latex

\begin{theorem}[Asymptotic Normality]
Under Assumptions \ref{A1}--\ref{An},
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)
\]
where $V = E[\phi(O)\phi(O)^\top]$ and $\phi$ is the influence function.
\end{theorem}

\begin{proof}
\textbf{Step 1: Score equation}
$\hat{\theta}_n$ solves $\mathbb{P}_n[\psi(O; \theta)] = 0$ where $\psi = \partial_\theta m$.

\textbf{Step 2: Taylor expansion}
\[
0 = \mathbb{P}_n[\psi(O; \hat{\theta}_n)] = \mathbb{P}_n[\psi(O; \theta_0)]
    + \mathbb{P}_n[\dot{\psi}(O; \tilde{\theta})](\hat{\theta}_n - \theta_0)
\]

\textbf{Step 3: Rearrangement}
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) = -\left(\mathbb{P}_n[\dot{\psi}]\right)^{-1}
    \sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)]
\]

\textbf{Step 4: Apply CLT}
$\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)] \xrightarrow{d} N(0, \text{Var}(\psi))$ by CLT.

\textbf{Step 5: Slutsky}
$\mathbb{P}_n[\dot{\psi}] \xrightarrow{p} E[\dot{\psi}]$ by WLLN. Apply Slutsky's theorem.
\end{proof}

目标：证明 $\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)$。

标准结构:

在真实值附近进行泰勒展开
对得分/影响函数应用中心极限定理
反转海森矩阵/信息矩阵
陈述极限分布

核心工具:

中心极限定理（CLT）
Delta方法
影响函数理论
半参数效率理论

模板:

latex

\begin{theorem}[Asymptotic Normality]
Under Assumptions \ref{A1}--\ref{An},
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) \xrightarrow{d} N(0, V)
\]
where $V = E[\phi(O)\phi(O)^\top]$ and $\phi$ is the influence function.
\end{theorem}

\begin{proof}
\textbf{Step 1: Score equation}
$\hat{\theta}_n$ solves $\mathbb{P}_n[\psi(O; \theta)] = 0$ where $\psi = \partial_\theta m$.

\textbf{Step 2: Taylor expansion}
\[
0 = \mathbb{P}_n[\psi(O; \hat{\theta}_n)] = \mathbb{P}_n[\psi(O; \theta_0)]
    + \mathbb{P}_n[\dot{\psi}(O; \tilde{\theta})](\hat{\theta}_n - \theta_0)
\]

\textbf{Step 3: Rearrangement}
\[
\sqrt{n}(\hat{\theta}_n - \theta_0) = -\left(\mathbb{P}_n[\dot{\psi}]\right)^{-1}
    \sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)]
\]

\textbf{Step 4: Apply CLT}
$\sqrt{n} \mathbb{P}_n[\psi(O; \theta_0)] \xrightarrow{d} N(0, \text{Var}(\psi))$ by CLT.

\textbf{Step 5: Slutsky}
$\mathbb{P}_n[\dot{\psi}] \xrightarrow{p} E[\dot{\psi}]$ by WLLN. Apply Slutsky's theorem.
\end{proof}

4. Efficiency Proofs

4. 效率证明

Goal: Show estimator achieves semiparametric efficiency bound.

Standard Structure:

Characterize the tangent space
Derive efficient influence function (EIF)
Show estimator's influence function equals EIF
Conclude variance achieves bound

Template:

latex

\begin{theorem}[Semiparametric Efficiency]
$\hat{\theta}_n$ is semiparametrically efficient with influence function
\[
\phi(O) = [optimal formula]
\]
achieving the efficiency bound $V_{\text{eff}} = E[\phi(O)^2]$.
\end{theorem}

目标：证明估计量达到半参数效率边界。

标准结构:

刻画切空间
推导有效影响函数（EIF）
证明估计量的影响函数等于EIF
得出方差达到边界的结论

模板:

latex

\begin{theorem}[Semiparametric Efficiency]
$\hat{\theta}_n$ is semiparametrically efficient with influence function
\[
\phi(O) = [optimal formula]
\]
achieving the efficiency bound $V_{\text{eff}} = E[\phi(O)^2]$.
\end{theorem}

5. Double Robustness Proofs

5. 双重稳健性证明

Goal: Show estimator is consistent if either nuisance model is correctly specified.

Standard Structure:

Write estimating equation with both nuisance functions
Show bias term is product of two errors
Conclude: if either error is zero, estimator is consistent

Template:

latex

\begin{theorem}[Double Robustness]
The estimator $\hat{\psi}_{DR}$ is consistent if either:
\begin{enumerate}
\item The outcome model $\mu(a,x) = E[Y \mid A=a, X=x]$ is correctly specified, or
\item The propensity score $\pi(x) = P(A=1 \mid X=x)$ is correctly specified.
\end{enumerate}
\end{theorem}

\begin{proof}
The estimating equation has the form:
\[
\psi - \hat{\psi}_{DR} = E\left[\frac{(A-\pi)(Y-\mu)}{\pi(1-\pi)}\right] + o_p(1)
\]
The bias term $(A-\pi)(Y-\mu)$ is zero in expectation if either:
\begin{itemize}
\item $E[A-\pi \mid X] = 0$ (propensity correctly specified), or
\item $E[Y-\mu \mid A, X] = 0$ (outcome correctly specified).
\end{itemize}
\end{proof}

目标：证明只要任一干扰模型设定正确，估计量就具有一致性。

标准结构:

写出包含两个干扰函数的估计方程
证明偏差项是两个误差的乘积
得出结论：若任一误差为零，估计量即一致

模板:

latex

\begin{theorem}[Double Robustness]
The estimator $\hat{\psi}_{DR}$ is consistent if either:
\begin{enumerate}
\item The outcome model $\mu(a,x) = E[Y \mid A=a, X=x]$ is correctly specified, or
\item The propensity score $\pi(x) = P(A=1 \mid X=x)$ is correctly specified.
\end{enumerate}
\end{theorem}

\begin{proof}
The estimating equation has the form:
\[
\psi - \hat{\psi}_{DR} = E\left[\frac{(A-\pi)(Y-\mu)}{\pi(1-\pi)}\right] + o_p(1)
\]
The bias term $(A-\pi)(Y-\mu)$ is zero in expectation if either:
\begin{itemize}
\item $E[A-\pi \mid X] = 0$ (propensity correctly specified), or
\item $E[Y-\mu \mid A, X] = 0$ (outcome correctly specified).
\end{itemize}
\end{proof}

Proof Verification Checklist

证明验证检查清单

Level 1: Structure Check

第一级：结构检查

Level 2: Step Validation

第二级：步骤验证

Level 3: Edge Cases

第三级：边缘情况

Boundary cases handled (n=1, p=0, etc.)
Degenerate cases addressed
Assumptions actually used (not vacuous)
What happens at assumption boundaries?

边界情况已处理（如n=1、p=0等）
退化情况已说明
假设确实被使用（非冗余）
假设边界处的情况已考虑

Level 4: Consistency

第四级：一致性

Result matches intuition
Special cases recover known results
Numerical verification possible?
Consistent with simulation evidence?

结果符合直觉
特殊情况可还原已知结论
可进行数值验证
与模拟结果一致

Common Proof Errors

常见证明错误

Technical Errors

技术错误

Error	Example	Fix
Interchanging limits	$\lim \sum \neq \sum \lim$	Verify DCT/MCT conditions
Division by zero	$1/\pi(x)$ when $\pi(x)=0$	State positivity assumption
Incorrect conditioning	$E[Y \mid A,X] \neq E[Y \mid X]$	Check independence structure
Wrong norm	$\|f\|2$ vs $\|f\|\infty$	Verify which space
Missing measurability	Random variable not measurable	State measurability

错误类型	示例	修正方法
极限交换顺序	$\lim \sum \neq \sum \lim$	验证控制收敛定理/单调收敛定理条件
除以零	$1/\pi(x)$ 当 $\pi(x)=0$	明确positivity假设
条件错误	$E[Y \mid A,X] \neq E[Y \mid X]$	检查独立结构
范数错误	$\|f\|2$ vs $\|f\|\infty$	确认使用的空间
可测性缺失	随机变量不可测	明确可测性假设

Logical Errors

逻辑错误

Error	Example	Fix
Circular reasoning	Using result to prove itself	Check logical dependency
Unstated assumption	"Clearly, X holds"	Make all assumptions explicit
Incorrect quantifier	$\exists$ vs $\forall$	Be precise about scope
Missing case	Not handling $\theta = 0$	Enumerate all cases

错误类型	示例	修正方法
循环论证	用结论证明结论	检查逻辑依赖关系
未声明假设	"显然，X成立"	明确所有假设
量词错误	$\exists$ vs $\forall$	精确表述范围
遗漏情况	未处理$\theta = 0$	枚举所有情况

Statistical Errors

统计错误

Error	Example	Fix
Confusing $\xrightarrow{p}$ and $\xrightarrow{d}$	Different convergence modes	State which mode
Ignoring dependence	Applying iid CLT to dependent data	Check independence
Wrong variance	Using population variance for sample	Distinguish estimator/parameter

错误类型	示例	修正方法
混淆收敛模式	$\xrightarrow{p}$ 和 $\xrightarrow{d}$ 混淆	明确收敛模式
忽略相关性	对相依数据应用独立同分布中心极限定理	检查独立性
方差错误	用总体方差代替样本方差	区分估计量与参数

Notation Standards (VanderWeele Convention)

符号标准（VanderWeele约定）

Causal Quantities

因果量

Symbol	Meaning
$Y(a)$	Potential outcome under treatment $a$
$Y(a,m)$	Potential outcome under $A=a$, $M=m$
$M(a)$	Potential mediator under treatment $a$
$NDE$	Natural Direct Effect: $E[Y(1,M(0)) - Y(0,M(0))]$
$NIE$	Natural Indirect Effect: $E[Y(1,M(1)) - Y(1,M(0))]$
$TE$	Total Effect: $E[Y(1) - Y(0)] = NDE + NIE$
$P_M$	Proportion Mediated: $NIE/TE$

符号	含义
$Y(a)$	处理a下的潜在结果
$Y(a,m)$	处理A=a、中介M=m下的潜在结果
$M(a)$	处理a下的潜在中介
$NDE$	自然直接效应：$E[Y(1,M(0)) - Y(0,M(0))]$
$NIE$	自然间接效应：$E[Y(1,M(1)) - Y(1,M(0))]$
$TE$	总效应：$E[Y(1) - Y(0)] = NDE + NIE$
$P_M$	中介比例：$NIE/TE$

Statistical Quantities

统计量

Symbol	Meaning
$\theta_0$	True parameter value
$\hat{\theta}_n$	Estimator based on $n$ observations
$\phi(O)$	Influence function
$\mathbb{P}_n$	Empirical measure
$\mathbb{G}_n$	Empirical process: $\sqrt{n}(\mathbb{P}_n - P)$

符号	含义
$\theta_0$	真实参数值
$\hat{\theta}_n$	基于n个观测的估计量
$\phi(O)$	影响函数
$\mathbb{P}_n$	经验测度
$\mathbb{G}_n$	经验过程：$\sqrt{n}(\mathbb{P}_n - P)$

Convergence

收敛性

Symbol	Meaning
$\xrightarrow{p}$	Convergence in probability
$\xrightarrow{d}$	Convergence in distribution
$\xrightarrow{a.s.}$	Almost sure convergence
$O_p(1)$	Bounded in probability
$o_p(1)$	Converges to zero in probability

符号	含义
$\xrightarrow{p}$	依概率收敛
$\xrightarrow{d}$	依分布收敛
$\xrightarrow{a.s.}$	几乎必然收敛
$O_p(1)$	依概率有界
$o_p(1)$	依概率收敛于0

Proof Construction Workflow

证明构建工作流

Step 1: Understand the Goal

步骤1：明确目标

What exactly needs to be proved?
What type of proof is this? (identification, consistency, etc.)
What are the key challenges?

具体需要证明什么？
这属于哪种类型的证明？（识别、一致性等）
核心挑战是什么？

Step 2: Gather Tools

步骤2：准备工具

What theorems/lemmas are available?
What regularity conditions will be needed?
Are there similar proofs to reference?

有哪些可用的定理/引理？
需要哪些正则条件？
有没有可参考的类似证明？

Step 3: Outline Structure

步骤3：构建大纲

Break into logical steps
Identify the key technical step
Plan how to handle edge cases

拆分为逻辑步骤
确定关键技术步骤
规划边缘情况的处理

Step 4: Write First Draft

步骤4：撰写初稿

Fill in details for each step
Be explicit about every transition
Note where conditions are used

填充每个步骤的细节
明确每一步的推导依据
标记假设的使用位置

Step 5: Verify

步骤5：验证

Run through verification checklist
Check each step independently
Test special cases

对照检查清单逐一验证
独立检查每个步骤
测试特殊情况

Step 6: Polish

步骤6：优化

Improve notation consistency
Add intuitive explanations
Ensure assumptions are minimal

提升符号一致性
添加直观解释
确保假设尽可能少

Integration with Other Skills

与其他方法的集成

This skill works with:

identification-theory - For causal identification proofs
asymptotic-theory - For inference proofs
methods-paper-writer - For presenting proofs in manuscripts
proof-verifier - For systematic verification

Version: 1.0 Created: 2025-12-08 Domain: Mathematical Statistics, Causal Inference

本方法可与以下方法结合使用：

identification-theory - 用于因果识别证明
asymptotic-theory - 用于推断证明
methods-paper-writer - 用于在论文中呈现证明
proof-verifier - 用于系统验证

Version: 1.0 Created: 2025-12-08 领域: 数理统计、因果推断

Key References

核心参考文献

van der Vaart
Lehmann
Casella
Bickel
Serfling

van der Vaart
Lehmann
Casella
Bickel
Serfling