smart-sourcing

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Smart Sourcing

智能溯源

Intelligent sourcing that requires citations only when the cost is justified by the value of verification.

智能溯源仅在验证价值能覆盖成本时才要求提供引用。

Philosophy

核心理念

Full sourcing is prohibitively expensive (10-16x token increase). Smart sourcing targets high-value claims where verification materially improves accuracy.

完全溯源成本极高（Token使用量增加10-16倍）。智能溯源针对那些验证能显著提升准确性的高价值声明。

When to Source

溯源场景判定

REQUIRE Sources

需要溯源的场景

Claim Type	Example	Why Source
Version numbers	"Python 3.12 added..."	Versions change, easy to verify
Performance claims	"30% faster than..."	Quantitative claims need evidence
Security recommendations	"Use bcrypt for..."	Security advice must be current
API specifications	"The function accepts..."	APIs change between versions
Release dates	"Released in Q4 2025"	Factual, verifiable
Pricing/limits	"Free tier allows 1000 requests"	Business terms change
Deprecated features	"X was removed in version Y"	Breaking changes need verification

声明类型	示例	为何需要溯源
版本号	"Python 3.12 added..."	版本会更新，易于验证
性能声明	"30% faster than..."	量化声明需要证据支持
安全建议	"Use bcrypt for..."	安全建议必须保持时效性
API规范	"The function accepts..."	API会随版本迭代变更
发布日期	"Released in Q4 2025"	事实性内容，可验证
定价/限制	"Free tier allows 1000 requests"	商业条款可能变更
废弃特性	"X was removed in version Y"	破坏性变更需要验证

DO NOT Require Sources

无需溯源的场景

Claim Type	Example	Why No Source
General concepts	"Async improves concurrency"	Foundational knowledge
Code examples	Demonstrative snippets	Illustrative, not factual claims
Opinion/preference	"Consider using..."	Clearly framed as suggestion
Common knowledge	"Git tracks changes"	Universal understanding
Logical derivations	"Therefore, X implies Y"	Reasoning, not factual claim
Syntax	"Use `def` for functions"	Language documentation is stable

声明类型	示例	为何无需溯源
通用概念	"Async improves concurrency"	基础性知识
代码示例	演示代码片段	仅作说明用途，非事实声明
观点/偏好	"Consider using..."	明确为建议性质
常识内容	"Git tracks changes"	普遍认知内容
逻辑推导	"Therefore, X implies Y"	推理过程，非事实声明
语法规则	"Use `def` for functions"	语言文档相对稳定

Decision Framework

决策框架

Is this a factual claim?
├── No → No source needed
└── Yes → Would being wrong cause harm?
    ├── No (minor inconvenience) → No source needed
    └── Yes → Is verification cost-effective?
        ├── No (>5k tokens for low-value claim) → State uncertainty
        └── Yes → Perform web search, cite source

Is this a factual claim?
├── No → No source needed
└── Yes → Would being wrong cause harm?
    ├── No (minor inconvenience) → No source needed
    └── Yes → Is verification cost-effective?
        ├── No (>5k tokens for low-value claim) → State uncertainty
        └── Yes → Perform web search, cite source

Implementation

实现方式

Inline Citations

内联引用

When sourcing is warranted:

markdown

Python 3.12 introduced `type` statement for type aliases[1].

Sources:
1. [PEP 695](https://peps.python.org/pep-0695/)

当需要溯源时：

markdown

Python 3.12 introduced `type` statement for type aliases[1].

Sources:
1. [PEP 695](https://peps.python.org/pep-0695/)

Uncertainty Markers

不确定性标记

When sourcing isn't cost-effective but uncertainty exists:

markdown

Python 3.12 introduced the `type` statement (verify in release notes for your version).

当溯源成本过高但存在不确定性时：

markdown

Python 3.12 introduced the `type` statement (verify in release notes for your version).

Confidence Levels

置信度等级

Confidence	Action
High (stable facts)	State directly
Medium (may have changed)	Add verification note
Low (uncertain)	Source or acknowledge uncertainty

置信度	操作
高（稳定事实）	直接陈述
中（可能已变更）	添加验证提示
低（不确定）	溯源或说明不确定性

Token Cost Analysis

Token成本分析

Approach	Token Cost	Use Case
No sourcing	Baseline	General knowledge
Uncertainty marker	+5-10 tokens	Medium confidence
Single source	+2-5k tokens	High-value claim
Multiple sources	+10-20k tokens	Research tasks

方案	Token成本	使用场景
不溯源	基准值	通用知识
不确定性标记	+5-10 tokens	中等置信度场景
单一引用	+2-5k tokens	高价值声明
多引用	+10-20k tokens	研究任务

Examples

示例

Good: Source Version-Specific Claim

良好实践：版本相关声明溯源

markdown

User: What's new in FastAPI 0.109?

Response: FastAPI 0.109 added Pydantic v2 support as default[1] and
improved OpenAPI schema generation[2].

Sources:
1. [FastAPI Release Notes](https://fastapi.tiangolo.com/release-notes/)
2. [GitHub Release 0.109.0](https://github.com/tiangolo/fastapi/releases/tag/0.109.0)

markdown

User: What's new in FastAPI 0.109?

Response: FastAPI 0.109 added Pydantic v2 support as default[1] and
improved OpenAPI schema generation[2].

Sources:
1. [FastAPI Release Notes](https://fastapi.tiangolo.com/release-notes/)
2. [GitHub Release 0.109.0](https://github.com/tiangolo/fastapi/releases/tag/0.109.0)

Good: No Source for General Concept

良好实践：通用概念无需溯源

markdown

User: How does async/await work?

Response: Async/await enables non-blocking I/O by allowing coroutines
to yield control while waiting for operations. The event loop manages
execution, resuming coroutines when their awaited operations complete.

markdown

User: How does async/await work?

Response: Async/await enables non-blocking I/O by allowing coroutines
to yield control while waiting for operations. The event loop manages
execution, resuming coroutines when their awaited operations complete.

Good: Uncertainty Marker

良好实践：添加不确定性标记

markdown

The connection pool default is typically 10 connections (verify in your
database driver documentation as this varies by library and version).

markdown

The connection pool default is typically 10 connections (verify in your
database driver documentation as this varies by library and version).

Integration

集成兼容

Works with:

```
conserve:response-compression
```
- Keeps sourced responses concise
```
conserve:token-conservation
```
- Weighs source cost vs value
```
memory-palace:research
```
- Full sourcing for knowledge corpus

可与以下组件配合使用：

```
conserve:response-compression
```
- 保持溯源后的回复简洁
```
conserve:token-conservation
```
- 权衡溯源成本与价值
```
memory-palace:research
```
- 用于知识语料库的完全溯源

When To Use Full Sourcing

何时使用完全溯源

When NOT To Use

何时不使用

Internal project code that doesn't need citations
Casual conversation without factual claims

Escalate to full sourcing (accept high token cost) for:

Knowledge corpus entries (permanent documentation)
Security advisories (safety-critical)
Compliance/legal claims (audit requirements)
Research tasks (user expects thorough investigation)

For these cases, use

memory-palace:research

workflow which is designed for comprehensive sourcing.

无需引用的内部项目代码
无事实声明的随意对话

当以下情况时，升级为完全溯源（接受高Token成本）：

知识语料库条目（永久文档）
安全公告（安全关键场景）
合规/法律声明（审计要求）
研究任务（用户期望深入调查）

对于这些场景，请使用专为全面溯源设计的

memory-palace:research

工作流。