geo-optimizer
GEO Optimizer
Generative Engine Optimization (GEO) for AI search visibility.
What is GEO?
GEO is NOT traditional SEO. AI search engines (ChatGPT, Perplexity, Claude, Gemini) work fundamentally differently:
- They fetch raw HTML - Many don't render JavaScript
- They have size limits - Crawlers abandon pages >1MB HTML
- They evaluate confidence - Hedged language ("maybe", "possibly") ranks 3x lower than confident assertions
- They need machine-readable metadata - AgentFacts/NANDA protocol for AI agent discovery
Quick Audit Workflow
1. Technical Visibility Check
```bash
# Run full GEO audit
python3 ${CLAUDE_SKILL_DIR}/scripts/audit-geo.py https://example.com

# Check HTML size and JS dependency
curl -sI https://example.com | grep -i content-length
```
2. Content Authority Analysis
```bash
# Check hedge density of your content
python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --url https://example.com/blog/post

# Or analyze text directly
python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --text "Your content here"
```
3. Agent Infrastructure Setup
```bash
# Generate AgentFacts schema for your domain
python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py \
  --domain example.com \
  --agent-name "Example Service" \
  --capabilities text,image \
  --output /.well-known/agent-facts
```
Core Metrics
Hedge Density Score
Measures uncertainty language in content. Target: <0.2%
Hedge words to avoid:
- maybe, possibly, perhaps, might, could be
- however, although, it seems, arguably
- some believe, it appears, potentially
Why it matters: Research shows confident citations rank 3x higher in AI responses. Every hedge word signals uncertainty to LLMs.
See: references/hedge-density.md
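As a rough illustration of the metric, the sketch below counts hedge phrases from the list above and divides by the total word count, expressed as a percentage. This definition is an assumption, as is the function name `hedge_density`; the actual logic of `check-hedge-density.py` is not shown in this document and may differ.

```python
import re

# Hedge phrases taken from the list above.
HEDGES = [
    "maybe", "possibly", "perhaps", "might", "could be",
    "however", "although", "it seems", "arguably",
    "some believe", "it appears", "potentially",
]

# Word boundaries keep "might" from matching inside "mighty".
_HEDGE_RE = re.compile(
    r"\b(?:" + "|".join(re.escape(h) for h in HEDGES) + r")\b",
    re.IGNORECASE,
)
_WORD_RE = re.compile(r"[A-Za-z0-9']+")


def hedge_density(text: str) -> float:
    """Return hedge-phrase occurrences per 100 words (assumed definition)."""
    words = _WORD_RE.findall(text)
    if not words:
        return 0.0
    hits = len(_HEDGE_RE.findall(text))
    return 100.0 * hits / len(words)
```

Under this definition, a 1,000-word article would need at most one hedge phrase to stay below the 0.2% target.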
HTML Size Budget
Maximum: 1MB raw HTML. Above this threshold:
- 18% of pages abandoned by crawlers
- Content truncation risks
- Slower indexing
Check with: `curl -sI URL | grep -i content-length`
See: references/technical-visibility.md
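The `Content-Length` header can be missing (for example with chunked transfer encoding), so measuring the downloaded body is a useful cross-check. The following is a minimal sketch using only the Python standard library; the function names, the `geo-size-check/0.1` User-Agent string, and the reading of "1MB" as 1,000,000 bytes are assumptions for illustration, not part of `audit-geo.py`.

```python
import urllib.request

LIMIT_BYTES = 1_000_000  # 1MB budget from this section (decimal MB assumed)


def size_report(body: bytes, limit: int = LIMIT_BYTES) -> dict:
    """Summarize a raw HTML payload against the size budget."""
    size = len(body)
    return {
        "bytes": size,
        "over_budget": size > limit,
        "percent_of_budget": round(100.0 * size / limit, 1),
    }


def fetch_html(url: str, timeout: float = 10.0) -> bytes:
    """Download the raw, un-rendered HTML body, as a non-JS crawler would."""
    req = urllib.request.Request(
        url, headers={"User-Agent": "geo-size-check/0.1"}  # hypothetical UA
    )
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read()
```

Calling `size_report(fetch_html("https://example.com"))` returns the byte count and whether the page exceeds the budget.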
JS Dependency Risk
AI crawlers vary in JavaScript rendering:
- Rendering crawlers: GPTBot, PerplexityBot (sometimes)
- Non-rendering: ClaudeBot, many others
If your content requires JS to display, you're invisible to ~40% of AI crawlers.
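One way to approximate what a non-rendering crawler sees is to extract the text from the raw HTML while ignoring `<script>` and `<style>` contents, then check whether key phrases survive. The sketch below does this with the standard-library `html.parser`; the helper names `visible_text` and `missing_without_js` are hypothetical, and real crawlers may extract text differently.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collect text that appears outside <script> and <style> elements."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def visible_text(raw_html: str) -> str:
    """Return the text a crawler could read without executing JavaScript."""
    parser = _TextExtractor()
    parser.feed(raw_html)
    parser.close()
    return " ".join(parser.chunks)


def missing_without_js(raw_html: str, key_phrases: list[str]) -> list[str]:
    """Return the key phrases that do not appear in the un-rendered text."""
    text = visible_text(raw_html).lower()
    return [p for p in key_phrases if p.lower() not in text]
```

Any phrase returned by `missing_without_js` is content that likely depends on client-side rendering and is a candidate for server-side rendering or static HTML.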
Discovery Gap
Startups face a "recency wall": a roughly 30:1 visibility disadvantage versus established domains (~3.3% vs ~99%):
| Site Age | Visibility | Strategy |
|---|---|---|
| <2 years | ~3.3% | Web-augmented: Reddit, referring domains |
| 2+ years | ~99% | GEO content: entity density, hedge reduction |
See: references/discovery-strategies.md
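The table above can be read as a simple decision rule keyed on site age. The sketch below encodes it directly and shows where the 30:1 figure comes from; the function name `recommend_strategy` is hypothetical, and treating the 2-year mark as a hard cutoff is a simplifying assumption.

```python
def recommend_strategy(site_age_years: float) -> dict:
    """Map site age to the matching row of the Discovery Gap table."""
    if site_age_years < 0:
        raise ValueError("site age cannot be negative")
    if site_age_years < 2:
        return {
            "visibility": "~3.3%",
            "strategy": "Web-augmented: Reddit, referring domains",
        }
    return {
        "visibility": "~99%",
        "strategy": "GEO content: entity density, hedge reduction",
    }


# The 30:1 disadvantage is the ratio of the two visibility figures.
VISIBILITY_RATIO = round(99 / 3.3)
```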
AgentFacts Setup
The NANDA protocol provides machine-readable metadata for AI agent discovery. Place the following at `/.well-known/agent-facts`:
```json
{
  "@context": "https://nanda.dev/ns/agent-facts/v1",
  "id": "nanda:example.com",
  "agent_name": "urn:agent:example:com",
  "endpoints": {
    "static": ["https://api.example.com/v1/agent"]
  },
  "capabilities": {
    "modalities": ["text"],
    "authentication": {
      "methods": ["oauth2", "jwt"]
    }
  },
  "trust": {
    "certification": "self-attested",
    "human_oversight": "true"
  }
}
```
See: references/agentfacts-schema.md
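Before publishing the file, a quick shape check can catch obvious mistakes. The sketch below only verifies that the top-level keys from the example above are present and that static endpoints use HTTPS. Treating those keys as required and the HTTPS rule are both assumptions; the authoritative rules are whatever references/agentfacts-schema.md specifies, and `check_agentfacts` is a hypothetical helper, not one of the bundled scripts.

```python
import json

# Keys taken from the example above; treating them as required is an assumption.
EXPECTED_KEYS = ("@context", "id", "agent_name", "endpoints", "capabilities", "trust")


def check_agentfacts(document: str) -> list[str]:
    """Return a list of problems; an empty list means the basic shape matches."""
    try:
        data = json.loads(document)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level value must be a JSON object"]

    problems = [f"missing key: {k}" for k in EXPECTED_KEYS if k not in data]

    # Assumed rule: static endpoints should be HTTPS URLs.
    endpoints = data.get("endpoints")
    if isinstance(endpoints, dict):
        static = endpoints.get("static", [])
        if isinstance(static, list):
            for url in static:
                if not str(url).startswith("https://"):
                    problems.append(f"non-HTTPS endpoint: {url}")
    return problems
```

Running it on the contents of the file, for example `check_agentfacts(open("agent-facts").read())`, returns an empty list when the basic shape matches.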
Audit Report Sections
A complete GEO audit covers:
- Technical Visibility
  - HTML payload size (target: <1MB)
  - JS dependency assessment
  - Crawler accessibility
- Content Authority
  - Hedge density score (target: <0.2%)
  - Confidence assessment
  - Entity density
- Discovery Strategy
  - Site age analysis
  - Recency wall detection
  - Recommended approach
- Agent Infrastructure
  - AgentFacts presence
  - Schema validation
  - NANDA compliance
Scripts
| Script | Purpose |
|---|---|
| `scripts/audit-geo.py` | Full GEO audit for a URL |
| `scripts/check-hedge-density.py` | Analyze content confidence |
| `scripts/generate-agentfacts.py` | Create NANDA-compliant schema |
Reference Documentation
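| File | Contents |
|---|---|
| `references/hedge-density.md` | Confidence scoring methodology |
| `references/agentfacts-schema.md` | NANDA protocol specification |
| `references/technical-visibility.md` | Crawler behavior & limits |
| `references/discovery-strategies.md` | Startup vs incumbent approaches |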
Example Commands
```bash
# Full audit with report
python3 ${CLAUDE_SKILL_DIR}/scripts/audit-geo.py https://mysite.com --output report.md

# Quick hedge check
python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --url https://mysite.com/about

# Generate AgentFacts for SaaS
python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py \
  --domain myapp.com \
  --agent-name "MyApp AI" \
  --capabilities text \
  --auth oauth2,apikey
```
Key Takeaways
- GEO != SEO - Different crawlers, different rules
- Confidence wins - Hedge density <0.2% is the target
- Size matters - Stay under 1MB HTML
- JS is risky - Content must work without rendering
- Age affects strategy - Startups need web-augmented signals
- AgentFacts is future-proofing - Implement NANDA now