geo-optimizer

GEO Optimizer

Generative Engine Optimization (GEO) for AI search visibility.

What is GEO?

GEO is NOT traditional SEO. AI search engines (ChatGPT, Perplexity, Claude, Gemini) work fundamentally differently:
  1. They fetch raw HTML - Many don't render JavaScript
  2. They have size limits - Crawlers abandon pages >1MB HTML
  3. They evaluate confidence - Hedged language ("maybe", "possibly") ranks 3x lower than confident assertions
  4. They need machine-readable metadata - AgentFacts/NANDA protocol for AI agent discovery

Quick Audit Workflow

1. Technical Visibility Check

bash
# Run full GEO audit
python3 ${CLAUDE_SKILL_DIR}/scripts/audit-geo.py https://example.com

# Check HTML size and JS dependency
curl -sI https://example.com | grep -i content-length

2. Content Authority Analysis

bash
# Check hedge density of your content
python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --url https://example.com/blog/post

# Or analyze text directly
python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --text "Your content here"

3. Agent Infrastructure Setup

bash
# Generate AgentFacts schema for your domain
python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py \
  --domain example.com \
  --agent-name "Example Service" \
  --capabilities text,image \
  --output /.well-known/agent-facts

Core Metrics

Hedge Density Score

Measures the proportion of uncertainty language (hedge words) in content. Target: <0.2%
Hedge words to avoid:
  • maybe, possibly, perhaps, might, could be
  • however, although, it seems, arguably
  • some believe, it appears, potentially
Why it matters: Research shows confident citations rank 3x higher in AI responses. Every hedge word signals uncertainty to LLMs.
See: references/hedge-density.md
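
As a rough illustration of the metric, the sketch below counts hedge phrases as a percentage of total words. The formula (occurrences divided by word count) and the function name `hedge_density` are assumptions made for this example; the actual scoring in check-hedge-density.py may differ.

```python
import re

# Hedge phrases copied from the list in this section; a real checker
# may well use a longer or weighted list (assumption).
HEDGE_PHRASES = [
    "maybe", "possibly", "perhaps", "might", "could be",
    "however", "although", "it seems", "arguably",
    "some believe", "it appears", "potentially",
]


def hedge_density(text: str) -> float:
    """Return hedge-phrase occurrences as a percentage of total words."""
    words = re.findall(r"[A-Za-z']+", text.lower())
    if not words:
        return 0.0
    # Re-join the words so multi-word phrases match across punctuation-free text.
    normalized = " ".join(words)
    hits = sum(
        len(re.findall(r"\b" + re.escape(phrase) + r"\b", normalized))
        for phrase in HEDGE_PHRASES
    )
    return 100.0 * hits / len(words)
```

Under this definition, a 1,000-word article would need at most one hedge phrase to stay near the 0.2% target.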

HTML Size Budget

Maximum: 1MB raw HTML. Above this threshold:
  • 18% of pages abandoned by crawlers
  • Content truncation risks
  • Slower indexing
Check with:
curl -sI URL | grep -i content-length
See: references/technical-visibility.md
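
The Content-Length header can be missing (for example with chunked responses), so measuring the downloaded body is a useful cross-check. The following is a minimal sketch using only the Python standard library; the helper names, the User-Agent string, and the reading of "1MB" as 1,000,000 bytes are assumptions for illustration, not part of audit-geo.py.

```python
import urllib.request

# Assumption: "1MB" is read as 10**6 bytes; adjust if 1 MiB (1,048,576) is intended.
BUDGET_BYTES = 1_000_000


def within_budget(html: bytes, budget: int = BUDGET_BYTES) -> bool:
    """True if the raw HTML payload fits inside the size budget."""
    return len(html) <= budget


def fetch_raw_html(url: str, timeout: float = 10.0) -> bytes:
    """Download the page body as bytes, without executing any JavaScript."""
    request = urllib.request.Request(
        url, headers={"User-Agent": "geo-size-check/0.1"}  # hypothetical UA string
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

A typical call would be `within_budget(fetch_raw_html("https://example.com"))`.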

JS Dependency Risk

AI crawlers vary in JavaScript rendering:
  • Rendering crawlers: GPTBot, PerplexityBot (sometimes)
  • Non-rendering: ClaudeBot, many others
If your content requires JS to display, you're invisible to ~40% of AI crawlers.
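
One way to approximate what a non-rendering crawler sees is to count the words present in the raw HTML outside of script and style elements. The sketch below does this with the standard-library `html.parser`; the 50-word threshold and the function names are hypothetical choices for illustration, not values taken from the audit script.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text that appears outside <script> and <style> elements."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def visible_word_count(raw_html: str) -> int:
    """Count words a crawler could read without running JavaScript."""
    parser = _TextExtractor()
    parser.feed(raw_html)
    parser.close()
    return sum(len(chunk.split()) for chunk in parser.chunks)


def looks_js_dependent(raw_html: str, min_words: int = 50) -> bool:
    """Heuristic: very little static text suggests client-side rendering."""
    return visible_word_count(raw_html) < min_words
```

A page that ships only an empty root element plus a script bundle would score zero words here, which is the case the section warns about.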

Discovery Gap

Startups face a "recency wall" - 30:1 visibility disadvantage vs established domains:
| Site Age | Visibility | Strategy |
| --- | --- | --- |
| <2 years | ~3.3% | Web-augmented: Reddit, referring domains |
| 2+ years | ~99% | GEO content: entity density, hedge reduction |

See: references/discovery-strategies.md
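
The 30:1 figure follows from the two visibility numbers quoted in this section (99 / 3.3 ≈ 30). The short sketch below reproduces that arithmetic and the age-based strategy choice; the function names and the returned labels are hypothetical, chosen only for this example.

```python
def visibility_ratio(established_pct: float = 99.0, new_pct: float = 3.3) -> float:
    """Ratio of established-domain visibility to new-domain visibility."""
    return established_pct / new_pct


def recommended_strategy(site_age_years: float) -> str:
    """Pick a strategy from site age, using the 2-year cutoff quoted above."""
    if site_age_years < 2:
        return "web-augmented"  # e.g. Reddit, referring domains
    return "geo-content"        # e.g. entity density, hedge reduction
```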

AgentFacts Setup

The NANDA protocol provides machine-readable metadata for AI agent discovery. Place the following document at /.well-known/agent-facts:
json
{
  "@context": "https://nanda.dev/ns/agent-facts/v1",
  "id": "nanda:example.com",
  "agent_name": "urn:agent:example:com",
  "endpoints": {
    "static": ["https://api.example.com/v1/agent"]
  },
  "capabilities": {
    "modalities": ["text"],
    "authentication": {
      "methods": ["oauth2", "jwt"]
    }
  },
  "trust": {
    "certification": "self-attested",
    "human_oversight": "true"
  }
}
See: references/agentfacts-schema.md
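
Before publishing the file, a quick structural check can catch missing or mistyped fields. The sketch below only verifies the top-level keys that appear in the example document in this section; treating all of them as required is an assumption of this example, and it is not a substitute for validation against the schema in references/agentfacts-schema.md.

```python
import json

# Keys and types taken from the example document in this section.
# Whether the real schema requires all of them is an assumption.
EXPECTED_KEYS = {
    "@context": str,
    "id": str,
    "agent_name": str,
    "endpoints": dict,
    "capabilities": dict,
    "trust": dict,
}


def check_agent_facts(raw: str) -> list[str]:
    """Return a list of problems; an empty list means the shape matches."""
    try:
        doc = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    if not isinstance(doc, dict):
        return ["top-level value must be a JSON object"]
    problems = []
    for key, expected_type in EXPECTED_KEYS.items():
        if key not in doc:
            problems.append(f"missing key: {key}")
        elif not isinstance(doc[key], expected_type):
            problems.append(
                f"wrong type for {key}: expected {expected_type.__name__}"
            )
    return problems
```

Running it over the contents of the generated file and printing the returned list gives a simple pass/fail report.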

Audit Report Sections

A complete GEO audit covers:
  1. Technical Visibility
    • HTML payload size (target: <1MB)
    • JS dependency assessment
    • Crawler accessibility
  2. Content Authority
    • Hedge density score (target: <0.2%)
    • Confidence assessment
    • Entity density
  3. Discovery Strategy
    • Site age analysis
    • Recency wall detection
    • Recommended approach
  4. Agent Infrastructure
    • AgentFacts presence
    • Schema validation
    • NANDA compliance

Scripts

| Script | Purpose |
| --- | --- |
| audit-geo.py | Full GEO audit for a URL |
| check-hedge-density.py | Analyze content confidence |
| generate-agentfacts.py | Create NANDA-compliant schema |

Reference Documentation

| File | Contents |
| --- | --- |
| hedge-density.md | Confidence scoring methodology |
| agentfacts-schema.md | NANDA protocol specification |
| technical-visibility.md | Crawler behavior & limits |
| discovery-strategies.md | Startup vs incumbent approaches |

Example Commands

bash
# Full audit with report
python3 ${CLAUDE_SKILL_DIR}/scripts/audit-geo.py https://mysite.com --output report.md

# Quick hedge check
python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --url https://mysite.com/about

# Generate AgentFacts for SaaS
python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py \
  --domain myapp.com \
  --agent-name "MyApp AI" \
  --capabilities text \
  --auth oauth2,apikey

Key Takeaways

  1. GEO != SEO - Different crawlers, different rules
  2. Confidence wins - Hedge density <0.2% is the target
  3. Size matters - Stay under 1MB HTML
  4. JS is risky - Content must work without rendering
  5. Age affects strategy - Startups need web-augmented signals
  6. AgentFacts is future-proofing - Implement NANDA now