geo-optimizer

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

GEO Optimizer

GEO优化器

Generative Engine Optimization (GEO) for AI search visibility.

面向AI搜索可见性的生成式引擎优化（Generative Engine Optimization，简称GEO）。

What is GEO?

什么是GEO？

GEO is NOT traditional SEO. AI search engines (ChatGPT, Perplexity, Claude, Gemini) work fundamentally differently:

They fetch raw HTML - Many don't render JavaScript
They have size limits - Crawlers abandon pages >1MB HTML
They evaluate confidence - Hedged language ("maybe", "possibly") ranks 3x lower than confident assertions
They need machine-readable metadata - AgentFacts/NANDA protocol for AI agent discovery

GEO并非传统SEO。AI搜索引擎（ChatGPT、Perplexity、Claude、Gemini）的工作原理有着本质区别：

它们抓取原始HTML - 许多引擎不渲染JavaScript
它们有大小限制 - 爬虫会放弃HTML大于1MB的页面
它们评估置信度 - 模糊表述（如“也许”、“可能”）的排名比确定性断言低3倍
它们需要机器可读元数据 - 用于AI Agent发现的AgentFacts/NANDA协议

Quick Audit Workflow

快速审计工作流

1. Technical Visibility Check

1. 技术可见性检查

bash

undefined

bash

undefined

Run full GEO audit

运行完整GEO审计

python3 ${CLAUDE_SKILL_DIR}/scripts/audit-geo.py https://example.com

Check HTML size and JS dependency

检查HTML大小和JavaScript依赖

curl -sI https://example.com | grep -i content-length

undefined

curl -sI https://example.com | grep -i content-length

undefined

2. Content Authority Analysis

2. 内容权威性分析

bash

undefined

bash

undefined

Check hedge density of your content

分析内容的模糊表述密度

python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --url https://example.com/blog/post

Or analyze text directly

或直接分析文本

python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --text "Your content here"

undefined

python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --text "Your content here"

undefined

3. Agent Infrastructure Setup

3. Agent基础设施配置

bash

undefined

bash

undefined

Generate AgentFacts schema for your domain

为你的域名生成AgentFacts schema

python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py
--domain example.com
--agent-name "Example Service"
--capabilities text,image
--output /.well-known/agent-facts

undefined

python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py
--domain example.com
--agent-name "Example Service"
--capabilities text,image
--output /.well-known/agent-facts

undefined

Core Metrics

核心指标

Hedge Density Score

模糊表述密度得分

Measures uncertainty language in content. Target: <0.2%

Hedge words to avoid:

maybe, possibly, perhaps, might, could be
however, although, it seems, arguably
some believe, it appears, potentially

Why it matters: Research shows confident citations rank 3x higher in AI responses. Every hedge word signals uncertainty to LLMs.

See:

references/hedge-density.md

衡量内容中的不确定性表述占比，目标值：<0.2%

需避免的模糊表述词汇：

也许、可能、大概、或许、说不定
然而、尽管、看起来、可以说
有人认为、似乎、潜在地

重要性： 研究表明，确定性表述在AI回复中的排名比模糊表述高3倍。每个模糊词汇都会向LLM传递不确定性信号。

参考：

references/hedge-density.md

HTML Size Budget

HTML大小限制

Maximum: 1MB raw HTML. Above this threshold:

18% of pages abandoned by crawlers
Content truncation risks
Slower indexing

Check with:

curl -sI URL | grep content-length

See:

references/technical-visibility.md

最大值：1MB原始HTML。超过此阈值：

18%的页面会被爬虫放弃
存在内容截断风险
索引速度变慢

检查命令：

curl -sI URL | grep content-length

参考：

references/technical-visibility.md

JS Dependency Risk

JavaScript依赖风险

AI crawlers vary in JavaScript rendering:

Rendering crawlers: GPTBot, PerplexityBot (sometimes)
Non-rendering: ClaudeBot, many others

If your content requires JS to display, you're invisible to ~40% of AI crawlers.

AI爬虫对JavaScript的渲染能力各不相同：

支持渲染的爬虫：GPTBot、PerplexityBot（部分情况）
不支持渲染的爬虫：ClaudeBot及其他多数爬虫

如果你的内容需要依赖JavaScript才能显示，那么你将对约40%的AI爬虫不可见。

Discovery Gap

发现差距

Startups face a "recency wall" - 30:1 visibility disadvantage vs established domains:

Site Age	Visibility	Strategy
<2 years	~3.3%	Web-augmented: Reddit, referring domains
2+ years	~99%	GEO content: entity density, hedge reduction

See:

references/discovery-strategies.md

初创企业面临“时效性壁垒”——与成熟域名相比，可见性处于30:1的劣势：

网站年限	可见性	策略
<2年	~3.3%	网络增强型：Reddit、引用域名
2年以上	~99%	GEO内容型：实体密度、减少模糊表述

参考：

references/discovery-strategies.md

AgentFacts Setup

AgentFacts配置

The NANDA protocol provides machine-readable metadata for AI agent discovery. Place at

/.well-known/agent-facts

json

{
  "@context": "https://nanda.dev/ns/agent-facts/v1",
  "id": "nanda:example.com",
  "agent_name": "urn:agent:example:com",
  "endpoints": {
    "static": ["https://api.example.com/v1/agent"]
  },
  "capabilities": {
    "modalities": ["text"],
    "authentication": {
      "methods": ["oauth2", "jwt"]
    }
  },
  "trust": {
    "certification": "self-attested",
    "human_oversight": "true"
  }
}

See:

references/agentfacts-schema.md

NANDA协议为AI Agent发现提供机器可读元数据。需将其放置在

/.well-known/agent-facts

路径下：

json

{
  "@context": "https://nanda.dev/ns/agent-facts/v1",
  "id": "nanda:example.com",
  "agent_name": "urn:agent:example:com",
  "endpoints": {
    "static": ["https://api.example.com/v1/agent"]
  },
  "capabilities": {
    "modalities": ["text"],
    "authentication": {
      "methods": ["oauth2", "jwt"]
    }
  },
  "trust": {
    "certification": "self-attested",
    "human_oversight": "true"
  }
}

参考：

references/agentfacts-schema.md

Audit Report Sections

审计报告章节

A complete GEO audit covers:

Technical Visibility
- HTML payload size (target: <1MB)
- JS dependency assessment
- Crawler accessibility
Content Authority
- Hedge density score (target: <0.2%)
- Confidence assessment
- Entity density
Discovery Strategy
- Site age analysis
- Recency wall detection
- Recommended approach
Agent Infrastructure
- AgentFacts presence
- Schema validation
- NANDA compliance

完整的GEO审计涵盖以下部分：

技术可见性
- HTML负载大小（目标：<1MB）
- JavaScript依赖评估
- 爬虫可访问性
内容权威性
- 模糊表述密度得分（目标：<0.2%）
- 置信度评估
- 实体密度
发现策略
- 网站年限分析
- 时效性壁垒检测
- 推荐方案
Agent基础设施
- AgentFacts存在性
- Schema验证
- NANDA合规性

Scripts

脚本列表

Script	Purpose
`audit-geo.py`	Full GEO audit for a URL
`check-hedge-density.py`	Analyze content confidence
`generate-agentfacts.py`	Create NANDA-compliant schema

脚本	用途
`audit-geo.py`	对URL进行完整GEO审计
`check-hedge-density.py`	分析内容置信度
`generate-agentfacts.py`	创建符合NANDA标准的Schema

Reference Documentation

参考文档

File	Contents
`hedge-density.md`	Confidence scoring methodology
`agentfacts-schema.md`	NANDA protocol specification
`technical-visibility.md`	Crawler behavior & limits
`discovery-strategies.md`	Startup vs incumbent approaches

文件	内容
`hedge-density.md`	置信度评分方法论
`agentfacts-schema.md`	NANDA协议规范
`technical-visibility.md`	爬虫行为与限制
`discovery-strategies.md`	初创企业 vs 成熟企业的差异化方案

Example Commands

示例命令

bash

undefined

bash

undefined

Full audit with report

生成带报告的完整审计

python3 ${CLAUDE_SKILL_DIR}/scripts/audit-geo.py https://mysite.com --output report.md

Quick hedge check

快速检查模糊表述

python3 ${CLAUDE_SKILL_DIR}/scripts/check-hedge-density.py --url https://mysite.com/about

Generate AgentFacts for SaaS

为SaaS生成AgentFacts

python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py
--domain myapp.com
--agent-name "MyApp AI"
--capabilities text
--auth oauth2,apikey

undefined

python3 ${CLAUDE_SKILL_DIR}/scripts/generate-agentfacts.py
--domain myapp.com
--agent-name "MyApp AI"
--capabilities text
--auth oauth2,apikey

undefined

Key Takeaways

核心要点

GEO != SEO - Different crawlers, different rules
Confidence wins - Hedge density <0.2% is the target
Size matters - Stay under 1MB HTML
JS is risky - Content must work without rendering
Age affects strategy - Startups need web-augmented signals
AgentFacts is future-proofing - Implement NANDA now

GEO ≠ SEO - 爬虫不同，规则不同
置信度制胜 - 模糊表述密度需低于0.2%
大小很重要 - HTML需控制在1MB以内
JavaScript有风险 - 内容需在无渲染情况下可正常显示
年限影响策略 - 初创企业需要网络增强型信号
AgentFacts是未来保障 - 立即部署NANDA协议