reviewing-ai-papers

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Reviewing AI Papers

审阅AI论文

When users request analysis of AI/ML technical content (papers, articles, blog posts), extract actionable insights filtered through an enterprise AI engineering lens and store valuable discoveries to memory for cross-session recall.
当用户请求分析AI/ML技术内容(论文、文章、博客)时,基于企业AI工程视角提取可落地的洞见,并将有价值的发现存储至记忆系统,以便跨会话调用。

Contextual Priorities

核心关注方向

Technical Architecture:
  • RAG systems (semantic/lexical search, hybrid retrieval)
  • Vector database optimization and embedding strategies
  • Model fine-tuning for specialized scientific domains
  • Knowledge distillation for secure on-premise deployment
Implementation & Operations:
  • Prompt engineering and in-context learning techniques
  • Security and IP protection in AI systems
  • Scientific accuracy and hallucination mitigation
  • AWS integration (Bedrock/SageMaker)
Enterprise & Adoption:
  • Enterprise deployment in regulated environments
  • Building trust with scientific/legal stakeholders
  • Internal customer success strategies
  • Build vs. buy decision frameworks
技术架构:
  • RAG系统(语义/词汇搜索、混合检索)
  • 向量数据库优化与嵌入策略
  • 面向特定科学领域的模型微调
  • 用于安全本地部署的知识蒸馏
实施与运维:
  • Prompt工程与上下文学习技术
  • AI系统的安全与知识产权保护
  • 科学准确性与幻觉缓解
  • AWS集成(Bedrock/SageMaker)
企业落地与推广:
  • 合规环境下的企业级部署
  • 与科研/法务相关方建立信任
  • 内部客户成功策略
  • 自研vs采购决策框架

Analytical Standards

分析标准

  • Maintain objectivity: Extract factual insights without amplifying source hype
  • Challenge novelty claims: Identify what practitioners already use as baselines. Distinguish "applies existing techniques" from "genuinely new methods"
  • Separate rigor from novelty: Well-executed study of standard techniques ≠ methodological breakthrough
  • Confidence transparency: Distinguish established facts, emerging trends, speculative claims
  • Contextual filtering: Prioritize insights mapping to current challenges
  • 保持客观性:提取事实性洞见,不夸大原文的宣传性表述
  • 质疑创新性宣称:明确从业者已在使用的基线方案,区分“应用现有技术”与“真正全新方法”
  • 区分严谨性与创新性:对标准技术的出色研究≠方法学突破
  • 置信度透明化:区分已确立的事实、新兴趋势、推测性宣称
  • 场景化筛选:优先关注与当前挑战匹配的洞见

Analysis Structure

分析框架

For Substantive Content

针对有实质内容的材料

Article Assessment (2-3 sentences)
  • Core topic and primary claims
  • Credibility: author expertise, evidence quality, methodology rigor
Prioritized Insights
  • High Priority: Direct applications to active projects
  • Medium Priority: Adjacent technologies worth monitoring
  • Low Priority: Interesting but not immediately actionable
Technical Evaluation
  • Distinguish novel methods from standard practice presented as innovation
  • Flag implementation challenges, risks, resource requirements
  • Note contradictions with established best practices
Actionable Recommendations
  • Research deeper: Specific areas requiring investigation
  • Evaluate for implementation: Techniques worth prototyping
  • Share with teams: Which teams benefit from this content
  • Monitor trends: Emerging areas to track
Immediate Applications Map insights to current projects. Identify quick wins or POC opportunities.
文章评估(2-3句话)
  • 核心主题与主要宣称
  • 可信度:作者专业能力、证据质量、方法严谨性
优先级化洞见
  • 高优先级:可直接应用于在研项目
  • 中优先级:值得关注的关联技术
  • 低优先级:有吸引力但暂无直接落地可能
技术评估
  • 区分真正的新方法与被包装成创新的标准实践
  • 标记实施挑战、风险与资源需求
  • 记录与已确立最佳实践的矛盾点
可落地建议
  • 深入研究:需要进一步调研的特定领域
  • 实施评估:值得原型验证的技术
  • 团队共享:哪些团队能从此内容中获益
  • 趋势监控:需要追踪的新兴领域
即时应用 将洞见映射到现有项目,识别快速落地机会或POC(概念验证)场景。

For Thin Content

针对内容单薄的材料

  • State limitations upfront
  • Extract marginal insights if any
  • Recommend alternatives if topic matters
  • Keep brief
  • 直接说明局限性
  • 若有则提取少量洞见
  • 若主题重要则推荐替代内容
  • 保持简洁

Memory Integration

记忆整合

Automatic storage triggers:
  • High-priority insights (directly applicable)
  • Novel techniques worth prototyping
  • Pattern recognitions across papers
  • Contradictions to established practice
Storage format:
python
remember(
    "[Source: {title or url}] {condensed insight}",
    "world",
    tags=["paper-insight", "{domain}", "{technique}"],
    conf=0.85  # higher for strong evidence
)
Compression rule:
  • Full analysis → conversation (what user sees)
  • Condensed insight → memory (searchable nugget with attribution)
  • Store the actionable kernel, not the whole analysis
Example:
Analysis says: "Hybrid retrieval (BM25 + dense) shows 23% improvement over pure semantic search for scientific queries. Two-stage approach..."
Store as:
"[Source: arxiv.org/abs/2401.xxxxx] Hybrid BM25+dense retrieval: 23% lift over semantic-only for scientific corpora. Requires 10K+ domain examples for fine-tuning benefit."
Tags:
["paper-insight", "rag", "hybrid-retrieval", "scientific-domain"]
自动存储触发条件:
  • 高优先级洞见(可直接应用)
  • 值得原型验证的新技术
  • 跨论文的模式识别
  • 与已确立实践的矛盾点
存储格式:
python
remember(
    "[Source: {title or url}] {condensed insight}",
    "world",
    tags=["paper-insight", "{domain}", "{technique}"],
    conf=0.85  # higher for strong evidence
)
压缩规则:
  • 完整分析→对话内容(用户可见)
  • 浓缩洞见→记忆存储(可搜索的带来源的核心信息)
  • 存储可落地的核心内容,而非完整分析
示例:
分析内容:“混合检索(BM25 + 稠密检索)在科学领域查询上比纯语义搜索表现提升23%。两阶段方法……”
存储为:
"[Source: arxiv.org/abs/2401.xxxxx] Hybrid BM25+dense retrieval: 23% lift over semantic-only for scientific corpora. Requires 10K+ domain examples for fine-tuning benefit."
标签:
["paper-insight", "rag", "hybrid-retrieval", "scientific-domain"]

Output Standards

输出标准

  • Conciseness: Actionable insights, not content restatement
  • Precision: Distinguish demonstrates/suggests/claims/speculates
  • Relevance: Connect to focus areas or state no connection
  • Adaptive depth: Match length to content value
  • 简洁性:输出可落地洞见,而非内容复述
  • 精准性:区分“证实”/“暗示”/“宣称”/“推测”
  • 相关性:关联核心关注领域,若无关联则明确说明
  • 适配性深度:内容长度与价值匹配

Constraints

约束条件

  • No hype amplification
  • No timelines unless requested
  • No speculation beyond article
  • Note contradictions explicitly
  • State limitations on thin content
  • 不夸大宣传
  • 除非被要求,否则不提供时间线
  • 不做超出原文的推测
  • 明确记录矛盾点
  • 针对内容单薄的材料明确说明局限性