github-repo-candidate-quality-gate

GitHub Repo Candidate Quality Gate

Use this skill when broad GitHub search output is too noisy for merge-ready recommendations.

When to use

  • You already ran `gh search repos` (or equivalent) and need recommendation-grade keeps.
  • You must preserve broad discovery evidence while filtering keeps deterministically.
  • You need lane-level status + degraded-cause reporting for unattended cron review.

Minimum keep metadata

For every kept candidate, capture all fields:
  • full_name
  • url
  • license
  • pushed_at (or updated_at fallback)
  • archived
  • stars
  • one-line fit rationale
  • provenance label
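
The field list above can be sketched as a small record type. This is a minimal illustration, not the skill's actual schema: the names `KeptCandidate` and `from_search_record` are hypothetical, and the camelCase input keys (`fullName`, `pushedAt`, `isArchived`, `stargazersCount`, and so on) are an assumption about the JSON shape your search step emits.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass(frozen=True)
class KeptCandidate:
    """Minimum metadata for one kept candidate (hypothetical type name)."""
    full_name: str
    url: str
    license: Optional[str]
    pushed_at: str        # ISO 8601 string; falls back to updated_at
    archived: bool
    stars: int
    fit_rationale: str    # exactly one line
    provenance: str       # e.g. "direct page retrieval"


def from_search_record(rec: Dict[str, Any], fit_rationale: str, provenance: str) -> KeptCandidate:
    # ASSUMPTION: input keys are camelCase; adjust to whatever your search step emits.
    lic = rec.get("license") or {}
    license_key = (lic.get("key") if isinstance(lic, dict) else lic) or None
    # Prefer pushed_at; fall back to updated_at when it is missing or empty.
    pushed = rec.get("pushedAt") or rec.get("updatedAt")
    if not pushed:
        raise ValueError("record has neither pushedAt nor updatedAt")
    rationale = fit_rationale.strip()
    if not rationale or "\n" in rationale:
        raise ValueError("fit rationale must be exactly one non-empty line")
    if not provenance.strip():
        raise ValueError("provenance label is required")
    return KeptCandidate(
        full_name=rec["fullName"],
        url=rec["url"],
        license=license_key,
        pushed_at=pushed,
        archived=bool(rec.get("isArchived", False)),
        stars=int(rec.get("stargazersCount", 0)),
        fit_rationale=rationale,
        provenance=provenance.strip(),
    )
```

Raising on a missing timestamp, rationale, or provenance label is one way to enforce "capture all fields"; a pipeline could equally record the gap and drop the candidate.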

Quality gate order

  1. License gate: drop repos with a missing or unknown license unless an explicit exception rationale is recorded.
  2. Archive gate: drop archived repos by default.
  3. Freshness gate: drop if `pushed_at` is older than 24 months unless an exception and its risk are documented.
  4. Fit gate: drop keyword-only mismatches (low-fit).
  5. Signal gate: keep broad discovery evidence, but recommendation-grade keeps require traction (default: `stars >= 3`) or an explicit adoption rationale.
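
The gate order above can be sketched as a single function that returns the first failing cause. This is a rough illustration under stated assumptions: the function name `first_failed_gate` and the input keys `keyword_only`, `exception_rationale`, `risk_note`, and `adoption_rationale` are hypothetical, 24 months is approximated as 730 days, and an exception is honored only when both a rationale and a risk note are present.

```python
from datetime import datetime, timedelta, timezone
from typing import Any, Dict, Optional

STALE_AFTER = timedelta(days=730)  # ASSUMPTION: 24 months approximated as 730 days
MIN_STARS = 3                      # default traction threshold from the signal gate


def first_failed_gate(repo: Dict[str, Any], now: datetime) -> Optional[str]:
    """Return the cause label of the first failing gate, or None if the repo is kept."""
    # ASSUMPTION: an exception counts only with both a rationale and a risk note.
    has_exception = bool(repo.get("exception_rationale")) and bool(repo.get("risk_note"))

    # 1. License gate
    lic = repo.get("license")
    if (not lic or lic == "unknown") and not has_exception:
        return "license"
    # 2. Archive gate (this sketch does not model overriding the default)
    if repo.get("archived", False):
        return "archived"
    # 3. Freshness gate
    if now - repo["pushed_at"] > STALE_AFTER and not has_exception:
        return "stale"
    # 4. Fit gate: keyword_only is a hypothetical flag set by an earlier fit review
    if repo.get("keyword_only", False):
        return "low-fit"
    # 5. Signal gate
    if repo.get("stars", 0) < MIN_STARS and not repo.get("adoption_rationale"):
        return "low-signal"
    return None


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    repo = {"license": "mit", "archived": False,
            "pushed_at": now - timedelta(days=90), "stars": 1}
    print(first_failed_gate(repo, now))
```

Returning only the first failing cause keeps each dropped repo counted under exactly one label, which makes the per-cause totals add up to the number of drops.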

Reporting contract

  • Keep and publish full discovery counts even when keeps are sparse.
  • Emit `lane_status: pass|degraded`.
  • Emit `degraded_causes` counts using: `license`, `stale`, `low-fit`, `archived`, `low-signal`, `no-results`.
  • When a lane ends with `raw_count == 0` after documented recovery, include `no-results` explicitly to keep degraded reports deterministic.
  • Include lane health metrics: `kept_count`, `raw_count`, `median_stars_raw`, `zero_star_raw`.
  • State provenance labels used (`direct page retrieval`, `indexed snippet`, etc.).
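
One possible shape for a report that satisfies this contract is sketched below. The function name `build_lane_report` and the `min_keeps` threshold are hypothetical. The source does not define exactly when a lane counts as degraded, so this sketch assumes a lane is degraded when it has no raw results or fewer than `min_keeps` keeps.

```python
import statistics
from collections import Counter
from typing import Any, Dict, List

CAUSES = ("license", "stale", "low-fit", "archived", "low-signal", "no-results")


def build_lane_report(raw_stars: List[int], kept_count: int, drop_causes: List[str],
                      provenance_labels: List[str], min_keeps: int = 1) -> Dict[str, Any]:
    unknown = set(drop_causes) - set(CAUSES)
    if unknown:
        raise ValueError(f"unknown degraded causes: {sorted(unknown)}")
    counts = Counter(drop_causes)
    raw_count = len(raw_stars)
    if raw_count == 0:
        # Always report no-results explicitly so empty lanes look the same on every run.
        counts["no-results"] = max(counts["no-results"], 1)
    # ASSUMPTION: degraded means no raw results or too few keeps (min_keeps is hypothetical).
    degraded = raw_count == 0 or kept_count < min_keeps
    return {
        "lane_status": "degraded" if degraded else "pass",
        # Emit every cause key, including zeros, in a fixed order.
        "degraded_causes": {cause: counts.get(cause, 0) for cause in CAUSES},
        "kept_count": kept_count,
        "raw_count": raw_count,
        "median_stars_raw": statistics.median(raw_stars) if raw_stars else None,
        "zero_star_raw": sum(1 for s in raw_stars if s == 0),
        "provenance_labels": sorted(set(provenance_labels)),
    }
```

Emitting all six cause keys with zero counts, rather than only the non-zero ones, keeps the report shape stable for an unattended reviewer that diffs successive runs.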

Rollback-safe operation

  • Never delete raw evidence files.
  • If stricter gates reduce keep counts too far, preserve output but mark lane degraded.
  • Allow exception keeps only when rationale + risk note are explicit.
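
One way to make "never delete raw evidence" hard to violate by accident is to write each run's raw hits to a new file and refuse to overwrite an existing one. The sketch below assumes a per-run identifier and a `.raw.json` naming scheme; both, along with the function name `save_raw_evidence`, are hypothetical.

```python
import json
from pathlib import Path
from typing import Any, List


def save_raw_evidence(evidence_dir: Path, run_id: str, raw_hits: List[Any]) -> Path:
    """Write raw search hits once per run; never overwrite an earlier file."""
    evidence_dir.mkdir(parents=True, exist_ok=True)
    path = evidence_dir / f"{run_id}.raw.json"
    # Mode "x" raises FileExistsError instead of truncating an existing file.
    with open(path, "x", encoding="utf-8") as fh:
        json.dump(raw_hits, fh, indent=2)
    return path
```

Filtered keeps and the lane report can then be written to separate files, so tightening or loosening a gate only ever regenerates derived output.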

Example

  • Input: 30 raw hits, many zero-star repos.
  • Output: discovery evidence retained, recommendation keeps filtered by metadata/freshness/signal gates, degraded causes documented for reviewer.