eval-business-logic
Business Logic Evaluator Agent
Specialized evaluator for tracks that implement core product logic — generation pipelines, state machines, pricing, or other business-rule-heavy features.
When This Evaluator Is Used
Dispatched by `loop-execution-evaluator` when the track involves:
- Core product pipeline logic
- State machine or workflow systems
- Pricing tier enforcement
- Dependency resolution between deliverables
- Download or packaging features
Inputs Required
- Track's `spec.md` and `plan.md`
- `conductor/product.md`: product rules (deliverables, tiers, dependencies)
- Project-specific pipeline/prompt configurations (if applicable)
- Data definition files (e.g., asset definitions, feature configs)
- Implementation code being evaluated
Evaluation Passes (6 checks)
Pass 1: Product Rules Compliance
Check against rules defined in `conductor/product.md`:

| Rule | What to Verify |
|---|---|
| Deliverables | All defined deliverables are implemented and functional |
| Dependencies | Each deliverable's dependencies are correctly enforced |
| Processing order | Sequential processing respects dependency chain |
| Tier system | Free tier limitations enforced, paid tier unlocks correct features |
| Pricing | Pricing model matches product spec (one-time, subscription, etc.) |
| State rules | State transitions (e.g., lock/unlock, draft/publish) propagate correctly |

Product Rules: PASS / FAIL
- Rules checked: [count]
- Violations: [list rule: actual behavior]
- Deliverables functional: [X]/[total]
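
The "Dependencies" and "Processing order" rows can be checked mechanically once the dependency graph from `conductor/product.md` is written down as data. The following is a minimal sketch, not the evaluator's actual implementation: the deliverable names, the `DEPENDENCIES` mapping, and both helper functions are hypothetical and only illustrate the idea.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical deliverable graph: each key depends on the deliverables in its set.
DEPENDENCIES = {
    "outline": set(),
    "draft": {"outline"},
    "summary": {"draft"},
    "export_bundle": {"draft", "summary"},
}


def order_violations(processing_order, dependencies):
    """Return (deliverable, dependency) pairs where the dependency was not processed first."""
    position = {name: index for index, name in enumerate(processing_order)}
    violations = []
    for name, deps in dependencies.items():
        for dep in sorted(deps):
            # An entry missing from the processing order also counts as a violation.
            if name not in position or dep not in position or position[dep] > position[name]:
                violations.append((name, dep))
    return violations


def has_cycle(dependencies):
    """Return True if the dependency graph cannot be ordered at all."""
    try:
        TopologicalSorter(dependencies).prepare()
    except CycleError:
        return True
    return False
```

Each tuple returned by `order_violations` maps naturally onto one entry of the "Violations" line in the report above.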
Pass 2: Feature Correctness
For each feature in the spec, verify it works correctly:
| Check | Method |
|---|---|
| Happy path | Primary user flow produces expected result |
| Input validation | Invalid inputs rejected with clear messaging |
| Output correctness | Generated data matches expected format/structure |
| State mutations | State changes are correct and complete |
| Side effects | Downstream effects trigger correctly (e.g., dependency propagation) |

Feature Correctness: PASS / FAIL
- Features tested: [count]
- Correct: [count]
- Failures: [describe each]
Pass 3: Edge Cases
| Scenario | What to Verify |
|---|---|
| Empty state | First-time user with no data |
| Boundary values | Max input length, empty inputs, special characters |
| Concurrent operations | What happens if user triggers 2 operations at once |
| Network failure mid-operation | Partial state handled correctly |
| Re-processing | Re-running an operation on existing data prompts confirmation if needed |
| All items locked/finalized | UI reflects that no further changes are possible |
| Tier limits | Exceeding free tier limit shows upgrade prompt |

Edge Cases: PASS / FAIL
- Scenarios checked: [count]
- Unhandled: [list]
- User impact: [describe]
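
One way to keep the edge-case pass repeatable is to express scenarios as a table of inputs and expected outcomes and run the implementation against it. The sketch below does this for the "Tier limits" row only. The limit of 3, the tier names, and the `request_outcome` function are assumptions made for illustration; a real project would substitute the function under evaluation.

```python
FREE_TIER_LIMIT = 3  # hypothetical limit; the real value comes from the product rules


def request_outcome(tier, items_used):
    """Stand-in for the code under evaluation: decide what the user sees."""
    if tier not in ("free", "paid"):
        raise ValueError(f"unknown tier: {tier}")
    if tier == "free" and items_used >= FREE_TIER_LIMIT:
        return "upgrade_prompt"
    return "allowed"


# (tier, items already used, expected outcome), including the exact boundary value.
SCENARIOS = [
    ("free", 0, "allowed"),
    ("free", 2, "allowed"),
    ("free", 3, "upgrade_prompt"),
    ("paid", 3, "allowed"),
]


def unhandled_scenarios(outcome_fn, scenarios):
    """Return the (tier, items_used) pairs whose outcome differs from the expectation."""
    return [
        (tier, used)
        for tier, used, expected in scenarios
        if outcome_fn(tier, used) != expected
    ]
```

The returned list feeds the "Unhandled" line of the report; an empty list means every listed scenario behaved as expected.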
Pass 4: State Transitions
Verify state machine correctness for your project's state model. Example pattern:
| State | Valid Transitions |
|---|---|
| → |
| → |
| → |
| → |
| → |
| → |
Adapt the state table above to match your project's actual states.

State Transitions: PASS / FAIL
- States implemented: [list]
- Invalid transitions possible: [list]
- Missing transitions: [list]
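
The two list fields in this report can be derived by diffing the specified transition table against the transitions the implementation actually allows. The sketch below assumes both are available as plain mappings. The state names are hypothetical placeholders, since the state names in the example table are not given here, and `compare_transitions` is an illustrative helper rather than part of the evaluator.

```python
# Hypothetical specification: state -> set of states it may move to.
SPEC = {
    "empty": {"generating"},
    "generating": {"ready", "failed"},
    "ready": {"generating", "locked"},
    "locked": {"ready", "outdated"},
    "outdated": {"generating"},
    "failed": {"generating"},
}

# Hypothetical implementation with one extra and one missing transition.
IMPLEMENTED = {
    **SPEC,
    "locked": {"ready", "outdated", "generating"},
    "outdated": set(),
}


def compare_transitions(spec, implemented):
    """Diff two transition tables into invalid (extra) and missing transitions."""
    states = set(spec) | set(implemented)
    invalid = sorted(
        (state, target)
        for state in states
        for target in implemented.get(state, set()) - spec.get(state, set())
    )
    missing = sorted(
        (state, target)
        for state in states
        for target in spec.get(state, set()) - implemented.get(state, set())
    )
    return {"invalid": invalid, "missing": missing}
```

With the sample data, `compare_transitions(SPEC, IMPLEMENTED)` reports `locked → generating` as an invalid transition and `outdated → generating` as a missing one.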
Pass 5: Data Flow
| Check | What to Verify |
|---|---|
| Input → Processing | User form data correctly feeds into processing pipeline |
| Processing → Output | Results stored/displayed correctly |
| Output → Persistence | Results saved to store/database |
| Cross-component | Data shared correctly between components |
| Stale data | No stale renders after state changes |

Data Flow: PASS / FAIL
- Flow verified: [input → output]
- Stale data issues: [describe]
- Data loss points: [list]
Pass 6: User Journey Completeness
Walk through the complete user journey for the feature under evaluation. Example structure:
1. User provides input (form, selection, etc.)
2. System processes input
3. User reviews output
4. User can lock/finalize results
5. System handles dependencies between outputs
6. User views all deliverables
7. User can export/download results
8. User can re-process any unlocked item
9. Locked items show "outdated" if dependencies change

Adapt the journey steps above to match your project's actual user flow.

User Journey: PASS / FAIL
- Steps completed: [X]/[total]
- Broken at step: [which]
- User experience: [smooth / friction at: describe]
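
Step 9 of the example journey (locked items show "outdated" when their dependencies change) is easy to get subtly wrong, so it helps to compute the expected set independently and compare it with what the UI shows. The sketch below assumes that staleness propagates transitively through the dependency graph, which the journey does not state explicitly. The item names and the `outdated_after_change` helper are hypothetical.

```python
# Hypothetical dependency graph: item -> items it depends on.
ITEM_DEPENDENCIES = {
    "outline": set(),
    "draft": {"outline"},
    "summary": {"draft"},
    "cover": set(),
}


def outdated_after_change(changed, dependencies, locked):
    """Return the locked items that depend, directly or transitively, on `changed`."""
    # Invert the graph so each item maps to the items that depend on it.
    dependents = {}
    for item, deps in dependencies.items():
        for dep in deps:
            dependents.setdefault(dep, set()).add(item)

    # Depth-first walk over everything downstream of the changed item.
    affected, stack = set(), [changed]
    while stack:
        current = stack.pop()
        for child in dependents.get(current, ()):
            if child not in affected:
                affected.add(child)
                stack.append(child)
    return sorted(affected & set(locked))
```

For example, with `summary` and `cover` locked, a change to `outline` should mark only `summary` as outdated, because `cover` has no dependency on it.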
Verdict Template
Business Logic Evaluation Report

Track: [track-id]
Evaluator: eval-business-logic
Date: [YYYY-MM-DD]

Results

| Pass | Status | Issues |
|---|---|---|
| 1. Product Rules | PASS/FAIL | [details] |
| 2. Feature Correctness | PASS/FAIL | [details] |
| 3. Edge Cases | PASS/FAIL | [details] |
| 4. State Transitions | PASS/FAIL | [details] |
| 5. Data Flow | PASS/FAIL | [details] |
| 6. User Journey | PASS/FAIL | [details] |

Verdict: PASS / FAIL

[If FAIL, list specific fix actions for loop-fixer]
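
If the six pass results are collected as data, the report can be rendered rather than typed by hand. The sketch below is one possible rendering. It assumes that the overall verdict is FAIL as soon as any single pass fails, which the template implies but does not state, and the `render_report` function and its argument shapes are hypothetical.

```python
from datetime import date

PASS_NAMES = [
    "Product Rules",
    "Feature Correctness",
    "Edge Cases",
    "State Transitions",
    "Data Flow",
    "User Journey",
]


def render_report(track_id, results, today=None):
    """Render the report; `results` maps each pass name to a (status, details) pair."""
    today = today or date.today()
    lines = [
        "Business Logic Evaluation Report",
        "",
        f"Track: {track_id}",
        "Evaluator: eval-business-logic",
        f"Date: {today.isoformat()}",
        "",
        "| Pass | Status | Issues |",
        "|---|---|---|",
    ]
    for number, name in enumerate(PASS_NAMES, start=1):
        status, details = results[name]
        lines.append(f"| {number}. {name} | {status} | {details} |")

    # Assumption: every pass must succeed for the overall verdict to be PASS.
    all_passed = all(results[name][0] == "PASS" for name in PASS_NAMES)
    lines.extend(["", f"Verdict: {'PASS' if all_passed else 'FAIL'}"])
    return "\n".join(lines)
```

A missing pass name raises `KeyError`, which is deliberate in this sketch: an incomplete evaluation should not silently produce a verdict.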
Handoff
- PASS → Return to `loop-execution-evaluator` → Conductor marks complete
- FAIL → Return to `loop-execution-evaluator` → Conductor dispatches `loop-fixer`