Loading...
Loading...
Guides ML/research engineering for safeguards—safety classifier development, harm benchmarks and eval suites, labeled dataset design, fine-tuning and ablations, calibration and slice analysis, attack-surface research memos, and promotion criteria for new moderation models. Use when building or evaluating guardrail models, designing safety benchmarks, measuring precision/recall on policy categories, comparing mitigation techniques, or writing research reports on classifier improvements—not for production inference gateways (ml-infrastructure-engineer-safeguards), PII/leakage privacy research (privacy-research-engineer-safeguards), red-team attack campaigns (ai-redteam), AI governance policy (ai-risk-governance), general non-safety research (ai-researcher), or token-efficiency studies (research-engineer-scientist-tokens).
npx skill4agent add daemon-blockint-tech/agentic-enteprises-skill ml-research-engineer-safeguardsml-infrastructure-engineer-safeguardsai-redteamai-risk-governanceai-engineerai-researcherresearch-engineer-scientist-tokensdata-scientistprivacy-research-engineer-safeguards| Need | Skill |
|---|---|
| Privacy research for safeguards | |
| Production safeguard path and rollout | |
| Adversarial attack campaigns | |
| Governance sign-off and model cards | |
| Production eval harness in app | |
| General research methodology | |
| Classical ML and statistics | |
| Token efficiency ablations | |
| Release gates and ops cadence | |
references/research_framing_safety.mdreferences/safety_benchmarks_datasets.mdreferences/classifier_model_development.mdreferences/evaluation_metrics_analysis.mdreferences/ablation_experiment_design.mdreferences/research_to_production_handoff.md