Loading...
Loading...
Adversarial robustness engineering for ML/AI—evasion, poisoning, extraction, membership-inference threat models; robust training, sanitization, detectors; ASR/certified evals; lab model attacks; data-pipeline integrity; production I/O guardrails (classical ML and LLM/multimodal). Use for adversarial examples, robustness suites, poison audits, deploy guardrails—not LLM app red team (ai-redteam), governance (ai-risk-governance), safety classifier R&D (ml-research-engineer-safeguards), safeguard serving (ml-infrastructure-engineer-safeguards), privacy research (privacy-research-engineer-safeguards), AppSec pentest (penetration-tester).
npx skill4agent add daemon-blockint-tech/agentic-enteprises-skill ai-adversarial-robustness-engineerai-redteamai-risk-governanceml-research-engineer-safeguardsml-infrastructure-engineer-safeguardsprivacy-research-engineer-safeguardsai-engineerai-researcherpenetration-testerweb-pentester| Need | Skill |
|---|---|
| LLM jailbreak and app-surface red team | |
| Governance sign-off and risk tiers | |
| Safety classifier R&D and harm evals | |
| Production safeguard serving path | |
| Privacy and extraction research | |
| Production LLM/RAG implementation | |
| General ML research methodology | |
| Pipeline and artifact security | |
references/adversarial_robustness_scope.mdreferences/threat_models_and_attack_taxonomy.mdreferences/evaluation_metrics_and_benchmarks.mdreferences/defenses_and_mitigations.mdreferences/red_team_campaigns_on_models.mdreferences/production_guardrails_and_monitoring.md