Loading...
Loading...
Extract false-positive and false-negative gaps from VLM binary-classification-question (BCQ, yes/no) predictions. Use after running VLM evaluation when you have a predictions JSON and need to identify failure cases for DEFT root cause analysis on a binary-classification VLM workflow.
npx skill4agent add nvidia/skills tao-analyze-gaps-vlm-bcqvlm_bcqgap_analysis vlm_bcq \
predictions_json=/path/to/results.json \
results_dir=/path/to/output/gapsvideos_dirvideo_idgap_analysis vlm_bcq \
predictions_json=/path/to/results.json \
results_dir=/path/to/output/gaps \
videos_dir=/path/to/videos/rootkpi_gaps_report.txtkpi_gaps.jsonlvideo_idresponsegtresponsegt'yes''no'video_idvideo_id[
{
"video_id": "/path/to/video.mp4",
"response": "Yes, there is a collision.",
"gt": "B. No",
"question": "Is there a collision?"
}
]video_iderror_typeFPFNquestionground_truthresponse| Parameter | Required | Description |
|---|---|---|
| predictions_json | Yes | Path to predictions JSON file |
| results_dir | Yes | Output directory; created if it does not exist |
| videos_dir | No | Base directory for resolving relative |
| Error | Cause | Fix |
|---|---|---|
| | Check the path |
| Predictions file is not a list | Wrap predictions in |
| A prediction item is missing a required field | Inspect and fix the predictions JSON |
| Samples silently skipped | | Check logs for warnings; inspect those samples |