Loading...
Loading...
Analyze media files (PDFs, images, diagrams) that require interpretation beyond raw text. Extracts specific information or summaries from documents, describes visual content. Use for document analysis, image understanding, diagram interpretation, chart analysis, table extraction, and any media requiring visual or contextual interpretation beyond literal text extraction.
npx skill4agent add 404kidwiz/claude-supercode-skills multimodal-analysis1. Document Structure Assessment
- Identify document type and purpose
- Map section hierarchy and organization
- Recognize formatting and layout patterns
2. Content Extraction
- Extract text content with structure preserved
- Identify and extract tables and lists
- Preserve metadata and formatting information
3. Contextual Understanding
- Understand document flow and logic
- Identify key themes and main points
- Summarize content while maintaining accuracy1. Component Identification
- Recognize different diagram elements (nodes, edges, symbols)
- Understand notation and conventions used
- Identify legends, labels, and annotations
2. Relationship Mapping
- Trace connections and relationships
- Understand flow directions and dependencies
- Identify hierarchies and groupings
3. Functional Interpretation
- Explain the purpose and function of the diagram
- Describe processes and decision points
- Identify inputs, outputs, and transformations1. Chart Type Recognition
- Identify chart type (bar, line, pie, scatter, etc.)
- Understand axes, scales, and data series
- Recognize legends and color coding
2. Data Extraction
- Extract numerical values from the visualization
- Identify trends, patterns, and outliers
- Compare different data series or time periods
3. Insight Generation
- Explain what the data means in context
- Identify significant findings and implications
- Note limitations or potential misinterpretations