Loading...
Loading...
Analyze images using AI — segment objects, detect objects, extract text (OCR), describe images, ask questions about images. Use when the user requests "Segment image", "Detect objects", "OCR", "Extract text from image", "Describe image", "What's in this image", "Image analysis".
npx skill4agent add fal-ai-community/skills fal-vision| Script | Purpose |
|---|---|
| Analyze an image (segment, detect, OCR, describe, QA) |
./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation segment --query "the red car"./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation detect./scripts/analyze.sh --image-url "https://example.com/document.jpg" --operation ocr./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation describe./scripts/analyze.sh --image-url "https://example.com/photo.jpg" --operation qa --query "How many people are in this image?"| Argument | Description | Required |
|---|---|---|
| URL of image to analyze | Yes |
| segment, detect, ocr, describe, qa | Yes |
| Text prompt for segment/qa operations | For segment/qa |
| Override model endpoint | No |
# Search for segmentation models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "segmentation"
# Search for object detection models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "object detection"
# Search for OCR models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "ocr"
# Search for image captioning / visual QA models
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "caption"
bash /mnt/skills/user/fal-generate/scripts/search-models.sh --query "visual question"search_models