Search Results: ocr-processing

Found 22 Skills

mineru

Parse PDF into Markdown/JSON/DOCX using MinerU API. Extract text, tables, formulas with OCR support. Use when converting PDF documents, extracting content from scanned papers, or batch processing PDF files.

🇺🇸|EnglishTranslated

Tools & Utilitiesconversiontools/agent-ski...

conversiontools

Convert files between 140+ formats using the ConversionTools MCP server. Use when the user needs to convert documents (Word, PDF, Excel, PowerPoint), data formats (JSON, CSV, XML, YAML, Parquet), images (PNG, JPG, WebP, AVIF, HEIC, JXL, SVG), audio (MP3, WAV, FLAC), video (MOV, MKV, AVI to MP4), e-books (EPUB, MOBI, AZW), OCR text extraction, AI-powered data extraction, AI text-to-speech (TTS), AI speech-to-text transcription (STT), subtitle conversion (SRT, VTT, ASS), or website screenshots.

🇺🇸|EnglishTranslated

AI & Machine Learningqianwen-ai/qianwen-ai

qianwen-vision

[QianWen] Understand images and videos with Qwen vision models. TRIGGER when: user wants to analyze, describe, or extract information from images or videos, OCR text extraction, chart/table reading, visual reasoning, multi-image comparison, screenshot understanding, video comprehension, or explicitly invokes this skill by name (e.g. use qianwen-vision). DO NOT TRIGGER when: user wants to generate/create images (use qianwen-image-generation), generate videos (use qianwen-video-generation), text-only tasks without visual input, or non-Qwen vision tasks.

🇺🇸|EnglishTranslated

6 scripts/Checked

Document Processingvincenzoimp/academic-rese...

document-conversion

Use when converting PDFs, DOCX, HTML, scanned papers, reports, proposals, tables, or figures into Markdown, text, extracted assets, or quality reports for an academic research repository.

🇺🇸|EnglishTranslated

AI & Machine Learningsreeram5678/india-market-...

corporate_spy

Handles fetching, reading, and summarizing official Indian corporate filings (BSE/NSE). Specialized in OCR for scanned PDFs.

🇺🇸|EnglishTranslated

Document Processingexglade/my-tax-specialist

my-tax-file-organizer

Organise and rename Malaysia personal tax documents in a workspace. Use when a tax filing folder needs cleanup, filenames need standardising, or documents need to be sorted without overwriting files.

🇺🇸|EnglishTranslated

AI & Machine Learningmembranedev/application-s...

azure-ai-vision

Azure AI Vision integration. Manage data, records, and automate workflows. Use when the user wants to interact with Azure AI Vision data.

🇺🇸|EnglishTranslated

AI & Machine Learningnexu-io/open-design

fal-vision

Analyze images — segment objects, detect, run OCR, describe, and answer visual questions via fal.ai vision models.

🇺🇸|EnglishTranslated

Data Processingtrpc-group/trpc-agent-go

ocr

Extract text from images using Tesseract OCR

🇺🇸|EnglishTranslated

2 scripts/Checked

Document Processingericgandrade/claude-super...

document-converter

This skill should be used when the user needs to convert documents between formats (Office to PDF, PDF to images, image to PDF), perform PDF operations (merge, split, rotate, encrypt, decrypt), or run OCR on scanned documents. Uses local free tools — LibreOffice, ghostscript, pdftk, tesseract, and imagemagick — with no API key required. Trigger when the user says "convert this document", "export to PDF", "merge PDFs", "split PDF", "rotate PDF", "OCR this scan", "convert PPTX to PDF", "convert DOCX to PDF", or any document format conversion request.

🇺🇸|EnglishTranslated