Search Results: ocr

Found 203 Skills

Tools & Utilitiesmembranedev/application-s...

ocr-web-service

OCR Web Service integration. Manage Documents. Use when the user wants to interact with OCR Web Service data.

🇺🇸|EnglishTranslated

AI & Machine Learningframersai/agentos-skills

vision-ocr

Extract text from images using OCR and vision AI with the performOCR() high-level API or the full VisionPipeline.

🇺🇸|EnglishTranslated

AI & Machine Learningdavidcastagnetoa/skills

easyocr

OCR alternativo a PaddleOCR, excelente en caracteres especiales y múltiples scripts

🇺🇸|EnglishTranslated

Document Processingtristanmanchester/agent-s...

extracting-mistral-ocr

Extracts text, tables, and images from PDFs (including scanned PDFs) using the Mistral OCR API. Use when user asks to OCR a PDF/image, extract text from a PDF, parse a scanned document, convert a PDF to Markdown, or extract structured fields from a document.

🇺🇸|EnglishTranslated

1 scripts/Attention

Data Processingtrpc-group/trpc-agent-go

ocr

Extract text from images using Tesseract OCR

🇺🇸|EnglishTranslated

2 scripts/Checked

Uncategorizedlyndonkl/claude

socratic-teaching-scaffolds

Use when teaching complex concepts (technical, scientific, philosophical), helping learners discover insights through guided questioning rather than direct explanation, correcting misconceptions by revealing contradictions, onboarding new team members through scaffolded learning, mentoring through problem-solving question frameworks, designing self-paced learning materials, or when user mentions "teach me", "help me understand", "explain like I'm", "learning path", "guided discovery", or "Socratic method".

🇺🇸|EnglishTranslated

AI & Machine Learninganthropics/claude-for-leg...

socratic-drill

Socratic drilling — it asks, you answer, it pushes back. Does NOT give you the answer until you've earned it. Use when the user says "drill me on", "quiz me", "socratic", "test me on [subject]", or wants to study actively.

🇺🇸|EnglishTranslated

AI & Machine Learningopenakita/openakita

openakita/skills@baidu-paddleocr-text

PaddleOCR text recognition skill using PP-OCRv5 lightweight model. Supports natural scene and complex document text detection and recognition. Use when user needs OCR text extraction from images.

🇺🇸|EnglishTranslated

1 scripts/Checked

Document Processingopenakita/openakita

openakita/skills@baidu-paddleocr-doc

PaddleOCR document parsing skill based on PaddleOCR-VL-1.5. Provides SOTA-level document understanding with ultra-high precision recognition and parsing. Use when user needs to parse, extract, or understand document content.

🇺🇸|EnglishTranslated

1 scripts/Checked

Document Processinganthropics/skills

pdf

Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables from PDFs, combining or merging multiple PDFs into one, splitting PDFs apart, rotating pages, adding watermarks, creating new PDFs, filling PDF forms, encrypting/decrypting PDFs, extracting images, and OCR on scanned PDFs to make them searchable. If the user mentions a .pdf file or asks to produce one, use this skill.

🇺🇸|EnglishTranslated

141.3k

8 scripts/Checked

AI & Machine Learningdavila7/claude-code-templ...

audiocraft-audio-generation

PyTorch library for audio generation including text-to-music (MusicGen) and text-to-sound (AudioGen). Use when you need to generate music from text descriptions, create sound effects, or perform melody-conditioned music generation.

🇺🇸|EnglishTranslated

AI & Machine Learningdontbesilent2025/dbskill

dbs-unblock

dontbesilent Execution Diagnosis. Diagnose the real reason behind your 'know what to do but fail to act' using the Adlerian psychology framework. Triggers: /dbs-unblock, /self-check, 'I know what to do but can't do it', 'why do I always procrastinate' Execution block diagnosis using Adlerian psychology framework. Trigger: /dbs-unblock, "I know what to do but can't do it", "why do I procrastinate"

🇨🇳|ChineseTranslated