Loading...
Loading...
Use this skill when users need to extract text from images, PDFs, or documents. Supports URLs and local files. Returns structured JSON containing recognized text.
npx skill4agent add aidenwu0209/paddleocr-skills paddleocr-text-recognitionpython scripts/ocr_caller.py--file-url--file-path--file-pathpython scripts/ocr_caller.py --file-url "URL provided by user" --prettypython scripts/ocr_caller.py --file-path "file path" --prettypython scripts/ocr_caller.py --file-url "URL" --output result.json --prettyoktruefalsetextokerror.messagetexttextI've extracted the text from the image. Here's the complete content:
[Display the entire text here]I found some text in the image. Here's a preview:
"The quick brown fox..." (truncated)python scripts/ocr_caller.py --file-url "https://example.com/invoice.jpg" --prettypython scripts/ocr_caller.py --file-path "./document.pdf" --pretty{
"ok": true,
"text": "All recognized text here...",
"result": { ... },
"error": null
}oktruefalsetextresulterrorokCONFIG_ERROR: PADDLEOCR_OCR_API_URL not configured. Get your API at: https://paddleocr.comPlease visit the URL above to get your API_URL and TOKEN.
Once you have them, send them to me and I'll configure it automatically.API_URL=https://xxx.paddleocr.com/ocr, TOKEN=abc123...Here's my API: https://xxx and token: abc123python scripts/configure.py --api-url "PARSED_URL" --token "PARSED_TOKEN"API_ERROR: Authentication failed (403). Check your token.API_ERROR: API rate limit exceeded (429)textreferences/output_schema.mdreferences/provider_api.mdNote: Model version and capabilities are determined by your API endpoint (PADDLEOCR_OCR_API_URL).
python scripts/smoke_test.py