Extracts text, tables, and images from PDFs (including scanned PDFs) using the Mistral OCR API. Use when the user asks to OCR a PDF or image, extract text from a PDF, parse a scanned document, convert a PDF to Markdown, or extract structured fields from a document.
npx skill4agent add tristanmanchester/agent-skills extracting-mistral-ocr

python {baseDir}/scripts/mistral_ocr_extract.py --input path/to/file.pdf --out out/ocr

combined.md
pages/page-000.md
raw_response.json
images/
tables/

file_id
document_url
table_format=inline
--include-image-base64
--extract-header/--extract-footer
scripts/mistral_ocr_extract.py
document_annotation

python {baseDir}/scripts/mistral_ocr_extract.py \
  --input invoice.pdf \
  --out out/invoice \
  --annotation-prompt "Extract supplier_name, invoice_number, invoice_date (ISO-8601), currency, total_amount. Return JSON." \
  --annotation-format json_object

document_url
table_format=html
MISTRAL_API_KEY
--pages
references/mistral_ocr_api.md
references/output_mapping.md
references/annotation_prompts.md
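
The invoice command above can also be assembled programmatically, which avoids shell-quoting mistakes in a long annotation prompt. The following is a minimal sketch, not part of the skill: the helper name build_ocr_command is hypothetical, and it uses only the flags that appear in the commands above (--input, --out, --annotation-prompt, --annotation-format). It only builds and prints the command; it does not call the API.

```python
import shlex
from typing import List, Optional


def build_ocr_command(
    script_path: str,
    input_path: str,
    out_dir: str,
    annotation_prompt: Optional[str] = None,
    annotation_format: Optional[str] = None,
) -> List[str]:
    """Build an argv list for the extraction script.

    Hypothetical helper for illustration; only flags shown in the
    commands above are used.
    """
    argv = ["python", script_path, "--input", input_path, "--out", out_dir]
    # The annotation flags are optional, matching the plain-OCR command
    # that passes only --input and --out.
    if annotation_prompt is not None:
        argv += ["--annotation-prompt", annotation_prompt]
    if annotation_format is not None:
        argv += ["--annotation-format", annotation_format]
    return argv


invoice_cmd = build_ocr_command(
    "scripts/mistral_ocr_extract.py",
    "invoice.pdf",
    "out/invoice",
    annotation_prompt=(
        "Extract supplier_name, invoice_number, invoice_date (ISO-8601), "
        "currency, total_amount. Return JSON."
    ),
    annotation_format="json_object",
)

# shlex.join quotes each argument so the printed line is safe to paste
# into a POSIX shell.
print(shlex.join(invoice_cmd))
```

Passing the list to subprocess.run(invoice_cmd, check=True) would execute it. That step presumably needs MISTRAL_API_KEY set in the environment, which is an assumption based on the variable being named above.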