Loading...
Loading...
[Document Processing] Convert PDF files to Markdown with support for native text PDFs and scanned documents (OCR). Cross-platform.
npx skill4agent add andreadellacorte/groove pdf-to-markdown[IMPORTANT] Useto break ALL work into small tasks BEFORE starting — including tasks for each file read. This prevents context loss from long files. For simple tasks, AI MUST ask user whether to skip.TaskCreate
scripts/convert.cjs--mode autotesseract.jsnode_modules@opendocsg/pdf2mdnpm installnpx# Basic conversion (auto-detect native vs scanned)
node .agents/skills/pdf-to-markdown/scripts/convert.cjs --input ./document.pdf
# Specify output path
node .agents/skills/pdf-to-markdown/scripts/convert.cjs -i ./doc.pdf -o ./output.md
# Force native mode (skip OCR detection)
node .agents/skills/pdf-to-markdown/scripts/convert.cjs -i ./doc.pdf --mode native| Option | Short | Description | Default |
|---|---|---|---|
| | Input PDF file path | (required) |
| | Output markdown file path | |
| | Conversion mode: | |
| | Show help message |
{
"success": true,
"input": "/path/to/input.pdf",
"output": "/path/to/output.md",
"stats": {
"pages": 5,
"mode": "native"
}
}cd .agents/skills/pdf-to-markdown
npm install tesseract.js pdfjs-dist canvascanvas