ocr
Extract text from images using OCR. Use when the user needs to read text from screenshots, photos, or image files.
Source: xyuanbuilds/my_skills
NPX Install
npx skill4agent add xyuanbuilds/my_skills ocr
SKILL.md Content
IMPORTANT - Path Resolution:
This skill can be installed in different locations (plugin system, manual installation, global, or project-specific). Before executing any commands, determine the skill directory from the location of the SKILL.md file you loaded, and use that path in all commands below. Replace $SKILL_DIR with the actual discovered path.
OCR
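The path resolution described above could be sketched in shell as follows. This is a minimal illustration, not part of the skill: `resolve_skill_dir` is a hypothetical helper, and it assumes you already know the path of the SKILL.md file that was loaded.

```bash
#!/usr/bin/env bash
# Hypothetical helper (not shipped with the skill): derive the skill
# directory from the location of the SKILL.md file that was loaded.
resolve_skill_dir() {
  local skill_md_path="$1"
  # Run in a subshell so the caller's working directory is untouched,
  # and print the absolute directory that contains SKILL.md.
  (cd "$(dirname "$skill_md_path")" && pwd)
}

# Example usage (the path is a placeholder, replace it with the real one):
# SKILL_DIR="$(resolve_skill_dir /path/to/ocr/SKILL.md)"
# cd "$SKILL_DIR/scripts"
```

Resolving to an absolute path once means the later `cd $SKILL_DIR/scripts` commands behave the same regardless of the directory they are run from.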
Run the OCR CLI directly via Node.js:
Setup (First Time)
bash
cd $SKILL_DIR/scripts
npm run setup
Execution Pattern
bash
cd $SKILL_DIR/scripts
# OCR from file
node cli.js <image-path>
# OCR from file with language
node cli.js <image-path> --lang <lang>
# OCR from clipboard (macOS)
node cli.js --clipboard
# OCR from clipboard with language
node cli.js --clipboard --lang <lang>
Examples
bash
cd $SKILL_DIR/scripts
node cli.js ./screenshot.png
node cli.js /path/to/image.jpg --lang chi_tra
node cli.js --clipboard --lang jpn
Supported Languages
- eng - English
- chi_sim - Simplified Chinese
- chi_tra - Traditional Chinese
- jpn - Japanese
- kor - Korean
- Combine with +: eng+chi_sim+jpn
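When the language codes come from a list rather than being typed by hand, the `+` combination can be built with a small shell function. This is a sketch only: `join_langs` is a hypothetical name and is not provided by the skill.

```bash
#!/usr/bin/env bash
# Hypothetical helper: join language codes with "+" for the --lang option.
join_langs() {
  # Inside "$*", bash joins the arguments with the first character of IFS.
  local IFS='+'
  printf '%s\n' "$*"
}

# Example usage:
# join_langs eng chi_sim jpn        # prints eng+chi_sim+jpn
# node cli.js ./screenshot.png --lang "$(join_langs eng chi_sim jpn)"
```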
Supported Formats
PNG, JPG, JPEG, GIF, BMP, WebP, TIFF
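A caller may want to reject unsupported files before invoking the CLI. The sketch below checks only the file extension against the list above. `is_supported_image` is a hypothetical helper, and treating extensions case-insensitively is an assumption rather than documented CLI behavior.

```bash
#!/usr/bin/env bash
# Hypothetical pre-check: succeed only if the file name ends in one of the
# extensions listed above. It inspects the name only, not the file contents.
is_supported_image() {
  # Names without a dot have no extension to check.
  case "$1" in
    *.*) ;;
    *) return 1 ;;
  esac
  local ext="${1##*.}"
  # Lowercase the extension so "PHOTO.JPG" is accepted too (assumption).
  ext="$(printf '%s' "$ext" | tr '[:upper:]' '[:lower:]')"
  case "$ext" in
    png|jpg|jpeg|gif|bmp|webp|tiff) return 0 ;;
    *) return 1 ;;
  esac
}

# Example usage:
# is_supported_image ./screenshot.png && node cli.js ./screenshot.png
```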