Loading...
Loading...
Compare original and translation side by side
| Feature | Description |
|---|---|
| Accuracy | 2x better than Tesseract (0.97 vs 0.88) |
| GPU | PyTorch-based, CUDA optimized |
| Languages | 90+ including CJK |
| Layout | Document layout, table recognition |
| 功能 | 描述 |
|---|---|
| 准确率 | 比Tesseract高2倍(0.97 vs 0.88) |
| GPU支持 | 基于PyTorch,CUDA优化 |
| 支持语言 | 90余种,包括中日韩(CJK)语言 |
| 版面分析 | 文档版面、表格识别 |
undefinedundefinedundefinedundefinedundefinedundefinedundefinedundefinedfrom PIL import Image
from surya.recognition import RecognitionPredictor
from surya.detection import DetectionPredictor
from surya.foundation import FoundationPredictor
image = Image.open("document.png")
foundation_predictor = FoundationPredictor()
recognition_predictor = RecognitionPredictor(foundation_predictor)
detection_predictor = DetectionPredictor()
predictions = recognition_predictor([image], det_predictor=detection_predictor)
for page in predictions:
for line in page.text_lines:
print(line.text)from PIL import Image
from surya.recognition import RecognitionPredictor
from surya.detection import DetectionPredictor
from surya.foundation import FoundationPredictor
image = Image.open("document.png")
foundation_predictor = FoundationPredictor()
recognition_predictor = RecognitionPredictor(foundation_predictor)
detection_predictor = DetectionPredictor()
predictions = recognition_predictor([image], det_predictor=detection_predictor)
for page in predictions:
for line in page.text_lines:
print(line.text)| Variable | Default | Description |
|---|---|---|
| 512 | Reduce for lower VRAM |
| 36 | Reduce if OOM |
export RECOGNITION_BATCH_SIZE=256
surya_ocr image.png| 环境变量 | 默认值 | 描述 |
|---|---|---|
| 512 | 显存不足时可减小该值 |
| 36 | 出现OOM时减小该值 |
export RECOGNITION_BATCH_SIZE=256
surya_ocr image.png| Script | Description |
|---|---|
| Helper with OOM auto-retry, batch support |
| 脚本 | 描述 |
|---|---|
| 具备OOM自动重试、批量处理功能的辅助脚本 |