Compare OCR results from images
Read text from captcha images
Surya OCR
Extract Japanese text from images
Extract text from receipts for easy expense management
Give it a pdf and it'll extract the text
Correct skew and detect text lines in PDFs or images
Extract and translate text from images
NepaliOCR
Extract text from images
Generate text from images
Extract text from single-line Kurdish images
Extract text from images and search for keywords
Streamlit OCR Comparator is a web-based application for comparing OCR (Optical Character Recognition) results from different engines. It provides a user-friendly interface to upload images, extract text with multiple OCR engines, and compare the results to find the most accurate output.
• Multi-Engine Support: Compare text extraction results from various OCR engines in one place.
• Image Upload: Directly upload images from your local filesystem or provide URLs.
• Result Comparison: Side-by-side comparison of extracted text to identify differences.
• Accuracy Analysis: Highlight mismatches and evaluate the performance of each OCR engine.
• Customizable Settings: Fine-tune OCR parameters like language, DPI, and layout analysis.
• Export Results: Download comparison reports for further analysis.
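The side-by-side comparison and mismatch highlighting described above can be sketched with Python's standard difflib module. This is a minimal illustration, not the app's actual implementation: the function names compare_outputs and word_mismatches are hypothetical, and the engine outputs are plain strings standing in for real OCR results.

```python
import difflib


def compare_outputs(reference, outputs):
    """Score each engine's text against a reference transcription.

    `outputs` maps a (hypothetical) engine name to the text it extracted.
    Returns a dict of engine name -> similarity ratio between 0.0 and 1.0.
    """
    scores = {}
    for engine, text in outputs.items():
        matcher = difflib.SequenceMatcher(None, reference, text)
        scores[engine] = matcher.ratio()
    return scores


def word_mismatches(text_a, text_b):
    """Return the word ranges where two OCR outputs disagree.

    Each entry is (operation, words_from_a, words_from_b), where the
    operation is 'replace', 'delete', or 'insert'.
    """
    words_a, words_b = text_a.split(), text_b.split()
    matcher = difflib.SequenceMatcher(None, words_a, words_b)
    return [
        (tag, words_a[i1:i2], words_b[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]


if __name__ == "__main__":
    # Made-up sample strings; real input would come from the OCR engines.
    reference = "Total 42.00 USD"
    outputs = {
        "engine_a": "Total 42.00 USD",
        "engine_b": "Total 42.OO USD",
    }
    print(compare_outputs(reference, outputs))
    print(word_mismatches(outputs["engine_a"], outputs["engine_b"]))
```

A character-level ratio gives a rough accuracy score when a trusted reference transcription exists, while the word-level opcodes are a convenient basis for highlighting the exact places where two engines disagree.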
To install, run pip install streamlit-ocr-comparator in your terminal.
To launch the app, run streamlit run ocr_comparator.py.
What OCR engines are supported?
Streamlit OCR Comparator supports multiple engines, including Tesseract, Google Vision API, and Microsoft Azure Computer Vision.
How do I install the app?
Installation is straightforward. Run pip install streamlit-ocr-comparator in your terminal, and the app will be ready to use.
Can I customize the OCR settings?
Yes, users can customize settings like language, DPI, and layout analysis for each OCR engine to optimize results.
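As a rough illustration of what these settings correspond to for one engine, the sketch below builds a Tesseract command line from a language, a DPI value, and a page segmentation mode (Tesseract's layout analysis setting). The function name build_tesseract_args and the default values are assumptions made for this example; how the app itself passes settings to each engine is not documented here. The flags -l, --dpi, and --psm are standard Tesseract command-line options.

```python
def build_tesseract_args(image_path, language="eng", dpi=300, psm=3):
    """Build an argument list for the tesseract CLI (hypothetical helper).

    language: Tesseract language code, e.g. "eng" or "jpn"
    dpi:      resolution hint for the input image
    psm:      page segmentation mode (0-13), which controls layout analysis
    """
    if not 0 <= psm <= 13:
        raise ValueError("psm must be between 0 and 13")
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    return [
        "tesseract", image_path, "stdout",  # write recognized text to stdout
        "-l", language,
        "--dpi", str(dpi),
        "--psm", str(psm),
    ]


if __name__ == "__main__":
    # Example: treat the image as a single uniform block of text (psm 6).
    print(" ".join(build_tesseract_args("receipt.png", language="eng", dpi=300, psm=6)))
```

The resulting list can be handed to subprocess.run on a machine where Tesseract is installed. Cloud engines such as Google Vision API or Azure Computer Vision expose their own, different options.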