Generate OCR text and extract named entities from images
Extract and search text from images
Convert images to text
Extract and overlay text on PDFs
Display OCRBench leaderboard for model evaluations
Surya OCR
Demo of GOT-OCR 2.0's Transformers implementation
Convert images of text into editable text
Extract text and search keywords from images
Extract text from PDFs
Convert images of text into digital text
Extract text from a PDF file
Theatre Programmer is an optical character recognition (OCR) tool that extracts text and named entities from images. It is particularly useful for theatre-related documents such as playbills, scripts, and historical programmes, which it converts into readable, searchable digital text.
• OCR Text Generation: Converts images of text into editable digital text.
• Named Entity Extraction: Identifies and extracts names, dates, and locations from the text.
• Multi-Format Support: Processes images in formats like JPEG, PNG, and TIFF.
• Optimized for Theatre Texts: Tailored to playbills, scripts, and historical documents.
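To make the named entity extraction feature above more concrete, here is a minimal sketch of pulling dates and performer names out of text that has already been OCR'd. It is not Theatre Programmer's actual method, which is not documented here. The function names `extract_dates` and `extract_performers`, the sample playbill text, and the regular expressions are all assumptions made for illustration, and a real tool would more likely use a trained entity recognition model.

```python
import re

# Hypothetical patterns for illustration only; not taken from Theatre Programmer.
MONTHS = (
    "January|February|March|April|May|June|July|August|"
    "September|October|November|December"
)

# Matches dates such as "12 March 1895" or "12th March 1895".
DATE_RE = re.compile(rf"\b(\d{{1,2}})(?:st|nd|rd|th)?\s+({MONTHS})\s+(\d{{4}})\b")

# Matches names introduced by a courtesy title, as often printed on playbills.
PERFORMER_RE = re.compile(
    r"\b(?:Mr|Mrs|Miss|Madame)\.?\s+[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*"
)


def extract_dates(text: str) -> list[str]:
    """Return dates found in the text, normalized to 'day month year'."""
    return [" ".join(match.groups()) for match in DATE_RE.finditer(text)]


def extract_performers(text: str) -> list[str]:
    """Return title-prefixed names exactly as they appear in the text."""
    return PERFORMER_RE.findall(text)


# Assumed sample of OCR output from a playbill.
sample = (
    "THEATRE ROYAL, Drury Lane. Monday, 12th March 1895. "
    "Mr. Henry Irving as Hamlet; Miss Ellen Terry as Ophelia."
)

dates = extract_dates(sample)
performers = extract_performers(sample)
print(dates)
print(performers)
```

Rule-based patterns like these are easy to inspect but brittle: they miss names without a title and dates in other layouts, and OCR errors in a month name will break a match.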
What formats does Theatre Programmer support?
Theatre Programmer supports JPEG, PNG, and TIFF image formats for processing.
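If you want to check a file before submitting it, a small sketch like the following can identify the three supported formats from the first bytes of the file rather than trusting the extension. The function name `sniff_image_format` is hypothetical and this is not part of Theatre Programmer; the byte signatures are the standard ones for JPEG, PNG, and TIFF.

```python
from typing import Optional


def sniff_image_format(header: bytes) -> Optional[str]:
    """Guess the image format from the leading bytes of a file.

    Returns "JPEG", "PNG", or "TIFF", or None if the signature is not recognized.
    """
    if header.startswith(b"\xff\xd8\xff"):
        return "JPEG"
    if header.startswith(b"\x89PNG\r\n\x1a\n"):
        return "PNG"
    # TIFF files start with a byte-order mark followed by the number 42.
    if header[:4] in (b"II*\x00", b"MM\x00*"):
        return "TIFF"
    return None


def sniff_file(path: str) -> Optional[str]:
    """Read the first 8 bytes of a file and guess its image format."""
    with open(path, "rb") as handle:
        return sniff_image_format(handle.read(8))
```

For example, `sniff_file("playbill.tif")` would return `"TIFF"` for a valid TIFF scan, where `playbill.tif` is a placeholder path.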
How accurate is the OCR?
Accuracy depends on image quality. Clear, well-lit images with legible text yield the best results.
Can Theatre Programmer handle handwritten text?
No. Theatre Programmer is optimized for printed text and may perform poorly on handwritten content.