FYP demonstration of document parsing of booking documents
Search information in uploaded PDFs
Process and extract text from receipts
Spirit.AI
Employs Mistral OCR for transcribing historical data
Extract text from document images
OCR for Arabic Language with QR code and Barcode Detection
Extract PDFs and chat to get insights
Upload and analyze documents for text extraction and Q&A
Gemma-3 OCR App
Find relevant text chunks from documents based on queries
Extract text from images using OCR
Find information using text queries
Donut-booking-gradio is a Gradio-based tool designed to extract text from booking documents. It serves as a demonstration for a final-year project (FYP), focusing on parsing scanned booking documents to retrieve relevant information. The tool provides an intuitive interface for users to upload their documents and extract text efficiently.
• Text Extraction: Extract readable text from scanned booking documents, including receipts, invoices, and tickets. • User-Friendly Interface: Built with Gradio, offering a simple and interactive interface for document processing. • Scanned Document Handling: Capable of processing scanned documents and extracting text even from low-quality images. • Customizable: Allows for configuration to focus on specific data points within documents. • Real-Time Extraction: Provides immediate results after document upload, making it a convenient choice for quick text extraction.
pip install donut-booking-gradio to install the tool.donut run to start the Gradio interface.What formats does donut-booking-gradio support?
Donut-booking-gradio supports scanned documents in formats like PDF, JPG, PNG, and BMP. For best results, use high-quality scans or images.
Can donut-booking-gradio handle poor-quality scans?
While it can process low-quality scans, best results are achieved with clear, readable documents. If text is illegible in the scan, accuracy may be reduced.
Is my data kept private when using donut-booking-gradio?
Yes, donut-booking-gradio processes documents locally or on your server, depending on your setup. No data is uploaded to external servers unless explicitly configured.