FYP demonstration of document parsing of booking documents
Parse documents to extract structured information
Search documents using semantic queries
Extract text from documents
Process documents and answer queries
Compare different Embeddings
Extract handwritten text from images
Process text to extract entities and details
Extract text from PDF and answer questions
Identify and extract key entities from text
Find relevant passages in documents using semantic search
Query deep learning documents to get answers
Spirit.AI
Donut-booking-gradio is a Gradio-based tool designed to extract text from booking documents. It serves as a demonstration for a final-year project (FYP), focusing on parsing scanned booking documents to retrieve relevant information. The tool provides an intuitive interface for users to upload their documents and extract text efficiently.
• Text Extraction: Extract readable text from scanned booking documents, including receipts, invoices, and tickets. • User-Friendly Interface: Built with Gradio, offering a simple and interactive interface for document processing. • Scanned Document Handling: Capable of processing scanned documents and extracting text even from low-quality images. • Customizable: Allows for configuration to focus on specific data points within documents. • Real-Time Extraction: Provides immediate results after document upload, making it a convenient choice for quick text extraction.
pip install donut-booking-gradio to install the tool.donut run to start the Gradio interface.What formats does donut-booking-gradio support?
Donut-booking-gradio supports scanned documents in formats like PDF, JPG, PNG, and BMP. For best results, use high-quality scans or images.
Can donut-booking-gradio handle poor-quality scans?
While it can process low-quality scans, best results are achieved with clear, readable documents. If text is illegible in the scan, accuracy may be reduced.
Is my data kept private when using donut-booking-gradio?
Yes, donut-booking-gradio processes documents locally or on your server, depending on your setup. No data is uploaded to external servers unless explicitly configured.