Exploring Document Intelligence
Document Intelligence uses artificial intelligence (AI) to help you classify documents and extract information from them quickly and accurately.
Overview
Many organizations today use simple optical character recognition (OCR) solutions to extract data from documents. These solutions require significant manual configuration and often need further manual adjustment as the documents evolve. Document Intelligence extends beyond simple OCR, using AI to identify, understand, and extract text and data from documents. This enables you to extract information accurately and automate document processing, even when the documents vary in text, formatting, and templates.
Document Intelligence workflow
With Document Intelligence (DocIntel), you can process single-page or multi-page documents in JPEG, PNG, or PDF format. You can process documents that contain typed text, such as forms, invoices, identity documents, and more.
The following diagram shows how document extraction works in Document Intelligence.
In this workflow:
- A document is uploaded for processing in a document task.
- DocIntel extracts the data from the document using OCR and AI models.
- The user provides input to validate or correct the DocIntel recommendations.
- The models are updated and trained to provide more accurate results.
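The feedback loop in this workflow can be illustrated with a minimal Python sketch. This is not the Document Intelligence API: every name below (`DocumentTask`, `FieldPrediction`, `extract`, `needs_review`, `validate`, `training_examples`) and the 0.8 review threshold are hypothetical, chosen only to show how user corrections on extracted fields could become feedback for later training.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class FieldPrediction:
    """One extracted field (hypothetical structure, not a product type)."""
    name: str
    value: str
    confidence: float  # assumed to range from 0.0 to 1.0


@dataclass
class DocumentTask:
    """A document submitted for processing (hypothetical structure)."""
    filename: str
    predictions: list[FieldPrediction] = field(default_factory=list)
    corrections: dict[str, str] = field(default_factory=dict)


def extract(task: DocumentTask, raw_fields: list[tuple[str, str, float]]) -> None:
    """Stand-in for the OCR and AI extraction step: stores supplied results."""
    task.predictions = [FieldPrediction(n, v, c) for n, v, c in raw_fields]


def needs_review(task: DocumentTask, threshold: float = 0.8) -> list[str]:
    """Return names of fields whose confidence is below the assumed threshold."""
    return [p.name for p in task.predictions if p.confidence < threshold]


def validate(task: DocumentTask, user_input: dict[str, str]) -> None:
    """Apply user input; only values that differ are recorded as corrections."""
    for p in task.predictions:
        if p.name in user_input and user_input[p.name] != p.value:
            task.corrections[p.name] = user_input[p.name]
            p.value = user_input[p.name]


def training_examples(task: DocumentTask) -> list[tuple[str, str, str]]:
    """Turn recorded corrections into (document, field, value) feedback rows."""
    return [(task.filename, n, v) for n, v in task.corrections.items()]


if __name__ == "__main__":
    task = DocumentTask("invoice_001.pdf")
    extract(task, [("invoice_number", "INV-1O0", 0.62), ("total", "450.00", 0.97)])
    print(needs_review(task))
    validate(task, {"invoice_number": "INV-100"})
    print(training_examples(task))
```

In this sketch, only the low-confidence field is flagged for review, and only the value the user actually changed is kept as a training example. A real deployment would decide these rules in its own configuration.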
Document Intelligence benefits
| Benefit | Feature | User |
|---|---|---|
| Start fast with a no-code setup that enables data extraction from many document types, including PDF and scanned paper documents. | | |
| Enable categorization for any type of document you define. | | |
| Automate intelligently with responsible, feedback-driven AI for continual learning. | | |
| Seamlessly integrate document processing steps into workflows. | | |
| Accelerate extraction of structured and semi-structured documents such as forms, invoices, IDs, and more. | | |
| Accelerate classification of single and multi-page documents. | Classify documents using the Document Intelligence workspace | |