Exploring Document Intelligence

Washington DC Enable AI

Release version: Washington DC

Updated February 1, 2024

Document Intelligence helps you to quickly and accurately classify and extract information from documents using artificial intelligence (AI).

Overview

Many organizations today use simple optical character recognition (OCR) solutions to extract data from documents. These solutions require significant manual configuration and often need manual adjustment as the documents evolve. Document Intelligence extends beyond simple OCR, using AI to identify, understand, and extract text and data from documents. As a result, you can extract information accurately and automate document processing, even when the documents vary in text, formatting, and templates.

Document Intelligence workflow

With Document Intelligence (DocIntel), you can process single-page or multi-page documents in JPEG, PNG, or PDF format. The documents must contain typed text. Examples include forms, invoices, and identity documents.
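
As an illustration of the supported formats listed above, the following minimal sketch checks whether a file's leading bytes match the standard JPEG, PNG, or PDF signatures before the file is submitted for processing. The `detect_format` helper is hypothetical and is not part of Document Intelligence; it only shows one way a caller might pre-screen files.

```python
from typing import Optional

# Standard leading "magic" bytes for the three formats named above.
_SIGNATURES = {
    "JPEG": b"\xff\xd8\xff",
    "PNG": b"\x89PNG\r\n\x1a\n",
    "PDF": b"%PDF-",
}


def detect_format(data: bytes) -> Optional[str]:
    """Return 'JPEG', 'PNG', or 'PDF' if data starts with a known signature, else None.

    Hypothetical pre-screening helper; not a Document Intelligence API.
    """
    for name, signature in _SIGNATURES.items():
        if data.startswith(signature):
            return name
    return None
```

A signature check like this only identifies the container format. It does not confirm that the document contains typed text.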

The following diagram shows how document extraction works in Document Intelligence.

Figure 1. Document Intelligence flow (diagram showing how Document Intelligence activities train the AI models)

In this workflow:

1. A document is uploaded for processing in a document task.
2. DocIntel extracts the data from the document using OCR and AI models.
3. The user provides input to validate or correct the DocIntel recommendations.
4. The models are updated and trained to provide more accurate results.
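
The four workflow steps above form a feedback loop, which can be sketched as a toy model. Everything in this sketch is an assumption made for illustration: the `DocumentTask` class, the `extract`, `review`, and `retrain` functions, and the dictionary standing in for a model are hypothetical and do not reflect how Document Intelligence is implemented.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class DocumentTask:
    """Hypothetical stand-in for a document task; not the product's data model."""
    file_name: str
    recommended: Dict[str, str] = field(default_factory=dict)  # values proposed by the model
    validated: Dict[str, str] = field(default_factory=dict)    # values after user review


def extract(task: DocumentTask, model: Dict[str, str]) -> None:
    """Step 2: the toy 'model' simply proposes its stored value for each field."""
    task.recommended = dict(model)


def review(task: DocumentTask, corrections: Dict[str, str]) -> List[Tuple[str, str]]:
    """Step 3: accept the recommendations, apply user corrections, return the changes."""
    task.validated = {**task.recommended, **corrections}
    return [
        (name, value)
        for name, value in corrections.items()
        if task.recommended.get(name) != value
    ]


def retrain(model: Dict[str, str], feedback: List[Tuple[str, str]]) -> None:
    """Step 4: fold the user's corrections back into the toy model."""
    for name, value in feedback:
        model[name] = value


model = {"invoice_number": "INV-001", "total": "100.00"}
task = DocumentTask("invoice.pdf")            # Step 1: document uploaded into a task
extract(task, model)                          # Step 2: extraction
feedback = review(task, {"total": "108.00"})  # Step 3: user corrects one field
retrain(model, feedback)                      # Step 4: model updated from feedback
```

The point of the sketch is the direction of data flow: only the fields the user changed are returned as feedback, and that feedback is what updates the model for later documents.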

Document Intelligence benefits

Figure 2. Benefits of Document Intelligence (diagram showing the phased approach to automation using Document Intelligence)

Document Intelligence provides the following benefits.


Benefit	Feature	User
Start fast with a no-code setup that enables data extraction from many document types, including PDF and scanned paper documents.	Set up document extraction use cases	DocIntel Admin [sn_docintel.admin], DocIntel Manager [sn_docintel.manager]
Enable categorization for any type of document you define.	Set up document classification use cases	DocIntel Admin [sn_docintel.admin], DocIntel Manager [sn_docintel.manager]
Automate intelligently with responsible, feedback-driven AI for continual learning.	Configure data extraction modes	DocIntel Admin [sn_docintel.admin], DocIntel Manager [sn_docintel.manager]
Seamlessly integrate document processing steps into workflows.	Integrating Document Intelligence with other applications	DocIntel Admin [sn_docintel.admin], DocIntel Manager [sn_docintel.manager]
Accelerate extraction of structured and semi-structured documents such as forms, invoices, and IDs.	Extract fields using the Document Intelligence workspace	DocIntel Creation Agent [sn_docintel.creation_agent], DocIntel Extraction Agent [sn_docintel.extraction_agent]
Accelerate classification of single-page and multi-page documents.	Classify documents using the Document Intelligence workspace	DocIntel Creation Agent [sn_docintel.creation_agent], DocIntel Extraction Agent [sn_docintel.extraction_agent]
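
The feature-to-role pairing in the table above can also be expressed as a lookup, which may help when planning role assignments. This is a minimal sketch under stated assumptions: the role identifiers and feature names come from the table, but the `FEATURE_ROLES` mapping, the `can_use` helper, and the rule that any one listed role is sufficient are illustrative assumptions, not a description of how the platform enforces access.

```python
from typing import Dict, FrozenSet, Set

# Role identifiers as listed in the table above.
ADMIN_ROLES = frozenset({"sn_docintel.admin", "sn_docintel.manager"})
AGENT_ROLES = frozenset({"sn_docintel.creation_agent", "sn_docintel.extraction_agent"})

# Hypothetical mapping of each feature to the roles the table pairs it with.
FEATURE_ROLES: Dict[str, FrozenSet[str]] = {
    "Set up document extraction use cases": ADMIN_ROLES,
    "Set up document classification use cases": ADMIN_ROLES,
    "Configure data extraction modes": ADMIN_ROLES,
    "Integrating Document Intelligence with other applications": ADMIN_ROLES,
    "Extract fields using the Document Intelligence workspace": AGENT_ROLES,
    "Classify documents using the Document Intelligence workspace": AGENT_ROLES,
}


def can_use(feature: str, user_roles: Set[str]) -> bool:
    """Return True if the user holds at least one role paired with the feature.

    Assumption: holding any one listed role is enough. Unknown features return False.
    """
    return bool(FEATURE_ROLES.get(feature, frozenset()) & user_roles)
```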