Exploring Document Intelligence

  • Release version: Washingtondc
  • Updated February 1, 2024
  • 1 minute to read
  • Document Intelligence helps you to quickly and accurately classify and extract information from documents using artificial intelligence (AI).

    Overview

    Many organizations today use simple optical character recognition (OCR) solutions to extract data from documents. This requires significant manual configuration and often requires manual adjustment as the documents evolve. Document Intelligence extends beyond simple OCR, using AI to identify, understand, and extract text and data from documents. This enables you to accurately extract information to automate document processing, even when the documents have varied text, formatting, and templates.

    Document Intelligence workflow

    With Document Intelligence (DocIntel) you can process single or multi-page documents in JPEG, PNG, or PDF formats. You can process documents that contain typed text such as forms, invoices, identity documents, and more.

    The following diagram shows how document extraction works in Document Intelligence.

    Figure 1. Document Intelligence flow
    Diagram showing how Document Intelligence activities train the AI models.

    In this workflow:

    1. A document is uploaded for processing in a document task.
    2. DocIntel extracts the data from the document using OCR and AI models.
    3. The user provides input to validate or correct the DocIntel recommendations.
    4. The models are updated and trained to provide more accurate results.

    Document Intelligence benefits

    Figure 2. Benefits of Document Intelligence
    Diagram showing the phased approach to automation using Document Intelligence.
    Document Intelligence provides the following benefits.
    Benefit Feature User
    Start fast with a no-code set-up that enables data extraction from many document types including PDF and scanned paper documents​.

    Set up document extraction use cases

    DocIntel Admin [sn_docintel.admin]

    DocIntel Manager [sn_docintel.manager]

    Enable categorization for any type of document you define.

    Set up document classification use cases

    DocIntel Admin [sn_docintel.admin]

    DocIntel Manager [sn_docintel.manager]

    Automate intelligently with responsible, feedback-driven AI for continual learning.

    Configure data extraction modes

    DocIntel Admin [sn_docintel.admin]

    DocIntel Manager [sn_docintel.manager]

    Seamlessly integrate document processing steps into workflows​.

    Integrating Document Intelligence with other applications

    DocIntel Admin [sn_docintel.admin]

    DocIntel Manager [sn_docintel.manager]

    Accelerate extraction of structured and semi-structured documents such as forms, invoices, IDs, and more​.

    Extract fields using the Document Intelligence workspace

    DocIntel Creation Agent [sn_docintel.creation_agent]

    DocIntel Extraction Agent [sn_docintel.extraction_agent]

    Accelerate classification of single and multi-page documents.

    Classify documents using the Document Intelligence workspace

    DocIntel Creation Agent [sn_docintel.creation_agent]

    DocIntel Extraction Agent [sn_docintel.extraction_agent]