Activate multimodal captioning for attachments from an indexed source

  • Release version: Australia
  • Updated May 27, 2026
  • 2 minutes to read
  • Generate descriptive captions for images, tables, charts, and other visual elements in attachments from records in an indexed source table. You can find these attachments by searching for terms from their generated captions.

    Before you begin

    An administrator must have activated the Platform Multimodal Service plugin on your instance. For details on this procedure, see Activate the Platform Multimodal Service plugin.
    Note:
    An AI Search administrator can select the VLM (visual learning model) provider and model used by the Platform Multimodal Service. For details on this process, see Configure multimodal captioning for AI Search.

    Role required: ais_admin

    About this task

    By default, when indexing attachments for search, AI Search includes the attachment's text but ignores its images. For tables, charts, and other visual elements, indexing extracts text but doesn't necessarily preserve the context of that text.

    To improve search recall, you can configure the Platform Multimodal Service to automatically generate descriptive captions for images, tables, charts, and other visual elements in attachments found on records from an indexed source table. With the service activated, you can find indexed source content attachments with these entity types by searching for terms that match the generated captions.

    As an example, an attachment might include an image that yields the generated caption woman in office. Searches for woman or office can match the caption terms. Such searches return the attachment as a search result even if its text doesn't otherwise contain either of those terms.
    Note:
    Multimodal captioning is only supported for Knowledge [kb_knowledge] table records from the Knowledge Table indexed source and for content retrieved by external content connectors.

    The multimodal captioning feature has been validated and tested for English-language content. For content in other languages, it may function but has not been evaluated for caption quality or accuracy. Multimodal captioning for non-English content is not currently supported.

    Multimodal captioning runs as a separate process after text indexing is completed for the attachment in question. Records with attachments that contain images, tables, charts, and other visual elements won't show generated captions until multimodal captioning completes.
    Note:
    Activating multimodal captioning for attachments found in an indexed source has no effect if the index_attachments attribute is set to false for that indexed source. Make sure this attribute is set to true if you want to index attachments and generate captions for their content.

    Procedure

    1. Navigate to All > AI Search > AI Search Index > Indexed Sources.
    2. Open the indexed source that you want to activate multimodal captioning for.
      Note:
      Multimodal captioning is currently only supported for the Knowledge Table indexed source that indexes records on the Knowledge [kb_knowledge] table.
    3. In the Advanced Configuration related list, check for an existing index_mms_attachments attribute.
      • If an index_mms_attachments attribute already exists with value false, change its value to true. No further steps are needed.
      • If an index_mms_attachments attribute exists with value true, multimodal captioning is already enabled for the indexed source. No further steps are needed.
      • If no index_mms_attachments attribute exists, continue with the following steps.
    4. In the Advanced Configuration related list, select New.
    5. On the Indexed Source Attribute form, fill in the following fields.
      Field Value
      Attribute index_mms_attachments
      Value true
    6. Select Submit.

    Result

    The new index_mms_attachments indexed source attribute appears in the Advanced Configuration related list.

    What to do next

    To make the new indexed source attribute take effect, perform a full table reindex for the indexed source. For details on this procedure, see Perform a full table index or reindex for a single AI Search indexed source.