Configure crawl settings for a ManageEngine external content connector

  • リリースバージョン: Australia
  • 更新日 2026年03月12日
  • 所要時間:12分
  • Delimit the set of knowledge articles you want your ManageEngine external content connector to retrieve. Define inclusion or exclusion filters to specify which articles the crawl retrieves and feeds to AI Search for indexing.

    始める前に

    A connector admin must have already created the ManageEngine external content connector that you want to configure crawl settings for. To learn about this procedure, see Create a ManageEngine external content connector.

    Role required: sn_ext_conn.xcc_admin

    このタスクについて

    This task is optional. By default, the ManageEngine external content connector crawls all knowledge bases from its specified source system and sends all public-domain knowledge articles and attachment binary files with all supported file extensions to AI Search for indexing. You only need to perform this task if you want the connector to use any of the following non-default settings:
    • Filter knowledge articles by ID, author, modification date, or category when running content crawls
    • Filter attachment binary files by extension when running content crawls

    Content is only retrieved from the source system if it passes all of your configured crawl setting filters. If any crawl setting filter excludes a content item, the external content connector doesn't retrieve it.

    重要:

    By default, each external content connector can index up to ten million (10,000,000) content items from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends content item deletions and updates to AI Search for indexing, ignoring new content items. The connector logs an error message for every 10,000 content items it crawls beyond the indexing limit.

    When a connector's indexed content item count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.

    External content connectors that support user permissions crawls can retrieve up to five hundred thousand (500,000) users.

    If one of your connectors reaches the content indexing limit, you can update its crawl settings and file inclusion/exclusion filters to reduce the number of content items it retrieves. Alternately, if you need a connector to index more than 10,000,000 content items or to retrieve more than 500,000 users, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.

    手順

    1. Navigate to All > External Content Connectors > External Content Admin Home.
    2. In the Connectors list, select the record for the ManageEngine external content connector whose settings you want to modify.
    3. In the connector editor's Settings tab, select Crawl settings.
    4. In the Articles section, select one of the following article options:
      • To retrieve all knowledge articles found while crawling, select Crawl all articles.
      • To retrieve only a specified set of knowledge articles, select Include only these articles, then use the Add article URL or ID to include field and Add button to enter URLs or IDs for knowledge articles that you want the connector to include when crawling.

        As an example, you might enter IT FAQ 2026 and Troubleshooting to retrieve only knowledge articles with the specified IDs.

      • To retrieve all but a specified set of knowledge articles, select Exclude only these articles, then use the Add article URL or ID to exclude field and Add button to enter URLs or IDs for knowledge articles that you want the connector to exclude when crawling.

        As an example, you might enter hfuzf762gh173 to exclude the knowledge article with the specified ID.

      注:
      Article URLs and IDs can be specified as prefixes, with the wildcard character * at the end of the URL or ID matching any string. As an example, specifying article ID Troubleshoot* includes or excludes articles with IDs Troubleshooting and Troubleshooting guide.
    5. In the Articles section, select one of the following author options:
      • To retrieve knowledge articles from all authors, select Crawl all authors.
      • To retrieve only knowledge articles from a specified set of authors, select Include only these authors, then use the Add author name to include field and Add button to enter names for authors whose knowledge articles you want the connector to include when crawling.

        As an example, you might enter Abel Tuter and Beth Anglin to retrieve only knowledge articles by the specified authors.

      • To retrieve knowledge articles from all but a specified set of authors, select Exclude only these authors, then use the Add author name to exclude field and Add button to enter names for authors whose knowledge articles you want the connector to exclude when crawling.

        As an example, you might enter Test user to exclude knowledge articles from the specified author.

    6. In the Articles section, select one of the following modification date options:
      • To retrieve knowledge articles modified within the last year, select Modified last year.
      • To retrieve knowledge articles modified within the last quarter, select Modified last quarter.
      • To retrieve knowledge articles modified within the last month, select Modified last month.
      • To retrieve knowledge articles modified within the last week, select Modified last week.
    7. In the Articles section, select one of the following category options:
      • To retrieve knowledge articles from all categories, select Crawl all categories.
      • To retrieve only knowledge articles from a specified set of categories, select Include only these categories, then use the Add category name to include field and Add button to enter names for categories you want the connector to include when crawling.

        As an example, you might enter Guide and Wiki to only retrieve knowledge articles in the specified categories.

      • To retrieve knowledge articles from all but a specified set of categories, select Exclude only these categories, then use the Add category name to exclude field and Add button to enter names for categories you want the connector to exclude when crawling.

        As an example, you might enter Confidential to exclude knowledge articles from this category.

    8. Select one of the following Attachments options:
      • To retrieve all attachments with supported file extensions from the source system, select Crawl all attachments.
      • To retrieve only attachments with specified file extensions from the source system, select Include only these file extensions, then use the File extensions to include field to enter attachment file extensions you want the connector to include when crawling.

        As an example, you might enter .docx to retrieve only attachments with the Microsoft Word file format.

      • To retrieve all attachments except those with specified file extensions from the source system, select Exclude only these file extensions, then use the File extensions to exclude field to enter attachment file extensions you want the connector to exclude when crawling.

        As an example, you might enter .csv to exclude attachments with the Comma-Separated Values (CSV) file format.

      For details on the supported attachment file extensions, see Binary file extensions supported in External Content Connectors.
    9. Select Save and validate.

    タスクの結果

    The ManageEngine external content connector is updated with your modified crawl settings.

    次のタスク

    To retrieve content from your ManageEngine source system using your modified crawl settings, create and run a one-time content crawl for your ManageEngine external content connector. To learn about creating and running one-time content crawls, see Create a content crawl for an external content connector.