Configure crawl settings for a Lucidchart external content connector
Specify the folders you want your Lucidchart external content connector to crawl. Define inclusion or exclusion filters to dictate the types of content the crawl retrieves and feeds to AI Search for indexing.
Before you begin
A connector administrator must have already created the Lucidchart external content connector that you want to configure crawl settings for. To learn about this procedure, see Create a Lucidchart external content connector.
Role required: sn_ext_conn.xcc_admin
About this task
You can configure the following crawl settings for a Lucidchart external content connector:
- Inclusion or exclusion filters for the folders to crawl when running content crawls
- Inclusion or exclusion filters for the file extensions to retrieve when running content crawls
Content is retrieved from the source system only if it passes all of your configured crawl setting filters. If any crawl setting filter excludes a content item, the external content connector doesn't retrieve it.
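The all-filters-must-pass rule can be sketched as follows. This is a minimal illustration, not the connector's implementation: the `FilterRule` class, the `should_retrieve` function, and the exact-match comparison of folder and document names are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class FilterRule:
    """Hypothetical model of one crawl setting filter (not a ServiceNow API)."""
    mode: str = "all"  # "all", "include", or "exclude"
    values: set = field(default_factory=set)

    def passes(self, candidate: str) -> bool:
        if self.mode == "include":
            return candidate in self.values
        if self.mode == "exclude":
            return candidate not in self.values
        return True  # "all": nothing is filtered out


def should_retrieve(folder: str, document: str,
                    folder_rule: FilterRule, document_rule: FilterRule) -> bool:
    """An item is retrieved only if every configured filter accepts it."""
    return folder_rule.passes(folder) and document_rule.passes(document)


# Assumed configuration: include only MarketingFolder, exclude release_document.
folders = FilterRule("include", {"MarketingFolder"})
documents = FilterRule("exclude", {"release_document"})

print(should_retrieve("MarketingFolder", "roadmap", folders, documents))           # True
print(should_retrieve("MarketingFolder", "release_document", folders, documents))  # False
print(should_retrieve("DraftFolder", "roadmap", folders, documents))               # False
```

In this sketch, a document in an included folder is still skipped when the document filter excludes it, which mirrors the rule that a single excluding filter is enough to prevent retrieval.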
By default, an external content connector can index up to one million (1,000,000) documents from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends document deletions and updates to AI Search for indexing, ignoring new documents. The connector logs an error message for every 10,000 documents it crawls beyond the indexing limit.
When a connector's indexed document count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.
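The thresholds described above can be summarized in a short sketch. The function names and change-type labels are hypothetical, and the handling of a count exactly at the limit is an assumption; only the numeric values come from the text.

```python
INDEX_LIMIT = 1_000_000         # default indexing limit
WARNING_THRESHOLD = 800_000     # a warning appears once the count exceeds this
OVERFLOW_LOG_INTERVAL = 10_000  # one error logged per this many documents beyond the limit


def ui_status(indexed_count: int) -> str:
    """Return the message level the connector UI would show (assumed labels)."""
    if indexed_count >= INDEX_LIMIT:
        return "error"
    if indexed_count > WARNING_THRESHOLD:
        return "warning"
    return "ok"


def is_sent_to_index(change_type: str, indexed_count: int) -> bool:
    """At the limit, only updates and deletions are forwarded; new documents are ignored."""
    if change_type in ("update", "delete"):
        return True
    # Assumption: a new document is accepted only while the count is below the limit.
    return indexed_count < INDEX_LIMIT


def overflow_errors_logged(documents_beyond_limit: int) -> int:
    """Number of error messages logged for documents crawled beyond the limit."""
    return documents_beyond_limit // OVERFLOW_LOG_INTERVAL


print(ui_status(850_000))                     # warning
print(is_sent_to_index("new", 1_000_000))     # False
print(is_sent_to_index("update", 1_000_000))  # True
print(overflow_errors_logged(25_000))         # 2
```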
If one of your connectors reaches the indexing limit, you can update its crawl settings and file inclusion/exclusion filters to reduce the number of documents it retrieves. Alternatively, if you need to index more than 1,000,000 documents, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.
Procedure
- In the Connectors list, select the record for the Lucidchart external content connector whose settings you want to modify.
- In the connector editor's Settings tab, select Crawl settings.
- Select one of the following Date options:
  - To retrieve content from documents that were published in the last year, select Published last year.
  - To retrieve content from documents that were published in the last quarter, select Published last quarter.
  - To retrieve content from documents that were published in the last month, select Published last month.
  - To retrieve content from documents that were published in the last week, select Published last week.
- Select one of the following Folders options:
  - To crawl all folders encountered on the source system, select Crawl all folders.
  - To crawl only a specified set of folders encountered on the source system, select Include only these folders, then use the Add folder name or ID to include field and Add button to enter names or IDs for folders you want the connector to include when crawling.
    As an example, you might enter MarketingFolder to only retrieve searchable content from documents in folders with the specified name or ID.
  - To crawl all but a specified set of folders encountered on the source system, select Exclude only these folders, then use the Add folder name or ID to exclude field and Add button to enter names or IDs for folders you want the connector to exclude when crawling.
    As an example, you might enter DraftFolder to exclude searchable content from documents in folders with the specified name or ID.
- Select one of the following Documents options:
  - To crawl all documents from the source system, select Crawl all documents.
  - To crawl only a specified set of documents from the source system, select Include only these documents, then use the Add document name or ID to include field and Add button to enter names or IDs for documents you want the connector to include in retrieval.
    As an example, you might enter release_document to only retrieve searchable content from documents with the specified name or ID.
  - To crawl all but a specified set of documents from the source system, select Exclude only these documents, then use the Add document name or ID to exclude field and Add button to enter names or IDs for documents you want the connector to exclude from retrieval.
    As an example, you might enter DraftFolder2 to exclude searchable content from documents with the specified name or ID.
- Select Save and validate.
Result
The Lucidchart external content connector is updated with your modified crawl settings.
What to do next
To retrieve content from your Lucidchart source system using your modified crawl settings, create and run a one-time content crawl for your Lucidchart external content connector. To learn about creating and running one-time content crawls, see Create a content crawl for an external content connector.