Configure crawl settings for a Google Drive external content connector
Specify the shared drives you want your Google Drive external content connector to crawl. Define inclusion or exclusion filters to dictate the types of content the crawl retrieves and feeds to AI Search for indexing.
Before you begin
A connector administrator must have already created the Google Drive external content connector that you want to configure crawl settings for. To learn about this procedure, see Create a Google Drive external content connector.
Role required: sn_ext_conn.xcc_admin
About this task
You can configure the following crawl settings for the connector:
- Inclusion or exclusion filters for the eligible shared drives to crawl when running content crawls
  Note: To be eligible for crawling, a shared drive must be accessible by at least one member who is a user in the Directory and who has the Manager role (or is a member of a group with the Manager role). To learn more about the Directory, see https://support.google.com/a/answer/1628009. For details on the Manager role, see https://support.google.com/a/users/answer/12380484.
- Inclusion or exclusion filters for the attachment file extensions to retrieve when running content crawls
Content is retrieved from the source system only if it passes all of your configured crawl setting filters. If any crawl setting filter excludes a content item, the external content connector doesn't retrieve it.
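The "must pass all filters" rule can be illustrated with a short Python sketch. This is not the connector's implementation. The `Filter` class, the `should_retrieve` function, and the case-insensitive matching are all hypothetical, and the sketch assumes each filter is either an inclusion list (only listed values pass) or an exclusion list (listed values are rejected).

```python
from dataclasses import dataclass
from typing import FrozenSet


@dataclass(frozen=True)
class Filter:
    """Hypothetical model of one crawl setting filter."""
    mode: str                 # "include" or "exclude" (assumed semantics)
    values: FrozenSet[str]    # shared drive names or file extensions

    def allows(self, value: str) -> bool:
        # Case-insensitive matching is an assumption made for this sketch.
        listed = value.lower() in {v.lower() for v in self.values}
        # Inclusion filter: only listed values pass.
        # Exclusion filter: listed values are rejected.
        return listed if self.mode == "include" else not listed


def should_retrieve(drive_name: str, file_extension: str,
                    drive_filter: Filter, extension_filter: Filter) -> bool:
    """An item is retrieved only if every configured filter allows it."""
    return (drive_filter.allows(drive_name)
            and extension_filter.allows(file_extension))
```

For example, with an inclusion filter on a shared drive named `Engineering` and an exclusion filter on the `exe` extension, a PDF in `Engineering` would be retrieved, while an `exe` file in the same drive, or any file in another drive, would not.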
By default, an external content connector can index up to one million (1,000,000) documents from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends document deletions and updates to AI Search for indexing, ignoring new documents. The connector logs an error message for every 10,000 documents it crawls beyond the indexing limit.
When a connector's indexed document count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.
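The thresholds described above can be summarized in a small Python sketch. The numbers come from this topic, but the function names, the status strings, and the exact boundary handling (for example, treating a count equal to the limit as "limit reached") are assumptions made for illustration only, not the connector's actual logic.

```python
INDEX_LIMIT = 1_000_000          # default indexing limit per connector
WARNING_THRESHOLD = 800_000      # UI warning appears above this count
OVERFLOW_LOG_INTERVAL = 10_000   # one error logged per this many extra documents


def ui_status(indexed_count: int) -> str:
    """Return the (hypothetical) UI state for a given indexed document count."""
    if indexed_count >= INDEX_LIMIT:
        return "error"      # limit reached
    if indexed_count > WARNING_THRESHOLD:
        return "warning"    # approaching the limit
    return "ok"


def is_sent_for_indexing(change_type: str, indexed_count: int) -> bool:
    """At the limit, only updates and deletions are sent; new documents are ignored."""
    if indexed_count < INDEX_LIMIT:
        return True
    return change_type in ("update", "delete")


def overflow_errors_logged(crawled_beyond_limit: int) -> int:
    """Number of error messages logged for documents crawled beyond the limit."""
    return crawled_beyond_limit // OVERFLOW_LOG_INTERVAL
```

Under these assumptions, a connector with 800,001 indexed documents shows a warning, and a connector that has crawled 25,000 documents past the limit has logged two error messages.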
If one of your connectors reaches the indexing limit, you can update its crawl settings and file inclusion or exclusion filters to reduce the number of documents it retrieves. Alternatively, if you need to index more than 1,000,000 documents, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.
Procedure
Result
The Google Drive external content connector is updated with your crawl scope and file extension filter settings.
What to do next
To retrieve content from your Google Drive source system using your modified crawl settings, create and run a one-time content crawl for your Google Drive external content connector. To learn about creating and running one-time content crawls, see Create a content crawl for an external content connector.