Estimate document volume for Microsoft SharePoint Online
Estimate the total number of documents included in your Microsoft SharePoint Online source system and the document counts for individual sites. Use this information to determine crawl scope settings needed for your Microsoft SharePoint Online external content connector.
Before you begin
- Login credentials
- Permission to configure and export the Microsoft SharePoint Online active site listing
For detailed information about access to reports in the Microsoft 365 admin center, see the Microsoft 365 reports in the admin center article on the Microsoft Learn site.
Role required: none
About this task
Before configuring a Microsoft SharePoint Online external content connector, you may find it helpful to estimate the number of documents available in your Microsoft SharePoint Online source system and in its individual sites.
By default, an external content connector can index up to one million (1,000,000) documents from its source system. When a connector exceeds this limit, it continues to crawl the source system, but only sends document deletions and updates to AI Search for indexing, ignoring new documents. The connector logs an error message for every 10,000 documents it crawls beyond the indexing limit.
When a connector's indexed document count exceeds 800,000, a warning message appears in the connector's UI to indicate that it's approaching the indexing limit. If the connector reaches the indexing limit, an error message appears in its UI.
If one of your connectors reaches the indexing limit, you can update its crawl settings and file inclusion/exclusion filters to reduce the number of documents it retrieves. Alternately, if you need to index more than 1,000,000 documents, you can create a Customer Service and Support case at https://support.servicenow.com/now to request a limit increase for the connector.
Procedure
What to do next
If your Microsoft SharePoint Online source system's total available document count exceeds the connector limit of one million (1,000,000) documents, you will need to limit the crawl scope for the Microsoft SharePoint Online external content connector. Choose a set of sites whose total document count is less than the connector limit, and inform your AI Search administrator so they can configure the external content connector's crawl settings to include only those sites.
For details on configuring the Microsoft SharePoint Online connector's crawl settings, see Configure crawl settings for a Microsoft SharePoint Online external content connector.