SharePoint and OneDrive
Supported Crawling Restrictions for SharePoint
Overview
- Greenlist restrictions permit crawling only for the specified content.
- Redlist restrictions prohibit crawling for the specified content.
Restriction Type | Greenlist | Redlist | Details |
---|---|---|---|
Time-based Restrictions | ✅ | ❌ | Restrict crawling to include/exclude content created/modified/viewed after a certain date. |
Identity-based Restrictions | ✅ | ❌ | Restrict crawling to include/exclude content created/modified/viewed by specific users or a specific group (plus public content). |
Content-based Restrictions | ✅ | ❌ | Restrict crawling to include/exclude specific content, documents, messages, or objects. |
Supported Restrictions
Restriction | Greenlist | Redlist | Details |
---|---|---|---|
Date | ✅ | ❌ | Restrict crawling to only content created/modified/viewed after a specific date. |
Entra ID Group | ✅ | ❌ | Restrict crawling to only content created/modified/viewed by users in a specific Entra ID group. Note: Public content is always crawled. |
Site | ✅ | ✅ | Restrict crawling to include/exclude specific SharePoint sites. |
Sites should be provided in URL format without a trailing forward slash. For example:
For Group restrictions when using Azure AD/Entra ID, the Object ID of the AD Group should be provided, NOT the Group name. For example:
Limitations
Applying Restrictions
Method | Supported | Details |
---|---|---|
Admin UI | ✅ | Restrictions can be applied in the Admin UI under the connector settings. |
Glean Support | ✅ | Restrictions can be applied by Glean support on request. |
Not all restrictions can be applied in the Admin UI. Please contact Glean support to apply the restriction if it is missing from the UI.