Overview

  • Greenlist restrictions permit crawling only for the specified content.
  • Redlist restrictions prohibit crawling for the specified content.
Restriction TypeGreenlistRedlistDetails
Time-based RestrictionsRestrict crawling to include/exclude content created/modified/viewed after a certain date.
Identity-based RestrictionsRestrict crawling to include/exclude content created/modified/viewed by specific users or a specific group (plus public content).
Content-based RestrictionsRestrict crawling to include/exclude specific content, documents, messages, or objects.

Supported Restrictions

RestrictionGreenlistRedlistDetails
DateRestrict crawling to only content created/modified/viewed after a specific date.
Entra ID GroupRestrict crawling to only content created/modified/viewed by users in a specific Entra ID group. Note: Public content is always crawled.
SiteRestrict crawling to include/exclude specific SharePoint sites.

Sites should be provided in URL format without a trailing forward slash. For example:

https://<domain>.sharepoint.com/sites/<siteName>

For Group restrictions when using Azure AD/Entra ID, the Object ID of the AD Group should be provided, NOT the Group name. For example:

7c77a355-c78c-6362-a195-d2428d285107

Limitations

Applying Restrictions

MethodSupportedDetails
Admin UIRestrictions can be applied in the Admin UI under the connector settings.
Glean SupportRestrictions can be applied by Glean support on request.

Not all restrictions can be applied in the Admin UI. Please contact Glean support to apply the restriction if it is missing from the UI.