Skip to main content

Crawling restrictions

Overview

  • Redlist restrictions permit Glean to crawl and index everything except the specified content (specific exclude).
Restriction TypeRedlistDetails
User-based RestrictionsRestrict crawling to exclude content created/modified/viewed by specific users or a specific group (plus public content).
Content-based RestrictionsRestrict crawling to exclude specific content, documents, messages, or objects (see below).

Supported restrictions

RestrictionRedlistDetails
User (Owner)Restrict crawling to exclude content owned by specific users.
Content (Folder)Restrict crawling to exclude content within specific folders.
Content (File)Restrict crawling to exclude specific files.
Event TypesRestrict activity/content updates to specific event types (e.g. DOWNLOAD)
Service Account (Email)Restrict activity/content updates from certain service accounts (e.g. Bots/services that synchronize or backup content to Box)

Applying restrictions

MethodSupportedDetails
Admin UIRestrictions cannot currently be applied in the Admin UI.
Glean SupportRestrictions can be applied by Glean Support on request.

Format

When specifying restrictions for Owners, Folders, or Files, the ID of the owner, folder, or file within the Box app must be specified.

For example:

  • Owner IDs:

    23400261190,23401260091
  • Folder IDs:

    119142000606,518142000607
  • File IDs:

    31600504200,31600504201

Locating user IDs

As a Box admin, navigate to the Box Managing content.

Click on a user from the user list, and the URL will reveal their user ID. For example, in https://app.box.com/master/content/2267862105/0/0, the user ID is 2267862105.