Sync People Data
In this section, you will synchronize your employee directory to Glean to ensure secure and accurate search results. Glean calls this identity information ‘People Data’
You will not be able to complete this step until your Glean tenant has been provisioned. If you were not able to switch from Magic Links to SSO in the last section, you will need to return to this step later.
About People Data
Synchronizing your employee directory to Glean is a critical part of the setup process, as it allows Glean to:
- Enforce document permissions.
- Apply role-based access controls (RBAC).
- Deliver more relevant search results.
If you configured SSO using OIDC in the last step, and your directory is the same as your SSO platform (e.g. Azure AD/Entra ID, Okta), then you have already configured people data sync as part of the SSO configuration; in which case, you now simply need to enable it as a source.
Configure a People Data Source
Required Fields
You must ensure that the following fields are populated for every user in your People Data source:
- Name (First Name, Last Name)
- Department
If you do not provide these fields, directory sync of user data to Glean will fail.
Additionally, the following fields are strongly recommended to enhance the Glean experience for your organization:
Recommended Field | Description |
---|---|
Manager / Manager Email | The email of the user’s manager. Used heavily in search ranking signals. Required for the org chart to be populated correctly. |
Location | The working location of the user. Required for location-based personalization. |
Start Date | The start date of the user. To show tenure in the user’s profile page. |
Initial Setup
Navigate to Admin Console > Settings > People data
If your SSO platform supports directory synchronization, it will be listed here with a green checkmark and status Ready to sync. Click the SSO provider’s icon to proceed.
Select your People Data Source form the list of supported apps
If you wish to set up an alternative source, click the appropriate service and follow the setup instructions.
Using a People Data source not listed?
You will need to manually upload your directory data to Glean using CSV, or leverage Glean’s Indexing API to push your directory data to Glean.
In some cases, Glean may also be able to pull your people data from another connected data source such as BambooHR, or Facebook Workplace. Please contact your Glean engineer or Glean support for more information.
Attribute Mapping
On the next screen, and if your People Data source supports it, you will be able to map attributes from your directory to the required field in Glean. This is useful if you store user information differently than what is standard.
For example, you might want to infer a user’s location from their manager’s location. If they have an Okta field called managerLocation
, you could map managerLocation
to the Glean attribute location
.
If your People Data source does not support custom mapping in the Glean interface, you will not see this option. Please contact Glean support if you require this feature but do not see it for your configured People Data source.
Map fields to Glean attributes if these are different to the defaults
Initiate the Sync
Before you start the sync, you can preview what information will included. For example, how many users will be created, and how many departments you have. If this is different from what you expected, check that you have provided the correct permissions within the People Data source.
Click Start sync to begin synchronization. From this point onwards, the sync is automatic: Glean will check your People Data source periodically for any changes.
Click Start sync to begin synchronization
Verification
The People Data sync will start, and over the next hour, you will start to see your employee directory being populated in Glean.
The synchronization is not immediate: Glean needs to crawl your people data first, and then index it so that it is usable in search.
It can take 2-3 hours for the initial sync to complete.
You can return to this page later to validate that your sync is working.
Your people data should be synced after 2-3 hours
If you do not see people data start to show up within a few hours, this typically points to a permissions issue.
Ensure that you have provided Glean with the correct permissions within your IdP when you configured SSO.
For example, for Azure AD, it is critical to have Directory.Read.All
and User.Read.All
as Application permissions NOT Delegated Permissions. If your issues persist, please contact Glean support.
Manual Upload (CSV)
For instances where your directory information cannot be automatically synchronized, Glean also supports the manual upload of people data as a set of comma-separated values .csv.
Because this method is not asynchronous, Glean does not recommend it. Any updates to your directory (new employees, departures, or changes) will require you to re-upload a new CSV.
CSV Format
The following are mandatory fields:
Required Field | Description |
---|---|
first_name | The user’s first name, eg: Ben |
last_name | The user’s last name, eg: Benson |
email | The user’s email, eg: bbenson@domain.com |
title | The user’s job title, eg: Software Engineer |
department | The user’s department, eg: Engineering |
The following are not mandatory, but highly recommended fields to enhance the Glean experience for your organization:
Recommended Field | Description |
---|---|
manager_email | The email of the user’s manager. Used heavily in search ranking signals. Required for the org chart to be populated correctly. |
location | The working location of the user. Required for location-based personalization. |
start_date | The start date of the user. To show tenure in the user’s profile page. |
A sample CSV is included below:
Upload the Data
Select CSV as a People Data source and select your CSV file when prompted.
If you do not see this as an option or are having difficulties with the upload, please contact Glean support.
Was this page helpful?