Monitor connector sync progress

The data sources page provides an overview of your connected sources and their current status. Use this page to confirm that initial sync is progressing and that ongoing updates continue to flow after setup.

Sync progress

The data sources page groups sources by where they are in the connector lifecycle.

  • Initial sync
  • All data sources

[Figure: Data sources page during initial sync]

Initial sync

Initial sync is split into two phases to help you understand where each data source is in the sync process:

1. Crawling

The data source is actively fetching content, metadata, and permissions from the source system. A data source typically remains in this phase while Glean retrieves the initial corpus of data.

It's normal for a data source to remain in this phase for a while, especially for large data sources or data sources with strict API rate limits. As long as Items synced continues to increase (even slowly), the crawler is making healthy progress.

2. Indexing

The data source is processing crawled content and incorporating it into the Knowledge Graph. A data source moves to this phase as crawling completes or nears completion.

Indexing can take time for large volumes of content or during periods of high system load. You can monitor Items synced to confirm that indexing is progressing. Steady increases indicate healthy operation.
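
The "still increasing" check described for both phases can be expressed as a small script. The following is a minimal sketch, assuming you record the Items synced value yourself at regular intervals (for example, by noting it from the data sources page). The function name `is_progressing`, the `window` parameter, and the sample readings are hypothetical and are not part of any Glean API.

```python
from typing import Sequence


def is_progressing(items_synced: Sequence[int], window: int = 3) -> bool:
    """Return True if the latest reading is higher than the reading
    taken `window` samples earlier (or the oldest one available).

    `items_synced` is a chronological list of Items synced values that
    you recorded by hand; this is an assumption for illustration only.
    """
    recent = list(items_synced)[-(window + 1):]
    if len(recent) < 2:
        # Not enough readings to compare yet, so do not report a stall.
        return True
    return recent[-1] > recent[0]


if __name__ == "__main__":
    # Slow but steady growth: treated as healthy progress.
    print(is_progressing([1200, 1850, 1850, 1900]))        # True
    # No growth across the last four readings: worth a closer look.
    print(is_progressing([1200, 1900, 1900, 1900, 1900]))  # False
```

A `False` result only means the count did not rise across the most recent readings. For large data sources or sources with strict API rate limits, a longer interval between readings may be appropriate before treating the result as a stall.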

All data sources

Once crawling and indexing are complete, the data source moves from Initial sync to All data sources. At this point, the connector has completed its initial sync and is in steady-state operation, processing incremental updates through webhooks and periodic crawls.

Return to this section to monitor Change rate (items/day) and confirm that ongoing updates are flowing.

Use metrics for troubleshooting

You can use Items synced and Change rate (items/day) together with the sync status to troubleshoot connector health.

Best practices

  • During onboarding, monitor the Initial sync section and the Items synced metric to spot stalls early and confirm that work is progressing.
  • In steady state, monitor Change rate (items/day) in All data sources to ensure that day-to-day updates are flowing. You can sort by the column to surface the most or least active sources.
  • Investigate sustained 0 values in Items synced or Change rate when you expect activity, such as after configuration changes, scope changes, or known source activity.
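
The last practice above, flagging sustained 0 values only where activity is expected, can be sketched as follows. This is an illustrative example under stated assumptions: the daily Change rate readings are recorded by you, the source names are invented, and `has_sustained_zero`, `sources_to_investigate`, and `min_run` are hypothetical names rather than Glean features.

```python
from typing import Dict, Iterable, List, Sequence


def has_sustained_zero(readings: Sequence[float], min_run: int = 3) -> bool:
    """Return True if the most recent `min_run` readings are all zero."""
    if min_run <= 0 or len(readings) < min_run:
        return False
    return all(value == 0 for value in readings[-min_run:])


def sources_to_investigate(
    change_rates: Dict[str, Sequence[float]],
    expected_active: Iterable[str],
    min_run: int = 3,
) -> List[str]:
    """List sources that should be active but show a sustained zero rate.

    `change_rates` maps a source name to its chronological Change rate
    (items/day) readings; `expected_active` names the sources where you
    expect activity, such as after a configuration or scope change.
    """
    expected = set(expected_active)
    return sorted(
        name
        for name, readings in change_rates.items()
        if name in expected and has_sustained_zero(readings, min_run)
    )


if __name__ == "__main__":
    # Hypothetical readings, oldest first.
    rates = {
        "wiki": [12, 0, 0, 0],     # expected active, zero for three days
        "tickets": [5, 0, 3, 0],   # expected active, zeros are not sustained
        "archive": [0, 0, 0, 0],   # no activity expected, so not flagged
    }
    print(sources_to_investigate(rates, ["wiki", "tickets"]))  # ['wiki']
```

Restricting the check to sources where activity is expected avoids flagging sources that are legitimately quiet.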

Info: Status indicators such as crawl status, document count, and content crawl may lag behind the actual state of the data source. When the status shows "Crawl in progress", content should still be searchable as long as:

  • A document count is displayed
  • The data source is set as visible

Warning: If content isn't appearing in search results despite meeting these conditions, submit a support ticket.

Note: These metrics and status indicators provide visibility into connector activity and health, but don't represent a complete health verdict. Use them alongside other status indicators and error surfacing for comprehensive monitoring.