Skip to main content

Troubleshooting and FAQs

This page covers issues most often reported on the GitHub connector. For setup steps, see the GitHub connector setup guide.

Setup and installation

Connector validation fails right after installing the GitHub App

Symptoms:

  • Setup cannot complete: Validation returns an error instead of transitioning the connector to a healthy, crawling state.
  • Error messaging: The error references authentication, installation, connectivity, or organization configuration.
Likely causeFix / Remediation
Blocked egress IPs: Glean cannot reach GitHub because Glean's egress IPs are not allowlisted on your network firewall or your GitHub IP allowlist (GitHub Enterprise Cloud).Add Glean's documented egress IP ranges to both allowlists, then rerun validation. Note: Connectivity issues surface immediately during validation; they will not silently fail crawls later.
Incomplete installation: The GitHub App was not installed, or the installation event was not received by Glean.Navigate to GitHub, confirm the Glean App is successfully installed on your organization, and rerun validation.
Credential mismatch: The App credentials in Glean (App ID, Client ID, Client Secret, or private key) do not match the GitHub App.Copy the correct values directly from the GitHub App's settings page and re-enter them into the Glean admin console.
Organization name mismatch: The organization name configured in Glean does not match the actual GitHub organization where the App is installed.Correct the organization name in the Glean Admin Console and rerun validation.

Still not resolved? Contact Glean Support with your GitHub organization name and the exact text of the validation error.

Missing content

Only some repositories are searchable in Glean

Symptoms:

  • Users can find content in some repositories but not others.
  • Specific repositories that should be indexed do not appear in search results at all.
Likely causeFix / Remediation
Repository filtering lists: An active allowlist or blocklist is excluding the repository.The Glean GitHub App is always installed against all repositories in the organization (the installation cannot be scoped to a subset). Check the connector's configuration for allowedRepositories and blockedRepositories. If adjusted, wait for the next repository discovery crawl to reflect changes.
Private repository access limitations: The repository is private, meaning strict collaborator permissions apply.The requesting users are either not collaborators on the repository or have not completed per-user OAuth. See Developers can't see their private GitHub repositories in Glean.

Developers can't see their private GitHub repositories in Glean

Symptoms:

  • A developer searches for code or PRs from a private repository they have access to in GitHub, but receives no results.
  • Public content from the same GitHub organization returns normally in search results.
Likely causeFix / Remediation
Missing per-user OAuth authentication: The user has not individually authorized the GitHub connector.GitHub does not expose user emails directly to the connector. Instruct the user to navigate to Glean → Settings → Connectors → GitHub and click Authorize. This provides Glean with their GitHub email to match identity mappings, enabling visibility into private repos they already collaborate on without granting new access.
Missing GitHub repository permissions: The user is not a collaborator on the repository within GitHub itself.Glean grants private repository visibility strictly based on the repository's explicit collaborator list. Confirm the user is added as a collaborator within GitHub before troubleshooting Glean indexing.
Incorrect GitHub identity authorized: The user authorized GitHub using a different account identity.The user may have logged into an incorrect or personal GitHub account during the OAuth process. Have the user disconnect and re-authorize using the specific GitHub account that holds the necessary enterprise permissions.

Symptoms:

  • A user expects to find a known GitHub issue, PR, or source file, but it is not returned.
  • Other documents within the exact same repository are searchable.
  • A PR or issue appears in Glean, but related details such as diffs, reviews, or comments are still missing.
Likely causeFix / Remediation
Crawl synchronization delay: The document was created or updated recently.New and updated GitHub content is picked up by the incremental content crawl. Allow a scheduled crawl cycle to run, then check again.
Auxiliary data freshness lag: Related details can take longer to update than the PR or issue itself.A PR or issue can already appear in search while related data such as diffs, reviews, or comments is still updating. Because this auxiliary data is indexed as part of the existing PR or issue rather than as separate top-level documents, search completeness can improve over time even when overall document counts stay flat.

Still not resolved? Contact Glean Support with the exact URL of the missing document.

Symptoms:

  • Documentation hosted on GitHub Pages does not appear in Glean search results.
Likely causeFix / Remediation
GitHub Pages crawling is disabled: Glean does not index GitHub Pages by default.Contact Glean Support to explicitly enable GitHub Pages crawling for your specific connector instance.
Unsupported deployment method: The Pages site uses a custom GitHub Actions workflow rather than GitHub's branch-based build.Even when Pages crawling is enabled, Glean only indexes Pages sites that use GitHub's branch-based build (Pages deployed from a branch). Sites built with a custom GitHub Actions workflow aren't indexed.
Limitation

GitHub Pages crawling is disabled by default. When it's enabled, only sites using GitHub's branch-based build (deployed from a branch) are indexed; Pages built with custom GitHub Actions workflows are not.

Permissions and access

Users see no results or access-denied messaging across GitHub content

Symptoms:

  • Complete lack of results: Many users in the organization get no GitHub results at all.
  • Access-denied messages: Users see explicit access-denied messaging on GitHub documents within Glean.
  • Stale data: Identity data in Glean appears outdated or incomplete.
Likely causeFix / Remediation
Incomplete initial identity sync: The first full identity crawl pass is still in progress since the connector was installed.On large GitHub organizations, the initial identity sync can take time to complete. Glean also processes membership and team changes through identity webhooks in near real time. Check connector status and consult the GitHub connector documentation for more details before debugging individual accounts.
Missing per-user OAuth: Users have not authenticated their individual GitHub accounts.For private content, each user must authorize GitHub once by navigating to Glean → Settings → Connectors → GitHub.
Pending permission sync: A recent repository-collaborator assignment or team-membership change in GitHub has not been picked up yet.The next identity crawl will automatically reflect the change. Allow at least one crawl cycle for permissions to update.

Document count or GitHub details are not increasing as expected

Symptoms:

  • A pull request, issue, or repository appears in Glean, but related details such as PR diffs, review comments, reviews, or other auxiliary content are missing.
  • The overall document count does not increase even though additional GitHub activity has occurred.
Likely causeFix / Remediation
Auxiliary data processing queue: Associated GitHub metadata is queued behind the parent entity.Elements like PR diffs, reviews, and comments are indexed sequentially after the primary document surfaces. Allow time for the connector to finish crawling this auxiliary content.
Flat document count metrics: Granular updates modify existing document parts rather than creating new records.Because diffs, reviews, and comments are nested parts of a top-level document, a PR can update with rich text and metadata—improving search completeness—without increasing the overall document count metric.

A user account with service-account-like terms in its name or email can't see GitHub content

Symptoms:

  • Isolated user visibility issues: A specific user account cannot see any GitHub content in Glean, while other users in the same GitHub organization search content normally.
  • Service account naming patterns: The affected account's display name or email contains terms like svc, service, bot, or account.
Likely causeFix / Remediation
Automated identity filtering: The account was automatically filtered during identity ingestion because its display name or email triggered service-account exclusion rules.Check the user's identity profile. If necessary, adjust the display name or email alias to avoid using service-account-like terms.
Missing required identity attributes: The account is missing critical metadata fields needed for proper mapping.Confirm the user has all required attributes populated in your primary identity provider, including a department value if your identity setup requires it.

Still not resolved? Contact Glean Support with the affected user's work email and GitHub username.

Authentication

Persistent authentication errors after a working install

Symptoms:

  • Sudden connector failure: The connector worked initially, but subsequently started failing with authentication errors.
  • Unresolved errors: Authentication errors persist across multiple crawl attempts and do not self-heal.
Likely causeFix / Remediation
App uninstalled or suspended: The Glean GitHub App was manually uninstalled or suspended within the GitHub organization.Organization administrators cannot change the App's permission scopes, but they can suspend or uninstall it. In GitHub, verify the Glean App is still installed and active. Unsuspend or reinstall it if necessary.

Still not resolved? Contact Glean Support. Persistent authentication failures often require the connector instance to be manually re-validated by the Glean team.

OAuth or webhook calls between Glean and GitHub are blocked at the network

Symptoms:

  • Authentication timeouts: OAuth authorization flows hang indefinitely or fail with timeout errors.
  • Real-time update failures: Webhook deliveries from GitHub are not reaching Glean (e.g., new PRs do not appear until the next scheduled incremental crawl).
Likely causeFix / Remediation
Missing firewall or allowlist rules: Glean's egress IPs are not allowed on your corporate network firewall or your GitHub IP allowlist (GitHub Enterprise Cloud).Add Glean's documented egress IP ranges to both your corporate firewall and your GitHub IP allowlist configurations.
TLS interception or proxy disruption: A corporate proxy server is breaking TLS encryption or rewriting critical HTTP headers.Confirm with your network team that traffic moving between Glean and GitHub is excluded from inspection and is not being intercepted.

FAQs