Introduction

  • The Stack Overflow connector works with public stackoverflow.com content, private Stack Overflow for Teams content, and the Enterprise edition.
    • Crawling public content requires no authentication; only the stackoverflow.tagsWhiteList config needs to be set.
    • To crawl Teams content, Glean requires authentication to fetch the relevant information.
    • To crawl Enterprise edition content, Glean likewise requires authentication.
  • For Teams content, authentication is done by generating an access token through the explicit OAuth 2.0 flow.
  • For the Enterprise edition, authentication uses the API key from your Enterprise instance.
  • Glean understands all user access permissions and strictly enforces them at query time, so users cannot see results they do not have access to.
  • All data is stored in a GCP project inside the customer’s cloud account; no data leaves the customer’s environment.

Integration Features

Currently Glean indexes the following content:

  • Questions and Answers

API Usage

Glean will use the standard API to ingest all data.

Installation Process for Public stackoverflow.com Content

  1. In the Glean admin console, set the stackoverflow.tagsWhiteList config to a semicolon-separated list of the desired tags.
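
As an illustration of the expected value format, here is a minimal Python sketch that builds a semicolon-separated string from a list of tags. The helper name build_tags_whitelist and the example tags are assumptions made for this sketch; they are not part of Glean or of the connector.

```python
def build_tags_whitelist(tags):
    """Join tag names into a semicolon-separated string.

    Hypothetical helper: trims whitespace, skips empty entries,
    removes duplicates (keeping first-seen order), and rejects
    tags that contain the separator itself.
    """
    cleaned = []
    for tag in tags:
        tag = tag.strip()
        if not tag:
            continue
        if ";" in tag:
            raise ValueError(f"tag must not contain ';': {tag!r}")
        if tag not in cleaned:
            cleaned.append(tag)
    return ";".join(cleaned)


# Example tags are placeholders; substitute the tags you want crawled.
print(build_tags_whitelist(["python", " kubernetes ", "python", "terraform"]))
# prints: python;kubernetes;terraform
```

The resulting string is what you would paste into the stackoverflow.tagsWhiteList field.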

Installation Process for Stack Overflow Enterprise

  1. In the Glean admin console, upload the API key from your instance as STACKOVERFLOW_API_KEY.
  2. Set the stackoverflow.useApiKey config to true.
  3. Set the stackoverflow.apiKeyDomain config to your enterprise domain.
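
The three Enterprise settings can be treated as a short checklist. The Python sketch below models them as a plain dictionary and reports which ones are still missing. The function check_enterprise_settings, the dictionary representation, and the example values are assumptions made for illustration; Glean's admin console stores these values itself, and the exact format expected for the domain is not specified here.

```python
def check_enterprise_settings(settings):
    """Return a list of problems with the Enterprise settings (empty if none).

    Hypothetical checklist helper; the keys mirror the names used in
    the steps above, but the dict itself is only an illustration.
    """
    problems = []
    if not settings.get("STACKOVERFLOW_API_KEY"):
        problems.append("STACKOVERFLOW_API_KEY must be uploaded")
    # Accept either the boolean True or the string "true".
    if str(settings.get("stackoverflow.useApiKey", "")).lower() != "true":
        problems.append("stackoverflow.useApiKey must be true")
    if not str(settings.get("stackoverflow.apiKeyDomain", "")).strip():
        problems.append("stackoverflow.apiKeyDomain must be set")
    return problems


# Placeholder values only; use your own key and enterprise domain.
example = {
    "STACKOVERFLOW_API_KEY": "<your-api-key>",
    "stackoverflow.useApiKey": "true",
    "stackoverflow.apiKeyDomain": "stackoverflow.example.com",
}
print(check_enterprise_settings(example))
# prints: []
```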

Installation Process for Stack Overflow for Teams

  1. Find your team ID in your team’s URL settings under Admin Settings. Use it wherever TEAM_ID appears in this document.
  2. Ask the Stack Overflow admin user to complete the following flow and obtain the code from the redirect URI:
    1. Visit this URL to obtain the code: https://stackoverflow.com/oauth?client_id=18604&scope=no_expiry,access_team|stackoverflow.com/c/{TEAM_ID}&redirect_uri=https://askscio.com/oauth/
      1. This will redirect to https://askscio.com/oauth/?code={CODE}
      2. Copy the code; it is used to obtain the access token in step 3.
  3. Share the code securely with Glean team members. It is used to obtain the access token for the crawl.
  4. Glean team members will share the access token back with you. In the Glean admin console, upload it as STACKOVERFLOW_ACCESS_TOKEN.
  5. Set the stackoverflow.team config to the TEAM_ID.
  6. [Optional] To crawl only questions with certain tags, set the stackoverflow.tagsWhiteList config to a semicolon-separated list of the desired tags. An empty value crawls all tags and is the default behavior.
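
Step 2 of the Teams flow involves substituting TEAM_ID into the authorization URL and then reading the code out of the redirect URL. The Python sketch below shows one way to do both. The URL template is copied from step 2; the function names build_authorize_url and extract_code, the example team ID, and the example code value are assumptions made for illustration.

```python
from urllib.parse import parse_qs, urlparse

# Template copied from step 2; only the team ID is substituted.
AUTHORIZE_URL_TEMPLATE = (
    "https://stackoverflow.com/oauth?client_id=18604"
    "&scope=no_expiry,access_team|stackoverflow.com/c/{team_id}"
    "&redirect_uri=https://askscio.com/oauth/"
)


def build_authorize_url(team_id):
    """Fill TEAM_ID into the authorization URL (hypothetical helper)."""
    team_id = team_id.strip()
    if not team_id:
        raise ValueError("team_id must not be empty")
    return AUTHORIZE_URL_TEMPLATE.format(team_id=team_id)


def extract_code(redirect_url):
    """Return the value of the 'code' query parameter (hypothetical helper)."""
    query = parse_qs(urlparse(redirect_url).query)
    codes = query.get("code", [])
    if len(codes) != 1:
        raise ValueError("expected exactly one 'code' parameter in the redirect URL")
    return codes[0]


# "my-team" and "abc123" are placeholders, not real values.
print(build_authorize_url("my-team"))
print(extract_code("https://askscio.com/oauth/?code=abc123"))
# second line prints: abc123
```

The admin opens the first URL in a browser, approves access, and passes the resulting redirect URL to extract_code. The code should still be shared with Glean only over a secure channel, as described in step 3.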