Set up LLMs using the Model Hub

Use the Model Hub to securely configure and manage access to commercial and open-source LLMs for Glean Assistant and Agents. AWS hosted customers can now use Google Gemini models through Google Vertex AI.

GCP hosted customers cannot use AWS hosted models, for example, Amazon Bedrock. This change expands model choice on AWS while keeping cross‑cloud data-path restrictions in place.

What is the Model Hub?

The Model Hub offers a curated set of leading models from providers like OpenAI, Google, Anthropic, Amazon, and Meta. You can bring your own keys for providers or use Glean’s Universal Model Key to access pre‑procured models with guardrails.

Supported models

The following model families are supported for Glean Assistant and Agents. With Enterprise Flex, each agent run uses FlexCredits that are determined by the complexity of an agent including how many data sources it searches, how many steps it takes, how much memory it maintains, how many actions it executes, and the model used for each step. Agents that use higher-tiered models will consume more credits than agents that use lower-tiered models. Availability depends on your hosting environment and provider access:

OpenAI (through OpenAI or Azure OpenAI).
Google Gemini (through Google Vertex AI).
Anthropic (through Google Vertex AI or Amazon Bedrock).
Amazon (through Amazon Bedrock).
Meta Llama (through Google Vertex AI or Amazon Bedrock).

Supported models for Glean agents

Claude 3.7 Sonnet will deprecate on January 28th, 2026. Migrate to Claude Sonnet 4 or Claude Sonnet 4.5 before this date.

Basic


OpenAI (via Azure or OpenAI)	GPT 5 mini GPT 5 nano GPT 4.1 mini GPT-4o-mini
Google Gemini (via Google Vertex AI)	Gemini 2.5 Flash Gemini 2.0 Flash
Llama (via Google Vertex AI or Amazon Bedrock)	Llama 4 Maverick Llama 4 Scout

Standard


OpenAI (via Azure or OpenAI)	GPT 5.2 GPT 5.1 GPT 5 GPT 4.1 GPT-4o o3 o3-mini o4-mini
Google Gemini (via Google Vertex AI)	Gemini 3.0 Pro Preview Gemini 2.5 Pro
Anthropic (via Google Vertex AI or Amazon Bedrock)	Claude Sonnet 4.5 Claude Sonnet 4 Claude 3.7 Sonnet - Sunsetting Jan 28, 2026 Claude 3.5 Sonnet V2 Claude 4.5 Haiku Claude 3.5 Haiku
Amazon	Amazon Nova Pro 1.0
DeepSeek	R-1

Premium

Coming soon

Supported Models for Glean Assistant

We provide a wide range of models for Glean Assistant that are categorized into Basic, Standard, and Premium based on their reasoning capabilities and overall cost to run. If you are on the Universal Model Key, Glean will optimize your experience using the best-in-class Basic and Standard models. Glean will choose the default while giving you model choice. With Enterprise Flex, everyday Assistant queries that leverage Basic and Standard models will not use credits. If you wish to choose to use Premium models for advanced queries, credits will be consumed.

Basic


OpenAI (via Azure or OpenAI)	GPT 4.1 mini GPT-4o-mini
Google Gemini (via Google Vertex AI)	Gemini 2.5 Flash Gemini 2.0 Flash

Standard


OpenAI (via Azure or OpenAI)	GPT 5 GPT 4.1
Google Gemini (via Google Vertex AI)	Gemini 2.5 Pro
Anthropic (via Google Vertex AI or Amazon Bedrock)	Claude Sonnet 4.5 Claude Sonnet 4 Claude 3.7 Sonnet - Sunsetting Jan 28, 2026 Claude 3.5 Haiku

Premium


Google Gemini (via Google Vertex AI)	Nano Banana Pro (Gemini 3 Pro Image)

Cloud availability

Gemini (Vertex AI) is available on both self‑hosted AWS and self‑hosted GCP deployments.
Amazon Bedrock models remain unavailable on self‑hosted GCP deployments.

Enabling the Model Hub

You can enable models using either Glean Universal Model Key (managed by Glean) or your own Customer Keys.

Glean Universal Model Key (formerly Glean Key)

Models are preconfigured and managed by Glean.
Deprecated models are automatically replaced with newer versions.

For Customer Key

Go to the Admin Console.
Select LLM under the Platform tab.
Click Add LLM.
Select a hosting provider and follow the specific configuration steps:
- Azure OpenAI:
  1. Ensure that assistant is configured.
  2. In your Azure Portal, go to Keys and Endpoints, add the Key and Endpoint you want to use with Glean.
  3. (Optional) If you have not configured the assistant, select models for Assistant.
    - For each selected model, add the deployment name configured on Azure.
  4. Select models for Agents.
    - For each selected model, add the deployment name configured on Azure.
  5. Validate the connection.
  6. Click Save.
  7. To remove a model, uncheck it, validate, and save.
- Amazon Bedrock (self-hosted on AWS only):
  1. Enter your preferred region.
  2. (Optional) If you have not configured the assistant, select models for Assistant.
  3. Select models for Agents.
  4. Validate the connection.
  5. Click Save.
  6. To remove a model, uncheck it, validate, and save.
- Google Vertex AI (self-hosted on GCP only):
  1. (Optional) If you have not configured the assistant, select models for Assistant.
  2. Select models for Agents.
  3. Ensure selected models are enabled in Google Model Garden.
  4. Validate the connection.
  5. Click Save.
  6. To remove a model, uncheck it, validate, and save.
- OpenAI:
  1. Enter your API key to be used with Glean from the OpenAI Portal.
  2. (Optional) If you have not configured the assistant, select models for Assistant.
  3. Select models for Agents.
  4. Validate the connection.
  5. Click Save.
  6. To remove a model, uncheck it, validate, and save.
- Self‑hosted AWS deployments can use Gemini through Vertex AI.
- Self‑hosted GCP deployments cannot use models hosted on AWS, for example, Bedrock.

Selecting models for workflows

Default model for an agent: In the agent builder, open Settings (gear icon) > Select Model.\
Per‑step model: In the canvas, select a step and choose a model for that step.

You also have the option of changing the model for only a single step. Click on the step in the canvas and then select the model.

Best practices

Start with a balanced default model for most tasks and upgrade select workflows to higher‑tier models when you need stronger reasoning.
Where possible, keep model families consistent across multi‑step workflows for predictable quality and cost.
Use Customer Keys when you need to align with your enterprise provider contracts and control data routing with cloud restrictions.

Additional information

To learn how to exclude models from being used in your Glean Agents, see Exclude Models From Glean Agents.
To learn how Glean handles model deprecation, including notification timelines, migration paths, and actions needed for assistants and agents, see Model deprecation.

General

Identity

Search

Assistant

Actions

Embedded Integrations

Glean MCP Servers

Protect

Knowledge

Management

Insights

Glean Customer Event Logs

Developer

Managing Agents

Set up LLMs using the Model Hub

What is the Model Hub?

Supported models

Supported models for Glean agents

Basic

Standard

Premium

Supported Models for Glean Assistant

Basic

Standard

Premium

Cloud availability

Enabling the Model Hub

Glean Universal Model Key (formerly Glean Key)

For Customer Key

Selecting models for workflows

Best practices

Additional information

General

Identity

Search

Assistant

Actions

Embedded Integrations

Glean MCP Servers

Protect

Knowledge

Management

Insights

Glean Customer Event Logs

Developer

Managing Agents

​What is the Model Hub?

​Supported models

​Supported models for Glean agents

​Basic

​Standard

​Premium

​Supported Models for Glean Assistant

​Basic

​Standard

​Premium

​Cloud availability

​Enabling the Model Hub

​Glean Universal Model Key (formerly Glean Key)

​For Customer Key

​Selecting models for workflows

​Best practices

​Additional information

What is the Model Hub?

Supported models

Supported models for Glean agents

Basic

Standard

Premium

Supported Models for Glean Assistant

Basic

Standard

Premium

Cloud availability

Enabling the Model Hub

Glean Universal Model Key (formerly Glean Key)

For Customer Key

Selecting models for workflows

Best practices

Additional information