Skip to main content

Set up LLMs using the Model Hub

Use the Model Hub to securely configure and manage access to commercial and open-source LLMs for Glean Assistant and Agents.

The Model Hub offers a curated set of leading models from providers like OpenAI, Google, Anthropic, and Amazon. You can bring your own keys for providers or use Glean’s Universal Model Key to access pre‑procured models with guardrails.

For details on each deployment model, see Glean deployment models.

note

GCP hosted customers cannot use AWS hosted models, for example, Amazon Bedrock. This change expands model choice on AWS while keeping cross‑cloud data-path restrictions in place.

Supported models

Glean supports the following model creators. Availability depends on your hosting environment and provider access:

  • OpenAI
  • Google Gemini
  • Anthropic
  • Amazon

Supported models for Glean Agents

Glean provides basic, standard, and premium models for Glean Agents. The model tier depends on the model's reasoning capabilities and overall cost to run.

Basic

OpenAI (via Azure or OpenAI)
  • GPT 5 mini
  • GPT 5 nano
  • GPT 4.1 mini
  • GPT 4.1 nano
  • GPT 4o mini
  • o3-mini
Google Gemini (via Google Vertex AI)
  • Gemini Flash Preview 3
  • Gemini Flash 2.5
  • Gemini Flash Lite Preview 2.5
Anthropic (via Google Vertex AI or Amazon Bedrock)
  • Claude Haiku 4.5
Amazon
  • Amazon Nova Pro 1.0

Standard

OpenAI (via Azure or OpenAI)
  • GPT 5.2
  • GPT 5.1
  • GPT 5
  • GPT 4.1
  • o3
  • o4-mini
Google Gemini (via Google Vertex AI)
  • Gemini 3.5 Flash
  • Gemini Pro 3.1
  • Gemini Pro 2.5

Premium

OpenAI (via Azure or OpenAI)
  • GPT 5.5
  • GPT 5.4
  • o1
  • GPT Image 1.5
  • GPT Image 2
Anthropic (via Google Vertex AI or Amazon Bedrock)
  • Claude Opus 4.8
  • Claude Opus 4.7
  • Claude Opus 4.6
  • Claude Sonnet 4.6
Google Gemini (via Google Vertex AI)
  • Nano Banana Pro (Gemini Pro Image 3)
  • Gemini Flash Image 3.1
  • Gemini Flash Image 2.5

Supported Models for Glean Assistant

Glean provides basic, standard, and premium models for Glean Assistant. The model tier depends on the model's reasoning capabilities and overall cost to run. If your organization uses the Glean Universal Model Key, Glean optimizes your experience by using the best-in-class basic and standard models by default. You can also choose the model for Glean Assistant when you enable models in the Model Hub.

note

If you set one of the following premium models for Glean Assistant, Glean uses these models for both regular and premium requests:

  • GPT 5.4
  • Claude Sonnet 4.6
  • Claude Opus 4.6

Basic

OpenAI (via Azure or OpenAI)
  • GPT 5 mini
  • GPT 5 nano
  • GPT 4.1 mini
  • GPT 4.1 nano
  • GPT 4o mini
Google Gemini (via Google Vertex AI)
  • Gemini 3 Flash Preview
  • Gemini 2.5 Flash
  • Gemini 2.5 Flash Lite Preview
Anthropic (via Google Vertex AI or Amazon Bedrock)
  • Claude 4.5 Haiku

Standard

OpenAI (via Azure or OpenAI)
  • GPT 5.2
  • GPT 5.1
  • GPT 5
  • GPT 4.1
  • o3
Google Gemini (via Google Vertex AI)
  • Gemini 3.5 Flash (Glean Universal Model Key)
  • Gemini Pro 3.1 (Glean Universal Model Key)
  • Gemini Pro Custom Tools 3.1 (Customer Key)
  • Gemini Pro 2.5
note

For Glean Assistant on Customer Key, Gemini Pro 3.1 is not available. Use Gemini Pro Custom Tools 3.1 instead. Gemini Pro 3.1 remains available for Glean Assistant through the Glean Universal Model Key.

Premium

OpenAI (via Azure or OpenAI)
  • GPT 5.5
  • GPT 5.4
  • o1
  • GPT Image 1.5
  • GPT Realtime 1.5
  • GPT 4o Transcribe
Anthropic (via Google Vertex AI or Amazon Bedrock)
  • Claude Opus 4.8
  • Claude Opus 4.7
  • Claude Opus 4.6
  • Claude Sonnet 4.6
Google Gemini (via Google Vertex AI)
  • Nano Banana Pro (Gemini Pro Image 3)
  • Gemini Flash Image 3.1
  • Gemini Flash Image 2.5

Recently deprecated models

The following models have been deprecated and are no longer available for new configurations. Existing deployments using these models have been automatically migrated to the listed replacement.

Deprecated modelReplacementDeprecation date
GPT-3.5 TurboGPT 4.1May 8, 2025
GPT-4 (0125-preview)GPT 4.1May 8, 2025
GPT-4 (1106-preview)GPT 4.1May 8, 2025
GPT-4 TurboGPT 4.1May 8, 2025
GPT-4 (0613)GPT 4.1May 8, 2025
Gemini 1.5 ProGemini Pro 2.5September 26, 2025
Gemini 1.5 FlashGemini Flash 2.5September 26, 2025
Claude 3 SonnetClaude Sonnet 4.6October 28, 2025
Claude 3.5 SonnetClaude Sonnet 4.6October 28, 2025
Claude 3.5 Sonnet V2Claude Sonnet 4.6October 28, 2025
Claude 3.7 SonnetClaude Sonnet 4.6January 28, 2026
ChatGPT-4o latestGPT 5.1February 5, 2026
Claude 3.5 HaikuClaude Haiku 4.5February 5, 2026
Gemini 2.0 FlashGemini Flash 3.0March 3, 2026
Gemini 2.0 Flash LiteGemini Flash Lite Preview 2.5March 3, 2026
Gemini 3.0 ProGemini Pro 3.1March 25, 2026
GPT-4o (2024-11-20)GPT 5.1March 27, 2026
Claude Sonnet 4Claude Sonnet 4.6April 29, 2026
Claude Sonnet 4.5Claude Sonnet 4.6April 29, 2026
Claude 4.5 OpusClaude Opus 4.6April 29, 2026
GPT-4o (2024-05-13)GPT 5.1September 30, 2026
GPT 4o miniGPT 4.1 miniSeptember 30, 2026

Availability by hosting environment and provider

Which LLM hosting providers you can configure depends on your deployment mode and key type:

  • Glean Universal Model Key: Glean manages connectivity to all supported providers regardless of your deployment's cloud environment.
  • Customer Key (BYOK): Provider access is limited to cloud-native providers because cross-cloud access is not supported.
Hosting providerGlean Universal Model KeyCustomer Key — GCP-based deploymentCustomer Key — AWS-based deployment
OpenAI
Azure OpenAI
Google Vertex AI (Gemini, Claude)
Amazon Bedrock (Claude, Amazon Nova)
  • GCP-based deployment includes Glean Hosted and Customer Hosted on GCP.
  • AWS-based deployment refers to Customer Hosted on AWS.
  • Anthropic (Claude) models are accessible through either Google Vertex AI or Amazon Bedrock, depending on your available providers.
  • For model-level differences between key modes, see the notes under Supported models for Glean Assistant.

Pricing

With Enterprise Flex pricing, each agent run uses an amount of FlexCredits determined by the complexity of an agent. This complexity includes how many connectors the agent searches, how many steps it takes, how much memory it maintains, how many tools it executes, and the model used for each step. Agents that use higher-tier models consume more credits than agents that use lower-tier models.

note

With Enterprise Flex pricing, everyday Glean Assistant queries that leverage basic and standard models don't use credits. If you enable premium models for advanced queries, the advanced queries consume credits.

See the following documentation to learn about pricing dashboards:

Configure models in the Model Hub

You can configure models using either Glean Universal Model Key or Customer Key (BYOK).

To exclude models, see Exclude Models From Glean.

note

Self‑hosted AWS deployments can use Gemini through Vertex AI. Self‑hosted GCP deployments cannot use models hosted on AWS, for example, Bedrock.

Glean Universal Model Key

With Glean Universal Model Key, Glean manages model configuration and provisioning for you, however you can still configure which models are available in your deployment:

  1. Go to Admin Console → Platform → LLMs.
  2. On the Models page, use the search bar, capability filter, or scroll to find the model you want to include.
  3. Select the three-dot menu (...) next to the model creator, then click Manage models.
  4. For the model you want to include, toggle the model on.
  5. Click Save to apply your changes. This includes the model across Glean, including Assistant and Agents.

The Models page for Glean Universal Model Key shows per-model toggles grouped by creator. This differs from the Customer Key experience, which uses hosting provider-based configuration.

Customer Key

With Customer Key, you can configure which models are available in your deployment. The Models page for Customer Key shows per-model toggles grouped by hosting provider.

note

For Customer Key deployments, large, small, agentic, and fast agentic model defaults must come from the same provider. Image generation models can use a different provider.

To configure which models are available in your deployment:

  1. Go to Admin Console → Platform → LLMs.
  2. Click Add LLM.
  3. Select a hosting provider and follow the configuration steps:

Select models for workflows

Only models enabled for the Agents appear in the agent builder. Similarly, only models enabled for the Assistant are available when users select models in Glean Chat.

See Exclude Models From Glean for more information on how to exclude models from Glean.

Set default model for an agent

In the agent builder, open Settings (gear icon) → Select Model.

Set per‑step model

In the canvas, select a step and choose a model for that step.

Best practices

  • Start with a balanced default model for most tasks and upgrade select workflows to higher‑tier models when you need stronger reasoning.
  • Where possible, keep model families consistent across multi‑step workflows for predictable quality and cost.
  • For Customer Keys: Align with your enterprise provider contracts and control data routing with cloud restrictions.

See also

  • To learn how to exclude models from being used in your Glean Agents, see Exclude models from Glean Agents.
  • To learn how Glean handles model deprecation, including notification timelines, migration paths, and tools needed for assistants and agents, see Model deprecation.
  • To learn how users can select a model for Glean Chat conversations, see Model choice.
  • To monitor LLM usage and reliability for Customer Key deployments, see LLM Insights.
  • To resolve errors when connecting to your LLM provider, see Troubleshoot LLM provider errors.