Set up LLMs using the Model Hub
Use the Model Hub to securely configure and manage access to commercial and open-source LLMs for Glean Assistant and Agents.
The Model Hub offers a curated set of leading models from providers like OpenAI, Google, Anthropic, and Amazon. You can bring your own keys for providers or use Glean’s Universal Model Key to access pre‑procured models with guardrails.
For details on each deployment model, see Glean deployment models.
GCP hosted customers cannot use AWS hosted models, for example, Amazon Bedrock. This change expands model choice on AWS while keeping cross‑cloud data-path restrictions in place.
Supported models
Glean supports the following model creators. Availability depends on your hosting environment and provider access:
- OpenAI
- Google Gemini
- Anthropic
- Amazon
Supported models for Glean Agents
Glean provides basic, standard, and premium models for Glean Agents. The model tier depends on the model's reasoning capabilities and overall cost to run.
Basic
| OpenAI (via Azure or OpenAI) |
|
| Google Gemini (via Google Vertex AI) |
|
| Anthropic (via Google Vertex AI or Amazon Bedrock) |
|
| Amazon |
|
Standard
| OpenAI (via Azure or OpenAI) |
|
| Google Gemini (via Google Vertex AI) |
|
Premium
| OpenAI (via Azure or OpenAI) |
|
| Anthropic (via Google Vertex AI or Amazon Bedrock) |
|
| Google Gemini (via Google Vertex AI) |
|
Supported Models for Glean Assistant
Glean provides basic, standard, and premium models for Glean Assistant. The model tier depends on the model's reasoning capabilities and overall cost to run. If your organization uses the Glean Universal Model Key, Glean optimizes your experience by using the best-in-class basic and standard models by default. You can also choose the model for Glean Assistant when you enable models in the Model Hub.
If you set one of the following premium models for Glean Assistant, Glean uses these models for both regular and premium requests:
- GPT 5.4
- Claude Sonnet 4.6
- Claude Opus 4.6
Basic
| OpenAI (via Azure or OpenAI) |
|
| Google Gemini (via Google Vertex AI) |
|
| Anthropic (via Google Vertex AI or Amazon Bedrock) |
|
Standard
| OpenAI (via Azure or OpenAI) |
|
| Google Gemini (via Google Vertex AI) |
|
For Glean Assistant on Customer Key, Gemini Pro 3.1 is not available. Use Gemini Pro Custom Tools 3.1 instead. Gemini Pro 3.1 remains available for Glean Assistant through the Glean Universal Model Key.
Premium
| OpenAI (via Azure or OpenAI) |
|
| Anthropic (via Google Vertex AI or Amazon Bedrock) |
|
| Google Gemini (via Google Vertex AI) |
|
Recently deprecated models
The following models have been deprecated and are no longer available for new configurations. Existing deployments using these models have been automatically migrated to the listed replacement.
| Deprecated model | Replacement | Deprecation date |
|---|---|---|
| GPT-3.5 Turbo | GPT 4.1 | May 8, 2025 |
| GPT-4 (0125-preview) | GPT 4.1 | May 8, 2025 |
| GPT-4 (1106-preview) | GPT 4.1 | May 8, 2025 |
| GPT-4 Turbo | GPT 4.1 | May 8, 2025 |
| GPT-4 (0613) | GPT 4.1 | May 8, 2025 |
| Gemini 1.5 Pro | Gemini Pro 2.5 | September 26, 2025 |
| Gemini 1.5 Flash | Gemini Flash 2.5 | September 26, 2025 |
| Claude 3 Sonnet | Claude Sonnet 4.6 | October 28, 2025 |
| Claude 3.5 Sonnet | Claude Sonnet 4.6 | October 28, 2025 |
| Claude 3.5 Sonnet V2 | Claude Sonnet 4.6 | October 28, 2025 |
| Claude 3.7 Sonnet | Claude Sonnet 4.6 | January 28, 2026 |
| ChatGPT-4o latest | GPT 5.1 | February 5, 2026 |
| Claude 3.5 Haiku | Claude Haiku 4.5 | February 5, 2026 |
| Gemini 2.0 Flash | Gemini Flash 3.0 | March 3, 2026 |
| Gemini 2.0 Flash Lite | Gemini Flash Lite Preview 2.5 | March 3, 2026 |
| Gemini 3.0 Pro | Gemini Pro 3.1 | March 25, 2026 |
| GPT-4o (2024-11-20) | GPT 5.1 | March 27, 2026 |
| Claude Sonnet 4 | Claude Sonnet 4.6 | April 29, 2026 |
| Claude Sonnet 4.5 | Claude Sonnet 4.6 | April 29, 2026 |
| Claude 4.5 Opus | Claude Opus 4.6 | April 29, 2026 |
| GPT-4o (2024-05-13) | GPT 5.1 | September 30, 2026 |
| GPT 4o mini | GPT 4.1 mini | September 30, 2026 |
Availability by hosting environment and provider
Which LLM hosting providers you can configure depends on your deployment mode and key type:
- Glean Universal Model Key: Glean manages connectivity to all supported providers regardless of your deployment's cloud environment.
- Customer Key (BYOK): Provider access is limited to cloud-native providers because cross-cloud access is not supported.
| Hosting provider | Glean Universal Model Key | Customer Key — GCP-based deployment | Customer Key — AWS-based deployment |
|---|---|---|---|
| OpenAI | ✅ | ✅ | ✅ |
| Azure OpenAI | ✅ | ✅ | ✅ |
| Google Vertex AI (Gemini, Claude) | ✅ | ✅ | ❌ |
| Amazon Bedrock (Claude, Amazon Nova) | ✅ | ❌ | ✅ |
- GCP-based deployment includes Glean Hosted and Customer Hosted on GCP.
- AWS-based deployment refers to Customer Hosted on AWS.
- Anthropic (Claude) models are accessible through either Google Vertex AI or Amazon Bedrock, depending on your available providers.
- For model-level differences between key modes, see the notes under Supported models for Glean Assistant.
Pricing
With Enterprise Flex pricing, each agent run uses an amount of FlexCredits determined by the complexity of an agent. This complexity includes how many connectors the agent searches, how many steps it takes, how much memory it maintains, how many tools it executes, and the model used for each step. Agents that use higher-tier models consume more credits than agents that use lower-tier models.
With Enterprise Flex pricing, everyday Glean Assistant queries that leverage basic and standard models don't use credits. If you enable premium models for advanced queries, the advanced queries consume credits.
See the following documentation to learn about pricing dashboards:
Configure models in the Model Hub
You can configure models using either Glean Universal Model Key or Customer Key (BYOK).
To exclude models, see Exclude Models From Glean.
Self‑hosted AWS deployments can use Gemini through Vertex AI. Self‑hosted GCP deployments cannot use models hosted on AWS, for example, Bedrock.
Glean Universal Model Key
With Glean Universal Model Key, Glean manages model configuration and provisioning for you, however you can still configure which models are available in your deployment:
- Go to Admin Console → Platform → LLMs.
- On the Models page, use the search bar, capability filter, or scroll to find the model you want to include.
- Select the three-dot menu (...) next to the model creator, then click Manage models.
- For the model you want to include, toggle the model on.
- Click Save to apply your changes. This includes the model across Glean, including Assistant and Agents.
The Models page for Glean Universal Model Key shows per-model toggles grouped by creator. This differs from the Customer Key experience, which uses hosting provider-based configuration.
Customer Key
With Customer Key, you can configure which models are available in your deployment. The Models page for Customer Key shows per-model toggles grouped by hosting provider.
For Customer Key deployments, large, small, agentic, and fast agentic model defaults must come from the same provider. Image generation models can use a different provider.
To configure which models are available in your deployment:
- Go to Admin Console → Platform → LLMs.
- Click Add LLM.
- Select a hosting provider and follow the configuration steps:
Select models for workflows
Only models enabled for the Agents appear in the agent builder. Similarly, only models enabled for the Assistant are available when users select models in Glean Chat.
See Exclude Models From Glean for more information on how to exclude models from Glean.
Set default model for an agent
In the agent builder, open Settings (gear icon) → Select Model.

Set per‑step model
In the canvas, select a step and choose a model for that step.

Best practices
- Start with a balanced default model for most tasks and upgrade select workflows to higher‑tier models when you need stronger reasoning.
- Where possible, keep model families consistent across multi‑step workflows for predictable quality and cost.
- For Customer Keys: Align with your enterprise provider contracts and control data routing with cloud restrictions.
See also
- To learn how to exclude models from being used in your Glean Agents, see Exclude models from Glean Agents.
- To learn how Glean handles model deprecation, including notification timelines, migration paths, and tools needed for assistants and agents, see Model deprecation.
- To learn how users can select a model for Glean Chat conversations, see Model choice.
- To monitor LLM usage and reliability for Customer Key deployments, see LLM Insights.
- To resolve errors when connecting to your LLM provider, see Troubleshoot LLM provider errors.