Overview

Glean’s Data Analysis for Structured Files feature in Assistant lets users analyze structured files, such as CSV, XLSX, and JSON, directly in the chat interface. It helps finance, sales, support, and IT teams save time on ad hoc analyses.

Example Analysis Capabilities

You can ask questions about your data in various ways, for example:
  • “What columns does my data contain?”
  • Calculate averages, sums, and counts
  • Find top/bottom x values
  • Determine the range of values in a column
  • List all unique values in a column
  • “How does conversion change by lead source?”
  • “What product area has seen increased tickets compared to previous months?”
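
As a rough illustration of what these questions translate to, the following Python sketch computes the same kinds of answers on a tiny in-memory CSV. It is not Glean's implementation. The sample data and the column names (lead_source, deal_size, converted) are hypothetical, and only the Python standard library is used.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical sample data; the column names are made up for illustration.
SAMPLE_CSV = """lead_source,deal_size,converted
web,1200,1
web,800,0
referral,1500,1
referral,2000,1
event,500,0
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))

# "What columns does my data contain?"
columns = list(rows[0].keys())

# Averages, sums, and counts for a numeric column.
deal_sizes = [float(r["deal_size"]) for r in rows]
summary = {
    "count": len(deal_sizes),
    "sum": sum(deal_sizes),
    "mean": mean(deal_sizes),
}

# Top x values (here x = 2), the range, and the unique values of a column.
top_2 = sorted(deal_sizes, reverse=True)[:2]
value_range = (min(deal_sizes), max(deal_sizes))
unique_sources = sorted({r["lead_source"] for r in rows})

# "How does conversion change by lead source?"
# Group the 0/1 conversion flags by source, then average each group.
by_source = defaultdict(list)
for r in rows:
    by_source[r["lead_source"]].append(int(r["converted"]))
conversion_rate = {src: mean(flags) for src, flags in by_source.items()}

print(columns)
print(summary)
print(top_2, value_range, unique_sources)
print(conversion_rate)
```

In Assistant you would ask these questions in plain language instead of writing code. The sketch only shows the kind of computation each question implies.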

Key Features

Assistant UX

Analyze data from files you upload, or from indexed files that you tag or link to in Assistant. You can also generate output files and hold multi-turn conversations about your data.

API Support

Developer platform users can access the File Upload APIs documentation or visit developers.glean.com.

Security

Analysis is performed in a sandboxed environment for each user, ensuring security throughout the file lifecycle. For details, see the security whitepaper.

Supported File Formats

  • XLS
  • XLSX
  • Google Sheets
  • CSV
  • JSON

Known Limitations

Before using the feature, be aware of these current limitations:
1. File Upload Limits

You can upload up to 5 files, each up to 64MB in size. Note that a single XLS/XLSX file can contain multiple tabs.
2. Data Retention

Default policies include 24-hour file retention and 30-day metadata retention. Users can manage retention by disabling chat session history or manually deleting sessions.
3. Complex Sheet Handling

The beta version has limited support for complex sheets (those with merged cells, hierarchical tables, charts, or diagrams) and for sentiment analysis.
4. Model Performance

To ensure optimal performance, choose the model that best fits your specific needs.
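
The upload limits and supported formats listed above can be checked before files are sent to Assistant. The Python sketch below is one way to do that. The check_upload helper is hypothetical and not part of any Glean API. It assumes that 64MB means 64 × 1024 × 1024 bytes and that the supported formats map to the file extensions shown.

```python
from pathlib import PurePath

# Limits taken from the "File Upload Limits" item above.
MAX_FILES = 5
# Assumption: "64MB" is read as 64 * 1024 * 1024 bytes; the documentation
# does not say whether the limit is in decimal or binary megabytes.
MAX_BYTES_PER_FILE = 64 * 1024 * 1024
# Assumption: extensions inferred from the supported formats list. Google
# Sheets are assumed to be referenced by link, so they have no entry here.
SUPPORTED_EXTENSIONS = {".xls", ".xlsx", ".csv", ".json"}


def check_upload(files: dict[str, int]) -> list[str]:
    """Return human-readable problems; an empty list means the batch looks OK.

    `files` maps a file name to its size in bytes.
    """
    problems = []
    if len(files) > MAX_FILES:
        problems.append(f"{len(files)} files selected; the limit is {MAX_FILES}")
    for name, size in files.items():
        ext = PurePath(name).suffix.lower()
        if ext not in SUPPORTED_EXTENSIONS:
            problems.append(f"{name}: unsupported format {ext or '(none)'}")
        if size > MAX_BYTES_PER_FILE:
            problems.append(f"{name}: {size} bytes exceeds the 64 MB limit")
    return problems


# Hypothetical batch: one valid file, one oversized file, one unsupported file.
batch = {
    "pipeline.csv": 2_000_000,
    "tickets.xlsx": 70 * 1024 * 1024,
    "notes.pdf": 10_000,
}
for problem in check_upload(batch):
    print(problem)
```

Returning a list of problems rather than raising on the first one lets a caller report every issue in a batch at once.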

Configuration & Setup

Enabling Data Analysis

The feature is enabled by default unless you have opted out of new features.
As a Glean admin, you can manage user access as follows:
  1. Navigate to Admin Console
  2. Go to Platform > Assistant > Settings
  3. Select Data Analysis
  4. Choose your preferred configuration options
For a visual guide, watch the Data Analysis Admin Self-Serve video.

Code interpreter security

Glean offers two versions of the code interpreter:
  • Native
  • OpenAI Code Interpreter
Customers using the universal Glean key or the Glean key for Azure use the OpenAI Code Interpreter. The security details for each version are outlined below.

OpenAI Code Interpreter security

  • Sandbox isolation: Each analysis runs in a dedicated, short-lived sandbox per user/session. Sandboxes are destroyed after a period of inactivity or when the session ends. Code and sandbox files do not persist across sessions.
  • Glean retention defaults: By default, files are retained for up to 24 hours. In rare cases, files flagged for abuse may be retained for up to 30 days to support auditing. This is consistent with OpenAI’s terms for modified abuse monitoring.
  • Dedicated project: Glean creates a dedicated OpenAI project per customer on the Glean-billed key. Each project uses a project-scoped service account and API key, which enforces strict project-level isolation for chats and files and prevents cross-customer visibility.

Native code interpreter security

For detailed information about our native code interpreter’s security measures, refer to our security whitepaper.