Learn how to configure, set up, and use the S3 Connector for indexing documents stored in Amazon S3 buckets into your Glean instance. It includes supported features, requirements, and detailed setup instructions for both GCP and AWS deployments.
Crawl Type | Full Crawl | Incremental Crawl | People Data | Activity | Update Rate | Webhook | Notes |
---|---|---|---|---|---|---|---|
S3 Connector | Scheduled, entire bucket(s) scan | Modified/new documents since last crawl | N/A | Tracks additions/updates/deletes via full crawls | Default scheduling can be tuned | N/A | Deletion reflected at next full crawl; no webhook support. |
AmazonS3ReadOnlyAccess
(or equivalent) permissions.AmazonS3ReadOnlyAccess
permissions.AmazonS3ReadOnlyAccess
policy and click Next.AmazonS3ReadOnlyAccess
policy and click Next.