Athena
AWS's serverless, pay-per-query SQL engine that runs queries directly against data stored in S3 without requiring infrastructure provisioning or cluster management.
Summary
AWS's serverless, pay-per-query SQL engine that runs queries directly against data stored in S3 without requiring infrastructure provisioning or cluster management.
Athena is the lowest-friction entry point for querying S3 data in the AWS ecosystem. It reads Parquet, ORC, JSON, CSV, and Iceberg tables registered in Glue Catalog, making it the default ad-hoc analytics tool for AWS-centric data lakes.
- Athena charges per terabyte scanned, not per query. Without columnar formats (Parquet) and partition pruning, costs escalate rapidly on large datasets.
- Athena v3 (Trino-based) and Athena v2 (Presto-based) have different SQL compatibility and performance characteristics. Engine version must be explicitly selected.
- Athena is not suitable for low-latency, high-concurrency workloads. Each query has cold-start overhead and there are per-account concurrency limits.
scoped_toS3, Lakehouse — serverless SQL over S3depends_onAWS Glue Catalog — reads table metadata from Gluedepends_onApache Parquet — optimal performance requires columnar formatsconstrained_byCold Scan Latency — full-table scans on large S3 datasets are slow and expensive
Definition
AWS's serverless, pay-per-query SQL engine that reads data directly from S3. Supports Iceberg, Delta, and Hudi table formats via integration with AWS Glue Catalog.
Running always-on query clusters (Spark, Trino) for ad-hoc analytics is expensive when usage is sporadic. Athena provides instant SQL access to S3 data with no infrastructure to manage and a pure per-query pricing model.
Ad-hoc SQL queries over S3 data lakes, serverless Iceberg table queries, log analysis, cost-efficient exploratory analytics.
Connections 7
Outbound 6
depends_on2implements1solves1Inbound 1
enables1Resources 3
Official AWS Athena documentation for the serverless query service that runs SQL directly against data on S3 without infrastructure provisioning.
Athena's Iceberg integration guide covering ACID transactions, time travel, and schema evolution for S3-native lakehouse queries.
Athena pricing page documenting the per-query, per-bytes-scanned cost model critical for understanding S3 query economics.