Technology

SAP HANA Cloud Data Lake

SAP HANA Cloud's data-lake tier — extends the in-memory HANA database with **virtual tables that provide read-only access to Apache Iceberg data sitting in external object storage** (AWS S3, Azure Blob, ADLS Gen2, Google Cloud Storage). Pairs with **HANA Native Storage Extension (NSE)** to tier warm/cold data directly to S3-compatible endpoints without application refactoring. Lets SAP-shop analytical workloads query Iceberg-on-S3 lakehouses through standard HANA SQL, sharing identity, permissions, and observability with the rest of the SAP stack.

3 connections

Definition

What it is

SAP HANA Cloud's data-lake tier — extends the in-memory HANA database with **virtual tables that provide read-only access to Apache Iceberg data sitting in external object storage** (AWS S3, Azure Blob, ADLS Gen2, Google Cloud Storage). Pairs with **HANA Native Storage Extension (NSE)** to tier warm/cold data directly to S3-compatible endpoints without application refactoring. Lets SAP-shop analytical workloads query Iceberg-on-S3 lakehouses through standard HANA SQL, sharing identity, permissions, and observability with the rest of the SAP stack.

Why it exists

SAP shops have decades of investment in HANA's in-memory analytical engine, but the per-GB cost of HANA's primary tier makes large historical / archival data economically prohibitive — and the modern lakehouse pattern (Iceberg on S3) lives in a completely separate query stack. SAP HANA Cloud Data Lake closes that gap: keep the hot transactional state in HANA in-memory, push warm/cold + historical analytical data to S3-compatible object storage, query both as virtual tables from the same SQL session. The 2025-2026 evolution added direct Iceberg-on-S3 reads, which is the key 2026 lakehouse interoperability story for SAP customers.

Primary use cases

SAP shops needing to query Iceberg-on-S3 lakehouses without moving data into HANA primary, tiering HANA warm/cold data to S3 via Native Storage Extension, historical/archival analytical queries where in-memory cost is prohibitive, integrating non-SAP data lakes (vendor S3 buckets, ADLS Gen2, GCS) into HANA-native analytical pipelines, and hybrid stacks where SAP Datasphere is the federation point.

Recent developments

Latest signals
  • Iceberg-on-S3 virtual tables now read-supported from HANA database. SAP HANA Cloud now enables users to create virtual tables within the HANA database providing read-only access to Apache Iceberg data in external object storage — AWS S3, Azure Blob, ADLS Gen2, GCS. Per SAP Community — Datasphere Object Store of BDC.
  • Data Lake Files app: SSO + X509 authentication. The Data Lake Files app in SAP HANA Cloud Central now supports Single Sign-On in addition to X509 credentials, enabled through the "Enable Single Sign-On for Data Lake Files" menu. Per SAP Community — Sept 2025 What's New.
  • HANA Native Storage Extension (NSE) tiers warm/cold to S3-compatible. Direct tiering of warm and cold data to S3-compatible endpoints without application refactoring — keeps SQL access semantics the same while paying object-storage prices for cold data. Per SAP Titan — S3 Object Storage vs Traditional.
  • S3 operates at 10-100ms per operation — suitable for backup/archive/data-lake but NOT HANA primary. Performance-tier guidance: object storage is correct for analytical-throughput workloads, wrong for HANA's primary in-memory transactional tier. Per SAP Titan — S3 vs Traditional.
  • MinIO + SAP HANA Cloud + Data Intelligence pipeline pattern documented. MinIO published an integration pattern combining MinIO as the on-prem S3 layer with SAP HANA Cloud + SAP Data Intelligence for hybrid lakehouse pipelines. Per MinIO blog — SAP HANA + MinIO data pipelines.

Connections 3

Outbound 3
scoped_to1
depends_on1
enables1