Architecture

Interoperability Patterns

Architectural strategies for enabling multiple table formats (Iceberg, Delta, Hudi), query engines (Spark, Trino, Flink), and catalogs to operate on the same S3-stored data without requiring format conversion or data duplication.

6 connections 3 resources

Summary

What it is

Architectural strategies for enabling multiple table formats (Iceberg, Delta, Hudi), query engines (Spark, Trino, Flink), and catalogs to operate on the same S3-stored data without requiring format conversion or data duplication.

Where it fits

Interoperability patterns address the fragmentation challenge in the S3 lakehouse ecosystem. Tools like Apache XTable, Delta UniForm, and Iceberg REST Catalog enable organizations to avoid lock-in to a single table format or query engine while keeping a single copy of data on S3.

Misconceptions / Traps
  • Format interoperability (XTable, UniForm) adds a metadata translation layer. Each format's metadata must be kept in sync, and features unique to one format (e.g., Iceberg's hidden partitioning) may not translate cleanly.
  • "Read interoperability" is easier than "write interoperability." Multiple engines reading Iceberg tables is well-supported; multiple engines writing concurrently to the same Iceberg table requires careful commit conflict resolution.
  • Catalog interoperability is as important as format interoperability. An engine that reads Iceberg but uses HMS while another uses Glue will see different table states unless catalogs are synchronized.
Key Connections
  • scoped_to Table Formats, Lakehouse — cross-format and cross-engine compatibility
  • enabled_by Apache XTable, Delta UniForm — format translation tools
  • enabled_by Iceberg REST Catalog Spec — standardized catalog interface
  • solves Vendor Lock-In — reduces dependence on a single format or engine

Definition

What it is

Architectural patterns that enable multiple table formats (Iceberg, Delta, Hudi), query engines, and catalogs to coexist on the same S3 data — including format translation (Apache XTable, Delta UniForm) and multi-catalog federation.

Why it exists

Organizations rarely standardize on a single table format or query engine. Interoperability patterns enable teams to use the best tool for each job while sharing the same physical data on S3, avoiding costly format migrations and data duplication.

Primary use cases

Multi-format lakehouse environments, incremental migration from Delta to Iceberg, cross-engine analytics on shared S3 data.

Connections 6

Outbound 6

Resources 3