Technology

Delta UniForm

A Delta Lake feature that automatically generates Iceberg and Hudi metadata for Delta tables, enabling cross-format reads without data copying.

6 connections 2 resources

Summary

What it is

A Delta Lake feature that automatically generates Iceberg and Hudi metadata for Delta tables, enabling cross-format reads without data copying.

Where it fits

UniForm is Delta Lake's native answer to the interoperability problem. Instead of using an external translator (XTable), UniForm maintains Iceberg-compatible metadata as a side-effect of every Delta write. This makes it simpler for Databricks-centric environments but limits control.

Misconceptions / Traps
  • UniForm does not support all Delta features in Iceberg mode. Liquid Clustering, for example, is not compatible with UniForm-generated Iceberg metadata.
  • Iceberg metadata is read-only. External Iceberg engines can read but cannot write to UniForm-exposed tables.
Key Connections
  • depends_on Delta Lake — Delta-native feature
  • enables Apache Iceberg — generates compatible metadata for Iceberg readers
  • competes_with Apache XTable — alternative interoperability approach

Definition

What it is

A Delta Lake feature that automatically generates Apache Iceberg and Apache Hudi metadata for Delta tables, enabling engines that read Iceberg or Hudi to access Delta data without any data copying.

Why it exists

Delta Lake is the default format in the Databricks ecosystem, but many query engines (Trino, Snowflake, Athena) prefer Iceberg. UniForm bridges this gap by maintaining Iceberg-compatible metadata alongside Delta's native metadata, eliminating the interoperability penalty.

Primary use cases

Multi-engine access to Delta tables via Iceberg clients, format interoperability in mixed Databricks/non-Databricks environments.

Recent developments

Latest signals
  • Cross-format reads matured; Delta UniForm is the canonical "Delta-but-readable-as-Iceberg" answer. Per Dremio's Iceberg-vs-Delta-Lake comparison guide, Delta's strongest features and performance are still inside the Databricks ecosystem, but UniForm has closed the cross-format read gap for Trino, Snowflake, and Athena clients — those engines can now read Delta tables as Iceberg without a separate copy. Combined with Delta Connect (the Spark Connect protocol for Delta), the UniForm pattern lets one source-of-truth Delta table serve both Databricks workloads and Iceberg-only readers without ETL duplication.
  • Foundational design context still authoritative. Per Databricks's launch blog, UniForm's design principle is metadata-only — no data file copying — making the cross-format read essentially free for the write side and effectively native for the read side. That blog (August 2023) remains the load-bearing reference for understanding what UniForm does and does not do; it has not been superseded by a newer architecture post.

Connections 6

Outbound 5
scoped_to2
depends_on1
Inbound 1

Resources 2