Technology

Apache XTable

A zero-copy metadata translator (Apache incubating, formerly OneTable) that converts between Iceberg, Delta Lake, and Hudi metadata without copying data files.

7 connections 3 resources

Summary

What it is

A zero-copy metadata translator (Apache incubating, formerly OneTable) that converts between Iceberg, Delta Lake, and Hudi metadata without copying data files.

Where it fits

XTable sits between table formats and query engines, enabling a "write once, read from any format" pattern. Ingest data in Hudi (optimized for CDC) and serve it to Trino and Snowflake via Iceberg metadata — all without data duplication.

Misconceptions / Traps

Zero-copy means no data file duplication, but metadata translation still has a cost. Large tables with millions of files can have non-trivial translation overhead.
Still in Apache incubation. Documentation and build tooling are rough. Production use requires careful testing.

Key Connections

enables Apache Iceberg, Delta Lake, Apache Hudi — cross-format metadata translation
solves Vendor Lock-In — prevents table format lock-in
competes_with Delta UniForm — different approaches to interoperability

Definition

What it is

A metadata translation layer (Apache incubating, formerly OneTable) that performs zero-copy conversion between Apache Iceberg, Delta Lake, and Apache Hudi metadata. No data files are copied or modified — only metadata is translated.

Why it exists

Organizations ingest data in one table format (e.g., Hudi for low-latency CDC) but need to serve it to engines that prefer another format (e.g., Iceberg for Trino/Snowflake). XTable eliminates the need to maintain multiple copies of the same data in different formats.

Primary use cases

Cross-format interoperability without data duplication, enabling multi-engine access to a single dataset, format migration pathways.

Recent developments

Latest signals

v0.3.0-incubating release — CatalogSync + continuous sync + restore/rollback. Per the apache/incubator-xtable releases page, v0.3.0-incubating ships CatalogSync for AWS Glue and Hive Metastore (so XTable now writes the cross-format metadata into the catalog tier, not just the file tier), continuous sync mode, restore and rollback semantics, the bucket-partition transform, schema field comments, timestamp_ntz support, and upgraded parquet-avro to 1.15.1. Combined, these move XTable from "convert metadata when you ask" to "keep metadata in sync continuously across catalogs" — a real operational shape change for teams running multi-format catalogs.
Apache incubation positioning still load-bearing. Per the project's official ASF page, Apache XTable reads existing table metadata and writes metadata for other formats, enabling reads across Delta, Hudi, and Iceberg from any cloud. The project remains under Apache incubation; the framing for organizations evaluating it: XTable is the Apache-incubated answer to the cross-format-metadata translation problem, parallel to but distinct from Delta UniForm's Delta-side approach.