Time Travel
The ability to query a dataset as it existed at a previous point in time by leveraging immutable snapshots and metadata history maintained by table formats on object storage.
Summary
The ability to query a dataset as it existed at a previous point in time by leveraging immutable snapshots and metadata history maintained by table formats on object storage.
Time travel is a core capability enabled by table formats (Iceberg, Delta, Hudi) on S3. Each write operation produces a new snapshot rather than mutating files in place, and the snapshot history allows any prior version of a table to be read without restoring from backup.
- Time travel is not free storage. Every snapshot retains references to data files; without periodic snapshot expiration and orphan file cleanup, storage costs grow linearly with write frequency.
- Time travel depth is bounded by retention policy, not by the format itself. Once snapshots are expired and their data files garbage-collected, those points in time are gone permanently.
- Time travel queries on S3 incur the same GET request costs as current queries. Reading historical data does not bypass S3 pricing.
scoped_toTable Formats — time travel is a table format capabilityenabled_byApache Iceberg, Delta Lake, Apache Hudi — all three formats support snapshot-based time travelscoped_toData Versioning — time travel is a form of data versioning at the table levelconstrained_byMetadata Overhead at Scale — deep snapshot history increases metadata volume
Definition
The ability to query a dataset as it existed at a previous point in time by referencing historical snapshots maintained by a table format on object storage.
S3 objects are immutable, but logical datasets change constantly through inserts, updates, and deletes. Time travel leverages the snapshot history maintained by table formats (Iceberg, Delta, Hudi) to provide deterministic, reproducible reads of past states without maintaining separate backup copies.
Connections 4
Outbound 3
scoped_to3Inbound 1
enables1Resources 3
Apache Iceberg's official documentation on time-travel queries, showing how snapshot isolation enables querying data as of any previous state.
Delta Lake's time-travel implementation via the transaction log, covering version-based and timestamp-based historical queries.
Apache Hudi's time-travel query guide demonstrating timeline-based point-in-time access to table state on object storage.