Technology

Apache Doris

Summary

What it is

A real-time analytical database with native lakehouse capabilities that queries Iceberg, Hudi, and Paimon tables on S3 directly. In late 2025, Doris added native Paimon Deletion Vector support and Hive/FileSystem catalogs.

Where it fits

Doris bridges the gap between real-time serving and lakehouse analytics. Instead of requiring one engine for interactive dashboards and another for batch analytics, Doris can serve sub-second queries directly on S3-stored lakehouse tables, with native support for the Iceberg, Hudi, and Paimon table formats.

Misconceptions / Traps
  • Native lakehouse support does not mean Doris replaces the table format engine. Doris reads lakehouse tables but does not manage compaction, snapshot expiry, or table maintenance — those remain the responsibility of Iceberg/Hudi/Paimon.
  • Sub-second performance depends on query patterns and data layout. Complex joins over large unpartitioned tables on S3 may not achieve interactive latency.
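
The effect of data layout on latency can be illustrated with a back-of-envelope estimate. The sketch below is not a Doris cost model: the table size, partition count, and aggregate S3 read throughput are hypothetical numbers chosen only to show the order-of-magnitude gap between a full scan and a partition-pruned scan.

```python
def estimated_scan_seconds(bytes_scanned: float, throughput_bytes_per_s: float) -> float:
    """Lower-bound scan time: bytes that must be read divided by read throughput."""
    return bytes_scanned / throughput_bytes_per_s


# All figures below are illustrative assumptions, not measurements.
TABLE_BYTES = 2 * 1024**4      # hypothetical 2 TiB table on S3
DAILY_PARTITIONS = 365         # hypothetical layout: one partition per day
THROUGHPUT = 4 * 1024**3       # hypothetical 4 GiB/s aggregate read throughput

# Unpartitioned table: a filter on the date column still reads every file.
full_scan = estimated_scan_seconds(TABLE_BYTES, THROUGHPUT)

# Date-partitioned table: a one-day filter reads roughly 1/365 of the data,
# assuming evenly sized partitions.
one_day_scan = estimated_scan_seconds(TABLE_BYTES / DAILY_PARTITIONS, THROUGHPUT)

print(f"full scan:   {full_scan:.1f} s")
print(f"one-day scan: {one_day_scan:.2f} s")
```

The estimate ignores column pruning, file-level statistics, caching, and join cost, all of which change the real numbers. It only shows why the same query can be interactive on a well-partitioned table and far from it on an unpartitioned one.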

Key Connections
  • reads_from Apache Iceberg, Apache Hudi, Apache Paimon — native lakehouse table reading
  • implements S3 API — direct S3 data access
  • solves Cold Scan Latency — interactive performance on S3 data

Definition

What it is

A real-time analytical database with native lakehouse capabilities, supporting direct queries over Apache Iceberg, Hudi, and Paimon tables on S3. In late 2025, Doris added native support for Paimon Deletion Vectors and Hive/FileSystem catalogs.

Why it exists

Real-time analytics on S3-based lakehouses traditionally requires multiple engines — one for ingestion, another for serving. Doris combines real-time ingestion with sub-second query performance, querying S3-stored lakehouse tables directly without requiring data movement.
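
As a rough illustration of querying in place, the sketch below renders the two SQL statements involved: registering an external catalog, then selecting from a table through it. The helper `catalog_ddl`, the catalog, database, and table names, the bucket, and the endpoint are all hypothetical. The property keys are assumed from the Doris Paimon catalog documentation and should be checked against the Doris version in use.

```python
def catalog_ddl(name: str, properties: dict[str, str]) -> str:
    """Render a CREATE CATALOG statement from a property map (hypothetical helper).

    Values are inserted verbatim, so this is only suitable for trusted input.
    """
    props = ",\n".join(
        f'    "{key}" = "{value}"' for key, value in properties.items()
    )
    return f"CREATE CATALOG {name} PROPERTIES (\n{props}\n);"


# Assumed property keys for a Paimon filesystem catalog on S3;
# every value here is a placeholder.
paimon_props = {
    "type": "paimon",
    "paimon.catalog.type": "filesystem",
    "warehouse": "s3://example-bucket/paimon_warehouse",
    "s3.endpoint": "s3.us-east-1.amazonaws.com",
    "s3.region": "us-east-1",
    "s3.access_key": "<access-key>",
    "s3.secret_key": "<secret-key>",
}

ddl = catalog_ddl("paimon_s3", paimon_props)

# Once the catalog exists, tables are addressed as catalog.database.table
# and read from S3 at query time; nothing is copied into Doris first.
query = (
    "SELECT COUNT(*) FROM paimon_s3.sales_db.orders "
    "WHERE order_date = '2025-01-01';"
)

print(ddl)
print(query)
```

Doris speaks the MySQL wire protocol, so statements like these would typically be sent with any MySQL-compatible client connected to the frontend query port.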

Primary use cases

Real-time analytics over S3 lakehouse tables, sub-second dashboards on Iceberg/Hudi/Paimon data, unified real-time and batch query serving.
