Standard

Apache ORC

Optimized columnar format with indexing.

1 connections

Definition

What it is

Optimized columnar format with indexing.

Recent developments

Latest signals
  • Reading and Writing the Apache ORC Format. Apache Arrow supports reading and writing ORC files. PyArrow built with ORC support bundled. Ideal in-memory representation layer for ORC data. Per arrow.apache.org.
  • Apache ORC. Official Apache ORC project site. ACID support, built-in indexes (min/max, bloom filters), complex types support. Per orc.apache.org.
  • Columnar storage formats: Parquet, ORC, and Arrow explained. May 2026 ClickHouse engineering article. ORC ecosystem narrower than Parquet; dominant in Hive/HDFS. Spark, Trino, Presto read ORC well. ClickHouse, DuckDB, BigQuery, Snowflake have varying levels of support. Per clickhouse.com (2026-05-09).
  • GitHub. Apache ORC Format 1.0.0 protobuf definitions for Apache ORC 2.0+. Standardized open-source columnar storage format spec. Per GitHub (apache/orc-format) (2023-12-05).
  • ORC Adopters. Apache ORC adopters page listing Hadoop, Spark, Arrow, Flink, Iceberg, Druid, Hive, Impala, Gobblin, Nifi, Pig, EEL, and Facebook (300+ PB). Per orc.apache.org (2015-07-16).

Connections 1

Outbound 1
scoped_to1