Standard
Apache ORC
Optimized columnar format with indexing.
1 connections
Definition
What it is
Optimized columnar format with indexing.
Recent developments
Latest signals
- Reading and Writing the Apache ORC Format. Apache Arrow supports reading and writing ORC files. PyArrow built with ORC support bundled. Ideal in-memory representation layer for ORC data. Per arrow.apache.org.
- Apache ORC. Official Apache ORC project site. ACID support, built-in indexes (min/max, bloom filters), complex types support. Per orc.apache.org.
- Columnar storage formats: Parquet, ORC, and Arrow explained. May 2026 ClickHouse engineering article. ORC ecosystem narrower than Parquet; dominant in Hive/HDFS. Spark, Trino, Presto read ORC well. ClickHouse, DuckDB, BigQuery, Snowflake have varying levels of support. Per clickhouse.com (2026-05-09).
- GitHub. Apache ORC Format 1.0.0 protobuf definitions for Apache ORC 2.0+. Standardized open-source columnar storage format spec. Per GitHub (apache/orc-format) (2023-12-05).
- ORC Adopters. Apache ORC adopters page listing Hadoop, Spark, Arrow, Flink, Iceberg, Druid, Hive, Impala, Gobblin, Nifi, Pig, EEL, and Facebook (300+ PB). Per orc.apache.org (2015-07-16).
Connections 1
Outbound 1
scoped_to1