Standard
Specifications, protocols, and file formats that define interoperability.
21 nodesThe HTTP-based API for object storage operations — PUT, GET, DELETE, LIST, multipart upload. The de-facto standard for object stor…
A columnar file format specification designed for efficient analytical queries. Stores data by column, enabling predicate pushdown…
A cross-language in-memory columnar data format specification with libraries for zero-copy reads, IPC, and efficient analytics.
The specification defining how a logical table is represented as metadata files, manifest lists, manifests, and data files on obje…
The specification for ACID transaction logs over Parquet files on object storage. Defines how writes, deletes, and schema changes …
The specification for managing incremental data processing on object storage — record-level upserts, deletes, change logs, and tim…
Optimized Row Columnar file format specification — a columnar format with built-in indexing, compression, and predicate pushdown s…
A row-based data serialization format with rich schema definition and built-in schema evolution support. Schemas are stored with t…
A Kubernetes API standard for provisioning and managing object storage buckets as native Kubernetes resources, analogous to CSI (C…
An open REST API specification for Apache Iceberg catalog operations, enabling multi-engine interoperability through a standardize…
A protocol family for accessing NVMe storage devices over network fabrics (RDMA, TCP, Fibre Channel), enabling disaggregated flash…
A network transport protocol for direct memory-to-memory data transfer between machines, bypassing the operating system kernel and…
An NVMe SSD specification that exposes storage as sequential-write zones instead of random-access blocks, reducing write amplifica…
The AWS cryptographic request signing protocol used to authenticate and authorize S3 API requests. Every S3 request is signed with…
An S3 API extension that provides write-once-read-many (WORM) protection for objects, preventing deletion or modification for a sp…
Conflict-free Replicated Data Types — mathematical data structures that can be replicated across multiple sites and merged without…
An open standard that defines a common JSON schema for capturing data lineage events — what datasets were consumed, what was produ…
A specialized S3 bucket type with a hierarchical directory namespace optimized for high-performance, high-request-rate workloads. …
The 2025 evolution of the Apache Iceberg table specification, introducing Row Lineage for row-level provenance tracking, native CD…
A modern columnar data format optimized for random access and vector search on object storage, providing up to 100x faster random …
A formal agreement between data producers and data consumers that specifies the schema, semantics, SLAs, and quality expectations …