Standard
Specifications, protocols, and file formats that define interoperability.
27 nodesThe HTTP-based API for object storage operations — PUT, GET, DELETE, LIST, multipart upload. The de-facto standard for object stor…
A columnar file format specification designed for efficient analytical queries. Stores data by column, enabling predicate pushdown…
A cross-language in-memory columnar data format specification with libraries for zero-copy reads, IPC, and efficient analytics.
The specification defining how a logical table is represented as metadata files, manifest lists, manifests, and data files on obje…
The specification for ACID transaction logs over Parquet files on object storage. Defines how writes, deletes, and schema changes …
The specification for managing incremental data processing on object storage — record-level upserts, deletes, change logs, and tim…
Optimized Row Columnar file format specification — a columnar format with built-in indexing, compression, and predicate pushdown s…
A row-based data serialization format with rich schema definition and built-in schema evolution support. Schemas are stored with t…
A Kubernetes API standard for provisioning and managing object storage buckets as native Kubernetes resources, analogous to CSI (C…
An open REST API specification for Apache Iceberg catalog operations — namespace/table listing, metadata load, commit, snapshot ma…
A protocol family for accessing NVMe storage devices over network fabrics (RDMA, TCP, Fibre Channel), enabling disaggregated flash…
IETF RFC 5661 — a stateful evolution of NFS that introduces sessions, parallel NFS (pNFS), and close-to-open consistency semantics…
A network transport protocol for direct memory-to-memory data transfer between machines, bypassing the operating system kernel and…
An NVMe SSD specification that exposes storage as sequential-write zones instead of random-access blocks, reducing write amplifica…
The AWS cryptographic request signing protocol used to authenticate and authorize S3 API requests. Every S3 request is signed with…
An S3 API extension that provides write-once-read-many (WORM) protection for objects, preventing deletion or modification for a sp…
Conflict-free Replicated Data Types — mathematical data structures that can be replicated across multiple sites and merged without…
An open standard that defines a common JSON schema for capturing data lineage events — what datasets were consumed, what was produ…
A specialized S3 bucket type with a hierarchical directory namespace — forward slash is a true directory boundary, not a delimiter…
The 2025 evolution of the Apache Iceberg table specification, introducing Row Lineage for row-level provenance tracking, native CD…
A binary format defined inside the Apache Iceberg specification for storing table-level statistics, indexes, and (in V3) deletion …
A next-generation open-source columnar file format incubating at the Linux Foundation AI & Data Foundation, designed to supersede …
A columnar file format from Meta, purpose-built for ML feature engineering on wide tables (10K+ columns), using block encoding for…
A modern columnar data format optimized for random access and vector search on object storage, providing up to 100x faster random …
A formal agreement between data producers and data consumers that specifies the schema, semantics, SLAs, and quality expectations …
An open, vendor-neutral protocol — frequently called "**USB-C for AI**" — that standardizes how reasoning engines (LLMs and agenti…
Compute Express Link 3.0 — the third-generation specification (published February 2026) that extends PCIe capabilities to create *…