RDMA-Accelerated Object Access
Using RDMA network transport for microsecond-level object storage access within high-performance computing clusters, bypassing kernel network stacks for direct memory-to-memory data transfer.
Summary
Using RDMA network transport for microsecond-level object storage access within high-performance computing clusters, bypassing kernel network stacks for direct memory-to-memory data transfer.
RDMA-accelerated access targets the performance gap between local NVMe and network-attached object storage. Within a data center cluster, RDMA can deliver object access latency approaching local disk — enabling S3-compatible storage to serve latency-sensitive AI and HPC workloads.
- RDMA is a data center technology. It does not work across the public internet or across typical WAN links. Benefits are limited to intra-cluster or intra-DC access.
- The S3 API itself is HTTP-based and cannot use RDMA directly. RDMA acceleration typically operates at the storage backend level, beneath the S3 API layer.
depends_onRDMA (RoCE v2 / InfiniBand) — the transport protocolsolvesCold Scan Latency — microsecond access within clustersscoped_toObject Storage — high-performance storage access pattern
Definition
Using RDMA (Remote Direct Memory Access) network transport to access object storage data with microsecond-level latency, bypassing the TCP/IP stack and CPU overhead of HTTP-based S3 access.
Intra-cluster data movement for erasure coding, replication, and shuffle operations dominates storage backend performance. RDMA eliminates protocol overhead for these internal data paths, dramatically increasing throughput.
High-performance intra-cluster replication, erasure code reconstruction, AI/ML storage fabric, low-latency internal data paths.
Recent developments
- MinIO AIStor + NVIDIA GPUDirect RDMA tech preview shipping. First open-source S3-compatible storage to ship GPUDirect RDMA — turns the object store into a true parallel data plane, all nodes participating in parallel to push/pull data over RDMA to GPU servers. 200+ GB/s sustained, 45% GPU-server CPU reduction. Per MinIO Blog — AIStor with NVIDIA GPUDirect RDMA for S3-Compatible Storage.
- Cloudian shipped RDMA for HyperStore at GA. First commercial S3-compatible storage vendor to ship at GA — frames RDMA-for-S3 as "moving beyond AI to become the new standard for modern data architectures." Per Cloudian — RDMA for S3-Compatible Storage: New Standard and Cloudian — Deploys NVIDIA RDMA for S3-Compatible Storage.
- NVIDIA announced upcoming GA for RDMA-for-S3. Tech preview shipped in MinIO + Cloudian; NVIDIA is preparing the broader GA milestone — the protocol layer + tooling is approaching production-default status. Per MinIO Blog — AIStor + NVIDIA GPUDirect RDMA.
- S3 API extensions for RDMA:
x-amz-rdma-tokenHTTP header. RDMA negotiation rides on top of the standard S3 control plane via customx-amz-rdma-tokenheaders — backward-compatible with non-RDMA clients (they ignore the headers and fall back to HTTP). The cleanest possible protocol extension shape. Per project notes (cuObject thread). - 2026 RDMA-for-S3 vendor roster: MinIO, Cloudian, VAST, DDN, HPE Alletra X10000. Cross-vendor S3-RDMA implementations are the convergence signal — the pattern is settled at the protocol layer. Per Cloudian — S3-Compatible Storage Providers Top 5 2026.
- Internal-data-path use case (replication, EC reconstruction) is the unsung win. Headlines focus on GPU-to-object-storage, but the bigger sustained benefit is intra-cluster: replication + erasure-coding repair + rebalancing — operations that move TB-scale data inside the storage cluster — go from TCP-bottlenecked to wire-speed-RDMA. Per Cloudian — RDMA for S3-Compatible Storage.
Connections 5
Outbound 3
Inbound 2
augments1enables1Resources 2
NVIDIA networking solutions for InfiniBand and RoCE enabling RDMA-accelerated storage access in data centers.
Whitepaper on RoCE v2 deployment covering lossless fabric design and performance characteristics for storage workloads.