The Local-First S3 Index for LLM Data Infrastructure

410 concepts · 1838 relationships · 48 guides

What's in the index

Topic 23 · Technology 177 · Standard 35 · Architecture 85 · Pain Point 54 · Model Class 21 · LLM Capability 15

Browse by topic

23 top-level concepts

Each technology, standard, and architecture in the index belongs to one or more topics, the conceptual anchors that define the S3 / AI-memory-infrastructure ecosystem. The seven topics added in the May 16, 2026 wave are marked NEW.

Amazon's Simple Storage Service and the broader ecosystem of S3-compatible object storage. The root concept of this e...

239 connections

Object Storage

The storage paradigm of flat-namespace, HTTP-accessible binary objects with metadata. Data is addressed by bucket and...

150 connections

AI Memory Infrastructure NEW

The emerging tier of persistent, object-storage-backed memory architecture sitting between GPU HBM and cold S3 — the ...

71 connections

Table Formats

The category of specifications (Iceberg, Delta, Hudi) that bring table semantics — schema, partitioning, ACID transac...

49 connections

Lakehouse

The convergence of data lake storage (raw files on object storage) with data warehouse capabilities — ACID transactio...

44 connections

LLM-Assisted Data Systems

The intersection of large language models and S3-centric data infrastructure. Scoped strictly to cases where LLMs ope...

40 connections

AI Runtime Infrastructure NEW

The layer of standardized orchestration fabrics, communication protocols, model gateways, and agent runtimes that sit...

36 connections

Vector Indexing on Object Storage

The practice of building and querying vector indexes over embeddings derived from data stored in S3.

30 connections

Object Storage for AI Data Pipelines

Using S3 as the central data layer for machine learning workflows: storing training data, model checkpoints, feature ...

25 connections

Metadata Management

The discipline of maintaining catalogs, schemas, statistics, and descriptive information about objects and datasets s...

20 connections

AI Memory Governance NEW

The compliance, audit, lineage, and retention discipline applied to persistent AI memory — extending traditional data...

19 connections

Sovereign Storage

The practice of deploying S3-compatible object storage on infrastructure that is fully controlled by a specific organ...

17 connections

Data Lake

The pattern of storing raw, heterogeneous data in object storage for later processing. Data arrives in its original f...

15 connections

Inference Locality NEW

The architectural shift toward minimizing data movement between storage and inference compute — placing computation a...

14 connections

Geo / Edge Object Storage

Deploying S3-compatible object storage at geographically distributed edge locations with synchronization to a central...

12 connections

GPU + Object Storage Convergence NEW

The set of technologies eliminating CPU bounce-buffers between object storage and GPU memory — establishing direct me...

10 connections

Retrieval Engineering NEW

The discipline of building production retrieval systems that go beyond basic Retrieval-Augmented Generation (RAG) — o...

7 connections

Data Versioning

Techniques for tracking and managing changes to datasets stored in object storage over time, including snapshots, bra...

6 connections

Directory Buckets / Hot Object Storage

A purpose-built storage tier designed for single-digit millisecond latency, using a directory-based namespace within ...

6 connections

Kubernetes Object Provisioning & Policy

Kubernetes-native provisioning and management of S3 buckets using operators, the Container Object Storage Interface (...

5 connections

Distributed Context Systems NEW

The orchestration of memory and shared state across multi-agent environments — the architectural pattern that enables...

5 connections

Metadata-First Object Storage

A design philosophy that treats object metadata as a first-class, queryable resource rather than an afterthought. Ena...

4 connections

Time Travel

The ability to query a dataset as it existed at a previous point in time by leveraging immutable snapshots and metada...

4 connections

How the ecosystem connects

Every technology in this index exists to solve a problem. The relationships below show which operational pain points matter most and which tools address them.

Vendor Lock-In

Dependence on a single S3 provider's proprietary features, pricing, or integrations that makes migra...

MinIO competes RustFS · pgsty/minio Fork alt MinIO

Cold Scan Latency

Slow first-query performance against S3-stored data, caused by object discovery, metadata fetching, ...

Spice.ai · Weaviate · Qdrant · Actian VectorAI DB · Lakehouse Architecture · Hybrid S3 + Vector Index · NVMe-backed Object Tier · Apache Parquet · ORC · +28 more

DuckDB alt DataFusion · DuckDB alt Polars

Egress Cost

The cost charged by cloud providers for data transferred out of their S3 service — to the internet, ...

Cloudflare R2 · Backblaze B2 · Wasabi · Wasabi AiR · Local Inference Stack · Tiered Storage · Cache-Fronted Object Storage · Storage Class Lifecycle Recommendation · +13 more

IDrive e2 alt Wasabi · Hexabyte alt Wasabi

Memory Wall

The architectural ceiling created by the diverging trajectories of compute throughput (which has sca...

WEKA · Mem0 · NVIDIA BlueField-4 · Inference Context Memory Storage (ICMS) · Multi-Head Latent Attention (MLA) · TurboQuant · ObjectCache · +10 more

WEKA competes VAST Data · Zep competes Mem0

Schema Evolution

Changing data schemas (adding columns, renaming fields, altering types) in S3-stored datasets withou...

Apache Iceberg · Delta Lake · Apache Hudi · lakeFS · Write-Audit-Publish · Branching / Tagging · Iceberg Table Spec · Delta Lake Protocol · +5 more

Apache Hudi competes Apache Paimon · DuckLake alt Apache Iceberg

Metadata Overhead at Scale

Table format metadata (manifests, snapshots, statistics) grows as S3 datasets grow, eventually slowi...

DuckLake · Amazon S3 Tables · Amazon S3 Metadata · AWS Glue Catalog · Manifest Pruning · Hybrid Metadata Patterns · Puffin File Format · Puffin Format · +4 more

DuckLake alt Apache Iceberg · DuckLake alt Apache Hudi

Legacy Ingestion Bottlenecks

Older ETL systems designed for HDFS or traditional databases that cannot efficiently write to modern...

Apache Ozone · Apache Hudi · Bytewax · Apache Airflow · CDC into Lakehouse · Event-Driven Ingestion · Real-Time AI Lakehouse · +2 more

Apache Hudi competes Apache Paimon · DuckLake alt Apache Hudi

High Cloud Inference Cost

The expense of running LLM/ML inference via cloud APIs (per-token or per-request pricing) against S3...

MinIO MemKV · S3 Express One Zone · Amazon S3 Vectors · AWS Lambda · Local Inference Stack · Hybrid Retrieval · DeepSeek V4 · +2 more

WEKA competes VAST Data · Helicone AI Gateway competes LiteLLM

New to the ecosystem?

I run local AI. Why do I care about S3?

Guided path from local inference to the S3 storage ecosystem — storage, formats, retrieval, and the tradeoffs that matter.

Recent waves

3 latest of 22

Architectural shifts as they happen. Each post anchors on a pre-existing pain point and walks through what changed.

Jul 5, 2026

The Frontier Moved Again — and the Floor Compounded

In June we argued the AI stack had split into a closed, expensive frontier and an open, cheap floor. Six weeks later the frontier proved the point the hard way: Claude Fable 5 was pulled offline for 19 days by an export-control order, then restored with a retention mandate that quietly bars the most sensitive data from ever touching it. Meanwhile the floor didn't just hold — it compounded. Object storage got RDMA-fast, the query engines converged on one substrate, and the data plane grew a control plane built for agents instead of humans. This is the field report.

Jun 27, 2026

The Storage Cost Inversion: When Object Storage Grew a Brain

A 2026 NAND/flash shortage and a wave of cloud storage price hikes made fast storage scarce — and pushed AI memory onto S3. But object storage didn't just become the cheap tier. Under pressure it became the active substrate: an agentic data plane, an RL training buffer, and a hot KV-cache memory pool.

Jun 22, 2026

The Training I/O Tax: Storage Just Got Repriced by the GPU

Three June 2026 signals — Alibaba's 30% CPFS price hike, a MinIO-vs-Dell RDMA throughput benchmark, and LanceDB's first published Enterprise latency numbers — show the same force at work: AI training I/O is repricing the storage layer. Managed parallel file storage is becoming a premium good, while commodity RDMA object storage closes the throughput gap at ~1% host CPU. The bottleneck moved, and so did the bill.

Featured guides

Guide 1