Topic

Topics are navigational entry points: conceptual domains that other nodes connect to via scoped_to edges.

9 nodes

S3

Topic

Amazon's Simple Storage Service and the broader ecosystem of S3-compatible object storage. The root concept of this entire index.

39 connections 4 resources

Object Storage

Topic

The storage paradigm of flat-namespace, HTTP-accessible binary objects with metadata. Data is addressed by bucket and key, not by filesystem path.

19 connections 3 resources
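
As a rough illustration of bucket-and-key addressing, the sketch below models an object store as a single flat map. FlatObjectStore, StoredObject, and the bucket and key names are hypothetical and exist only for this example; this is a toy in-memory model, not the S3 API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class StoredObject:
    """Binary payload plus free-form metadata (hypothetical type)."""
    data: bytes
    metadata: Dict[str, str] = field(default_factory=dict)


class FlatObjectStore:
    """Toy in-memory store: every object lives in one flat map."""

    def __init__(self) -> None:
        # The address is the (bucket, key) pair; there are no directories.
        self._objects: Dict[Tuple[str, str], StoredObject] = {}

    def put(self, bucket: str, key: str, data: bytes,
            metadata: Optional[Dict[str, str]] = None) -> None:
        self._objects[(bucket, key)] = StoredObject(data, dict(metadata or {}))

    def get(self, bucket: str, key: str) -> StoredObject:
        return self._objects[(bucket, key)]

    def list_keys(self, bucket: str, prefix: str = "") -> List[str]:
        # "Folders" are only a naming convention: listing filters keys
        # by string prefix instead of walking a directory tree.
        return sorted(
            k for (b, k) in self._objects
            if b == bucket and k.startswith(prefix)
        )


store = FlatObjectStore()
store.put("logs", "2024/01/app.log", b"started", {"content-type": "text/plain"})
store.put("logs", "2024/02/app.log", b"stopped")
keys_2024_01 = store.list_keys("logs", prefix="2024/01/")
```

The slashes in a key such as 2024/01/app.log carry no structural meaning in this model; they are ordinary characters that a prefix filter happens to match.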

Lakehouse

Topic

The convergence of data lake storage (raw files on object storage) with data warehouse capabilities — ACID transactions, schema enforcement, SQL acces...

14 connections 3 resources

Data Lake

Topic

The pattern of storing raw, heterogeneous data in object storage for later processing. Data arrives in its original form and is transformed downstream...

8 connections 3 resources

Table Formats

Topic

The category of specifications (Apache Iceberg, Delta Lake, Apache Hudi) that bring table semantics (schema, partitioning, ACID transactions, time travel) to collectio...

15 connections 4 resources

Vector Indexing on Object Storage

Topic

The practice of building and querying vector indexes over embeddings derived from data stored in S3.

7 connections 3 resources

LLM-Assisted Data Systems

Topic

The intersection of large language models and S3-centric data infrastructure. Scoped strictly to cases where LLMs operate on, enhance, or derive value...

14 connections 3 resources

Metadata Management

Topic

The discipline of maintaining catalogs, schemas, statistics, and descriptive information about objects and datasets stored in S3.

5 connections 4 resources

Data Versioning

Topic

Techniques for tracking and managing changes to datasets stored in object storage over time, including snapshots, branching, and rollback.

2 connections 3 resources
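
One way to picture snapshots, branching, and rollback is as immutable file lists plus named pointers. The sketch below is a minimal model under that assumption; VersionedDataset and the file names are hypothetical, and real systems track far more (parent lineage, manifests, conflict checks).

```python
from typing import Dict, FrozenSet, Iterable, List


class VersionedDataset:
    """Toy model: snapshots are immutable sets of object keys,
    and a branch is just a name pointing at a snapshot id."""

    def __init__(self) -> None:
        self._snapshots: List[FrozenSet[str]] = [frozenset()]  # id 0 is empty
        self._branches: Dict[str, int] = {"main": 0}

    def commit(self, branch: str, add: Iterable[str] = (),
               remove: Iterable[str] = ()) -> int:
        # A commit never edits an old snapshot; it appends a new one
        # and advances the branch pointer to it.
        base = self._snapshots[self._branches[branch]]
        self._snapshots.append((base | frozenset(add)) - frozenset(remove))
        snapshot_id = len(self._snapshots) - 1
        self._branches[branch] = snapshot_id
        return snapshot_id

    def create_branch(self, name: str, from_branch: str) -> None:
        # Branching copies a pointer, not the data.
        self._branches[name] = self._branches[from_branch]

    def rollback(self, branch: str, snapshot_id: int) -> None:
        # Rollback moves the pointer back. This sketch does not check
        # that the target snapshot is an ancestor of the branch.
        if not 0 <= snapshot_id < len(self._snapshots):
            raise ValueError(f"unknown snapshot {snapshot_id}")
        self._branches[branch] = snapshot_id

    def files(self, branch: str) -> List[str]:
        return sorted(self._snapshots[self._branches[branch]])


ds = VersionedDataset()
first = ds.commit("main", add=["data/part-0.parquet"])
ds.create_branch("experiment", "main")
ds.commit("experiment", add=["data/part-1.parquet"])
ds.commit("main", add=["data/part-2.parquet"])
ds.rollback("main", first)
main_files = ds.files("main")
experiment_files = ds.files("experiment")
```

Because snapshots are never modified, rolling back main leaves the experiment branch untouched, and the rolled-back snapshot is still present if it needs to be restored.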