LLMS3.com
Model Class

Categories of ML/LLM models by their operational role in S3-centric systems.

4 nodes

Embedding Model

Model Class

A class of model that converts unstructured data (text, images, audio) into fixed-dimensional vector representations suitable for similarity search.

6 connections, 3 resources
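
As a rough illustration of "fixed-dimensional vector representations suitable for similarity search", the sketch below maps text of any length to a vector of the same size and ranks stored items by cosine similarity. Everything in it is an assumption made for illustration: `toy_embed` is a hash-based stand-in and not a real embedding model, `DIM` is an arbitrary dimension, and the object keys are hypothetical. A real system would call an actual embedding model and most likely a vector index.

```python
import hashlib
import math

DIM = 8  # hypothetical embedding dimension; real models use hundreds or more


def toy_embed(text: str, dim: int = DIM) -> list[float]:
    """Stand-in for an embedding model (NOT a real one).

    Hashes each whitespace-separated token into one of `dim` buckets and
    L2-normalizes the result, so every input yields a vector of length `dim`.
    """
    vec = [0.0] * dim
    for token in text.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[digest[0] % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; reduces to a dot product for unit-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def search(query: str, corpus: dict[str, str], top_k: int = 1) -> list[str]:
    """Return the keys of the `top_k` corpus entries most similar to `query`."""
    q = toy_embed(query)
    scored = sorted(
        ((cosine(q, toy_embed(doc)), key) for key, doc in corpus.items()),
        reverse=True,
    )
    return [key for _, key in scored[:top_k]]


# Hypothetical object keys mapped to text extracted from each object.
corpus = {
    "s3://example-bucket/docs/a.txt": "parquet files in a data lake",
    "s3://example-bucket/docs/b.txt": "audio transcripts from support calls",
}
results = search("data lake parquet", corpus, top_k=1)
```

The property that matters here is that the output dimension does not depend on the input, which is what lets vectors from different objects be compared with a single similarity function.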

General-Purpose LLM

Model Class

A large language model for broad text tasks. In scope when applied to metadata extraction, summarization, schema inference, or querying of S3-stored c...

10 connections, 3 resources

Code-Focused LLM

Model Class

An LLM specialized for code understanding and generation. A subtype of General-Purpose LLM with enhanced ability to work with structured and semi-stru...

4 connections, 3 resources

Small / Distilled Model

Model Class

A compact model (typically under 10B parameters) suitable for local or edge deployment, often distilled from a larger model to retain key capabilities...

2 connections, 2 resources
LLMS3.com — The S3 & Object Storage Ecosystem Index