Summary

What it is

An open-source vector database with hybrid search combining BM25 keyword matching and vector similarity in a single query, plus multi-tenancy and S3-tiered cold storage.

Where it fits

Weaviate is a stateful vector search server for teams that need both keyword and semantic retrieval over S3-derived embeddings. Its tiered storage offloads cold tenants to S3, in line with the pattern of separating storage from compute. It is the opposite architectural choice from LanceDB: a continuously running server that must be operated, versus embedded serverless queries.

Misconceptions / Traps

Weaviate is a stateful server that requires dedicated infrastructure; it is not serverless like LanceDB. Plan for operational overhead, including backups, scaling, and upgrades.
Hybrid search (BM25 + vector) is powerful but requires tuning the fusion algorithm and the balance between keyword and vector scores. Default weights rarely match production relevance needs.
Multi-tenancy isolates data but shares cluster resources. Noisy-neighbor effects are possible without proper resource limits.
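To make the fusion-tuning point concrete, here is a minimal sketch of alpha-weighted score fusion in plain Python. It is an illustration of the general technique, not Weaviate's implementation: the function names, the min-max normalization, and the sample scores are assumptions made for this example. It follows the common convention that alpha = 0 means keyword-only and alpha = 1 means vector-only.

```python
from typing import Dict, List, Tuple


def min_max_normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Scale scores into [0, 1]; if all scores are equal, map each to 1.0."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {doc_id: 1.0 for doc_id in scores}
    return {doc_id: (s - lo) / (hi - lo) for doc_id, s in scores.items()}


def fuse_scores(
    bm25: Dict[str, float],
    vector: Dict[str, float],
    alpha: float = 0.5,
) -> List[Tuple[str, float]]:
    """Blend normalized keyword and vector scores, highest fused score first.

    A document missing from one result set contributes 0.0 for that side.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be between 0 and 1")
    bm25_n = min_max_normalize(bm25)
    vec_n = min_max_normalize(vector)
    fused = {
        doc_id: (1.0 - alpha) * bm25_n.get(doc_id, 0.0)
        + alpha * vec_n.get(doc_id, 0.0)
        for doc_id in set(bm25_n) | set(vec_n)
    }
    return sorted(fused.items(), key=lambda item: item[1], reverse=True)
```

Sweeping alpha over a labeled query set and comparing the resulting rankings is one simple way to check whether the default weighting suits a given workload.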

Key Connections

scoped_to Vector Indexing on Object Storage — stores cold vectors on S3
solves Cold Scan Latency — pre-indexed hybrid search over embeddings
alternative_to LanceDB — stateful server vs serverless on S3

Definition

What it is

An open-source vector database with hybrid search combining vector similarity and BM25 keyword scoring. Supports multi-tenancy and tiered storage that offloads inactive tenants to S3-compatible backends.

Why it exists

Pure vector search misses exact keyword matches, and pure keyword search misses semantic meaning. Weaviate combines both retrieval modes in a single query, reducing the need for separate search pipelines. Its multi-tenant architecture lets SaaS platforms isolate customer data while sharing infrastructure, and its S3 tiered storage keeps cold data off expensive local disks.

Primary use cases

Hybrid semantic + keyword search over S3-derived embeddings, multi-tenant RAG backends, and tiered embedding storage with hot data in memory and cold data on S3.
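The tiered-storage use case can be sketched as a small idle-time policy that decides which tenants stay hot and which move to S3. This is a hypothetical planning helper, not Weaviate's API: the state names, the Tenant fields, and both thresholds are assumptions chosen for illustration, and the code only computes a plan without contacting any server.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, List


class TenantState(Enum):
    """Illustrative tiers: in memory, on local disk, or offloaded to S3."""
    ACTIVE = "ACTIVE"
    INACTIVE = "INACTIVE"
    OFFLOADED = "OFFLOADED"


@dataclass
class Tenant:
    name: str
    state: TenantState
    idle_seconds: float  # time since the tenant was last queried or written


def target_state(
    tenant: Tenant,
    inactive_after: float = 3600.0,          # assumed: 1 hour idle
    offload_after: float = 7 * 24 * 3600.0,  # assumed: 7 days idle
) -> TenantState:
    """Pick the tier a tenant should be in, based only on idle time."""
    if tenant.idle_seconds >= offload_after:
        return TenantState.OFFLOADED
    if tenant.idle_seconds >= inactive_after:
        return TenantState.INACTIVE
    return TenantState.ACTIVE


def plan_transitions(tenants: List[Tenant]) -> Dict[str, TenantState]:
    """Return only the tenants whose tier should change, keyed by name."""
    plan = {}
    for tenant in tenants:
        wanted = target_state(tenant)
        if wanted != tenant.state:
            plan[tenant.name] = wanted
    return plan
```

A real policy would likely also weigh tenant size and reactivation latency, since bringing an offloaded tenant back from S3 is slower than reloading one from local disk.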

Recent developments

Latest signals

Source mix note: Weaviate's recent third-party benchmark coverage is heavy; the bullets below cite multiple comparison surveys to triangulate performance positioning rather than rely on any single vendor-published number.

Latest release: v1.38.2 (current as of June 2026), tracking the upstream stable release line. Per weaviate/weaviate releases.
Weaviate v1.37 ships the first built-in MCP server in a vector database (April 23, 2026). Available as a preview, it's exposed as a Streamable HTTP endpoint at /v1/mcp on the REST port, disabled by default, and RBAC-governed via new read_mcp/create_mcp/update_mcp permissions so agents (Claude Code, Cursor, VS Code) query and upsert natively without custom glue. Per Weaviate 1.37 Release.
Hybrid search positioning: HNSW, BM25F, compression, and hybrid ranking are first-class features. Per TechBytes' 2026 Pinecone/Weaviate/pgvector comparison, Weaviate is strongest when search itself is the product, because HNSW indexing, BM25F lexical scoring, vector compression, and hybrid ranking are built in rather than bolted on. The 2026 convergence trends across the vector DB landscape are compression by default, read/write disaggregation, and hybrid (vector + keyword) ranking; Weaviate has been ahead of that curve for two years.
Concrete benchmark positioning at 1M-vector scale. Per the HolySheep showdown vs Milvus + Qdrant (April 2026), Weaviate 1.23 posts cold query p50 35ms / p99 189ms, warm query p50 11ms / p99 47ms, bulk insert of 1M vectors in 6m 03s, and a 98.41% success rate under 1000 QPS sustained load. Treat these as third-party positioning data, not official Weaviate measurements.
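For the MCP endpoint mentioned in the v1.37 item above, the following sketch shows roughly what a client's first request might look like. Only the /v1/mcp path and the Streamable HTTP transport come from the release note; the base URL, the bearer-token header, the client name, and the protocol version string are assumptions for illustration. The function builds the request without sending it, so it can be inspected without a running server.

```python
import json
import urllib.request
from typing import Optional


def build_mcp_initialize_request(
    base_url: str,
    api_key: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a JSON-RPC 'initialize' POST for an MCP endpoint."""
    payload = {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "initialize",
        "params": {
            # Assumed protocol revision; the server may expect a different one.
            "protocolVersion": "2025-03-26",
            "capabilities": {},
            # Hypothetical client identity, for illustration only.
            "clientInfo": {"name": "example-client", "version": "0.1.0"},
        },
    }
    headers = {
        "Content-Type": "application/json",
        # Streamable HTTP clients accept either a JSON body or an SSE stream.
        "Accept": "application/json, text/event-stream",
    }
    if api_key:
        # Assumed auth scheme; check how the target deployment is secured.
        headers["Authorization"] = f"Bearer {api_key}"
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/mcp",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

Because the endpoint is disabled by default and RBAC-governed, a request like this would be expected to fail until the feature is enabled and the caller holds the relevant MCP permissions.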