Milvus vs OpenSearch for Vector Search

Comparing Milvus and OpenSearch for large-scale vector search, covering architecture, scalability, performance, and operational considerations.

Added 28 Mar 2026 5 min read Updated 14 Jun 2026

#Milvus #OpenSearch #vector-search #database #comparison

Learn this your way

Read Guided course

Milvus is a purpose-built vector database designed for billion-scale similarity search. OpenSearch is a search and analytics engine with vector search capabilities. When choosing between them, the decision often comes down to scale requirements and whether you need capabilities beyond vector search.

Architecture

Milvus is built as a cloud-native distributed system. It separates compute from storage, using object storage (S3) for persistence. Earlier versions relied on an external message queue (Pulsar or Kafka) for streaming, but Milvus 2.6 (June 2025) replaced that dependency with Woodpecker, a purpose-built write-ahead log that persists directly to object storage and removes the need to run a separate message broker. This architecture enables independent scaling of query and insertion workloads. Components include a proxy layer, query nodes, data nodes, index nodes, and (since 2.6) a streaming node for real-time ingestion.

OpenSearch is a distributed search engine based on the Apache Lucene library. Vector search is provided by the k-NN plugin, which supports HNSW and IVF methods across three engines: Faiss, Lucene, and NMSLIB (NMSLIB is deprecated). The architecture is a shared-nothing cluster where each node stores a portion of the data.

Scale Comparison

Metric	Milvus	OpenSearch
Max vectors tested	Billions	Hundreds of millions
Distributed scaling	Cloud-native, compute-storage separation	Cluster scaling (add nodes)
Index types	IVF_FLAT, IVF_SQ8, IVF_PQ, HNSW, SCANN, DiskANN	HNSW, IVF (across Faiss, Lucene, NMSLIB engines)
GPU indexing	Yes	Yes (since OpenSearch 3.0, via NVIDIA cuVS)
Disk-based indexing	Yes (DiskANN)	Yes (disk-optimized mode since 2.17)

Milvus has a clear advantage at very large scale. DiskANN enables billion-scale vector search without requiring all vectors to reside in memory, and GPU-accelerated indexing dramatically speeds up index building for large datasets. The gap has narrowed: OpenSearch 3.0 (May 2025) added GPU-accelerated vector indexing using NVIDIA cuVS, and disk-optimized vector search has been available since OpenSearch 2.17 to cut memory cost on large indexes.

Feature Comparison

Feature	Milvus	OpenSearch
Vector search	Core capability	Plugin (k-NN)
Full-text search	Native BM25 (since Milvus 2.5)	Core capability
Hybrid search	Vector + full-text (BM25) + scalar filtering	Vector + text + filtering
Analytics	No	Yes (aggregations, dashboards)
Schema enforcement	Yes (typed fields)	Flexible (dynamic mapping)
Multi-vector	Yes (multiple vector fields per entity)	Yes (multiple k-NN fields)
Partitioning	Yes (by partition key)	Yes (by index/shard)
Time travel queries	Yes (query historical state)	No

OpenSearch provides significantly broader functionality beyond vector search. If you need log analytics, dashboards, or aggregations alongside vector search, OpenSearch is the more complete platform. The full-text search gap has closed somewhat: Milvus 2.5 added native full-text search and Milvus 2.6 added BM25 scoring, so basic keyword plus vector hybrid search no longer requires a separate engine. OpenSearch remains far stronger for general-purpose text relevance, analytics, and observability.

Operational Complexity

Milvus has high operational complexity in its distributed mode. The architecture requires multiple components: etcd for metadata, MinIO/S3 for storage, plus the Milvus components themselves. Milvus 2.6 removed the external Pulsar/Kafka message broker by replacing it with the built-in Woodpecker write-ahead log, which reduces the number of moving parts but does not make the distributed deployment trivial. Managed options (Zilliz Cloud) eliminate this complexity but add cost.

Milvus Lite is an embedded mode for development and small-scale use. It simplifies getting started but is not suitable for production at scale.

Amazon OpenSearch Service on AWS is fully managed. AWS handles provisioning, patching, backups, and scaling. The service tracks the open-source releases closely and supports OpenSearch up to version 3.3 as of late 2025. The operational burden is moderate and well-documented.

For teams without dedicated infrastructure engineers, OpenSearch Service is significantly easier to operate than self-hosted Milvus.

Performance

At moderate scale (under 10 million vectors), both perform well with proper configuration. At larger scale, Milvus’s purpose-built architecture and GPU support give it advantages in both indexing speed and query latency.

Milvus also supports more index types, allowing fine-tuning of the accuracy-performance tradeoff for specific workloads. DiskANN is particularly valuable for cost-effective search at billion scale.

Cost

Self-hosted Milvus: Infrastructure cost for multiple components. A minimal production cluster needs 3+ nodes plus supporting services. Higher infrastructure cost but no per-query charges.

Zilliz Cloud (managed Milvus): Managed service with usage-based pricing. Simpler operations but premium pricing.

OpenSearch Service: Instance-based pricing. A production cluster (3 data nodes, 3 master nodes) starts at ~$200-300/month. Includes all features (vector search, text search, analytics).

For pure vector search at moderate scale, the costs are comparable. At very large scale (100M+ vectors), Milvus’s efficient indexing and DiskANN can be more cost-effective per vector stored.

When to Choose Milvus

Billion-scale vector search requirements
Need GPU-accelerated indexing
Need disk-based indexing for cost-effective large-scale search
Building a dedicated similarity search service
Team has infrastructure expertise to manage the system (or use Zilliz Cloud)

When to Choose OpenSearch

Need combined vector search, full-text search, and analytics
Already using OpenSearch for other purposes
Want managed AWS service with standard operational practices
Scale requirements are under 100 million vectors
Need dashboards and visualization alongside vector search

For most enterprise AI applications, OpenSearch provides sufficient vector search capability alongside valuable additional features. Milvus is the right choice when vector search at massive scale is the primary requirement and the operational investment is justified. Both projects are evolving quickly (Milvus 3.0 is in beta and OpenSearch has shipped 3.x with GPU vector search), so benchmark against current versions rather than older comparisons.

Sources

Open source projects

Freelancer Templates Contracts, proposals, SOWs

Freelancer Automation Workflow recipes, AI playbooks

Work with Linda

Workshop Series €2,000/mo x 3

1:1 Consulting 60 min session