Itinai.com it development details code screens blured futuris c6679a58 04d0 490e 917c d214103a6d65 2
Itinai.com it development details code screens blured futuris c6679a58 04d0 490e 917c d214103a6d65 2

Microsoft’s Cost-Effective Vector Search System with DiskANN in Azure Cosmos DB



Cost-Effective Vector Search with Microsoft Azure Cosmos DB

Microsoft’s Innovative Vector Search Solution

Microsoft has developed a groundbreaking system that integrates vector search capabilities directly into Azure Cosmos DB. This advancement allows businesses to perform efficient searches on high-dimensional vector data, which is essential for applications like web search, AI assistants, and content recommendations.

Understanding Vector-Based Retrieval Challenges

Vector-based retrieval systems face significant challenges, primarily due to the high costs and complexities associated with maintaining separate databases for transactional data and vector indexes. Traditionally, businesses have had to duplicate data across systems, leading to:

  • Increased latency in data retrieval
  • Higher storage costs
  • Risks of data inconsistencies

Popular tools like Zilliz and Pinecone, while effective, often operate as standalone services that can struggle with latency and memory usage, especially when handling large datasets or frequent updates.

Microsoft’s Integrated Solution

The research team at Microsoft has tackled these challenges by embedding vector indexing within Azure Cosmos DB’s NoSQL framework. Utilizing DiskANN, a graph-based indexing library, they have created a system that:

  • Eliminates the need for a separate vector database
  • Utilizes Cosmos DB’s strengths, such as high availability and automatic partitioning
  • Maintains a single vector index per partition, synchronized with document data

This integration not only simplifies operations but also enhances performance and scalability, making it a cost-effective solution for businesses.

Performance and Cost Efficiency

In testing, Microsoft’s system has shown impressive results. For a dataset of 10 million vectors, the average query latency was under 20 milliseconds, with a recall rate of 94.64%. When comparing costs:

  • Azure Cosmos DB’s query costs were 15 times lower than Zilliz and 41 times lower than Pinecone.
  • The system maintained cost efficiency even as the index size grew, with minimal increases in latency.
  • Ingestion costs for 10 million vector inserts were approximately $162.5, competitive with other platforms.

These results demonstrate that businesses can achieve high performance without incurring excessive costs, even during heavy data updates.

Conclusion

Microsoft’s integration of vector search into Azure Cosmos DB offers a practical solution for businesses looking to enhance their data retrieval capabilities. By simplifying operations and significantly reducing costs, this system provides a valuable template for organizations aiming to incorporate advanced semantic search into their workflows. For more information, check out the research paper and explore how artificial intelligence can transform your business operations.

If you need assistance in implementing AI solutions in your business, feel free to reach out to us at hello@itinai.ru or connect with us on Telegram, X, and LinkedIn.


Itinai.com office ai background high tech quantum computing 0002ba7c e3d6 4fd7 abd6 cfe4e5f08aeb 0

Vladimir Dyachkov, Ph.D
Editor-in-Chief itinai.com

I believe that AI is only as powerful as the human insight guiding it.

Unleash Your Creative Potential with AI Agents

Competitors are already using AI Agents

Business Problems We Solve

  • Automation of internal processes.
  • Optimizing AI costs without huge budgets.
  • Training staff, developing custom courses for business needs
  • Integrating AI into client work, automating first lines of contact

Large and Medium Businesses

Startups

Offline Business

100% of clients report increased productivity and reduced operati

AI news and solutions