Portfolio Investment Memo

Why We Backed Qdrant

Yuki Nakashima March 28, 2023

When Flintrock led Qdrant's seed round in early 2023, the vector database category was already crowded. Pinecone had raised $100M. Weaviate had significant community traction. Milvus had been running in production for years. There was a real question worth asking: why does the world need another vector database, and why does Qdrant specifically deserve the Flintrock check?

The answer came down to three things: the architecture decisions the team had made, the way they had made them, and the community they had built while making them without external funding.

The architecture thesis

Vector databases have a fundamental tension. Pure ANN (approximate nearest neighbor) search is fast but imprecise — you get the vectors that are geometrically close, not necessarily the ones you want. Real AI applications almost always want to combine vector similarity with structured filtering: "find me the semantically similar documents, but only the ones from this user, in this category, published after this date."

Most vector databases handle this by doing the vector search first and the filtering second — which means either scanning the full index and then filtering, or doing the vector search and discarding a large fraction of results in the filter step. Both degrade dramatically as filter cardinality increases.

Qdrant's approach was to build filtering into the index traversal itself — a technique they call "filterable HNSW." Rather than separate steps, the filter conditions constrain which nodes in the HNSW graph are traversable during search. This means the search operation and the filter operation are interleaved at the index level, and performance degrades gracefully as filters become more selective rather than catastrophically.

This is a genuinely different architectural decision, not a marketing claim. When I first read the technical documentation and then looked at their benchmark data, the performance profile matched what the architecture predicted. That's a meaningful signal.

The team's reasoning quality

Architecture decisions matter, but so does the team's capacity to keep making good architectural decisions as the system evolves. What I look for in technical founding teams is not just the quality of their current decisions but the quality of their reasoning process — do they understand the tradeoffs they're making, can they explain the failure modes of their approach, are they honest about what they've gotten wrong?

The Qdrant founders — Andrei Vasnetsov and team — were unusually direct about their limitations during diligence. They knew where their performance degraded. They knew which use cases their current architecture didn't serve well. They had opinions about why, and those opinions reflected genuine understanding rather than defensive positioning. That quality of technical honesty is a strong predictor of good long-term architectural decisions.

The question we spent the most time on

The hardest question in vector database diligence in early 2023 was whether this would be a standalone business or a feature that hyperscalers absorb. Google had SCANN. Meta had Faiss. Amazon was adding vector search to RDS and OpenSearch. The cloud platforms were clearly paying attention.

Our conclusion, which has been borne out since, is that the hyperscaler implementations are optimized for the hyperscaler's broader platform integration — they're not optimized for vector search performance. For teams running performance-sensitive AI applications, the performance gap between a purpose-built vector database and a tacked-on cloud service is meaningful enough to justify a standalone deployment. And purpose-built means open-source community and developer ecosystem, which is where Qdrant had already established a position.

Two years later, the answer looks right. Qdrant has grown substantially in production deployments and the hyperscaler alternatives have not displaced purpose-built vector databases for performance-sensitive workloads. The thesis held.