Research — Flintrock Capital

2026

The vector database category has ~40 funded companies. Maybe five have the API surface, performance profile, and community momentum to survive as independent businesses. What distinguishes the durable ones.

Mar 14, 2026

Model Serving at the Edge: Why Latency Changes Everything

James Thornton

Moving inference from cloud to edge isn't just a cost optimization — it's an architectural shift that changes what AI applications can do. The infrastructure requirements, and who builds them.

Jan 22, 2026

The Data Contract Movement, Two Years In

James Thornton

Data contracts as a pattern are now mainstream. What's actually stuck in production, what's still aspirational, and what the successful implementations have in common.

2025

Nov 6, 2025

Why Streaming Is Finally Winning

James Thornton

Real-time data pipelines have been "the future" for a decade. In 2025 they're genuinely table-stakes. The infrastructure maturity, tooling cost drops, and developer ergonomics that closed the gap.

Sep 18, 2025

The AI Observability Gap

Yuki Nakashima

Traditional APM wasn't built for probabilistic outputs. ML systems in production need a different observability surface — what it looks like and which teams are building the right abstractions.

Jul 29, 2025

Feature Stores in 2025: From Research Project to Critical Infrastructure

Yuki Nakashima

When Flintrock led Chalk's seed in 2023, feature stores were a specialty concern. Now they're a deployment prerequisite. The market map, and what the mature-pattern looks like.

May 9, 2025

Building a Production-Grade RAG Stack from Open-Source Components

Yuki Nakashima

Every portfolio company we've backed has been absorbed into someone's RAG architecture. Here's what a principled open-source stack looks like, layer by layer, with no vendor lock-in at any point.

Mar 3, 2025

DuckDB and the Single-Node Renaissance

James Thornton

For most analytical workloads, a single well-designed process beats a distributed cluster. Why the pendulum is swinging back, what DuckDB makes tractable, and the class of workloads still needing Spark.

Jan 17, 2025

The MLOps-to-LLMOps Pivot: What Survives the Transition

Yuki Nakashima

Not all MLOps tooling translates cleanly to the large-language-model world. A systematic look at which abstractions carry over, which need redesigning, and which should be retired entirely.

2024

Nov 21, 2024

Embedding the Database: Why In-Process Analytics Matter for AI

James Thornton

The database-as-server model was designed for a different era. In-process engines like DuckDB dissolve the client-server boundary — and that changes how AI applications can interact with data.

Sep 5, 2024

The Vector Database Market Map: Infrastructure Layer or Application Layer?

Yuki Nakashima

Vector databases are being built by infrastructure engineers and application developers simultaneously, with very different design centers. What that means for how the category shakes out.

Jul 11, 2024

The Developer Tools Distribution Playbook, Updated for 2024

James Thornton

The OSS-core to enterprise-contract playbook that Kafka and Airflow ran is still viable — but the timeline, community threshold, and enterprise buyer behavior have all shifted. What's new.

Apr 30, 2024

Data Quality at the Source: Why Moving Validation Upstream Changes Everything

Yuki Nakashima

Post-hoc data quality tooling runs in the warehouse, after the damage is done. The more productive frame is source-side validation — which requires a different infrastructure model entirely.

Feb 14, 2024

Orchestration Is Not Just Scheduling

James Thornton

The dominant mental model of orchestration as "cron at scale" misses what makes modern workflow systems valuable: observability, recoverable failure states, and dynamic task graphs. What changed and why it matters.

2023

Nov 8, 2023

The AI Infrastructure Stack: A Field Guide for Founders

James Thornton

There are now hundreds of AI infrastructure companies. Most address real pain. Few will have durable businesses. A map of the stack, a framework for evaluating positions, and a view on where defensibility compounds.

Aug 24, 2023

Building Context for LLMs: The Retrieval Problem at Scale

Yuki Nakashima

What a language model can do is bounded by what it can see. The retrieval infrastructure that populates the context window — embedding models, vector stores, rerankers, chunking strategies — is not a solved problem.

Jun 2, 2023

Production ML Is Still Hard, and That's an Infrastructure Opportunity

Yuki Nakashima

The gap between ML research and ML production is not closing fast enough. Feature skew, model drift, serving latency, dependency hell — each is a product category. The companies addressing the hardest ones.

Mar 28, 2023

Why We Backed Qdrant

Yuki Nakashima

Our investment thesis for Qdrant: the architecture decisions that separate it from other vector databases, the community trajectory that convinced us, and the question we spent the most time on during diligence.

2022

Oct 17, 2022

Seed-Stage Data Infrastructure Investing: Why Earlier Is Better

James Thornton

The window to back the next Kafka, Spark, or Airflow equivalent opens before the market recognizes it. What we look for in Seed-stage data infrastructure companies, and why we write checks when others wait.

Jul 6, 2022

The Case for Open-Source Data Infrastructure

James Thornton

The most durable data infrastructure companies share a pattern: open-source core with enterprise expansion path. Why OSS-first distribution outperforms proprietary distribution at the infrastructure layer, and when it doesn't.

Feb 11, 2022

What the AI Application Layer Needs Before It Can Scale

James Thornton

AI applications are proliferating. But the infrastructure layer they run on — the databases, orchestration systems, feature stores, model-serving frameworks — is still immature. The missing pieces, and why now is the right time to build them.