[flat index] Flat Search Interface#983
Conversation
Codecov Report❌ Patch coverage is Additional details and impacted files@@ Coverage Diff @@
## main #983 +/- ##
==========================================
- Coverage 89.45% 89.45% -0.01%
==========================================
Files 458 463 +5
Lines 85398 85974 +576
==========================================
+ Hits 76395 76909 +514
- Misses 9003 9065 +62
Flags with carried forward coverage won't be shown. Click here to find out more.
🚀 New features to boost your workflow:
|
There was a problem hiding this comment.
Pull request overview
This PR introduces an RFC plus an initial “flat” (sequential scan) search surface in diskann, analogous to the existing graph/random-access search pipeline built around DataProvider/Accessor.
Changes:
- Added an RFC describing the flat iterator/strategy/index abstraction and trade-offs.
- Added a new
diskann::flatmodule withFlatIterator,FlatSearchStrategy,FlatIndex::knn_search, andFlatPostProcess(+CopyFlatIds). - Exported the new
flatmodule from the crate root.
Reviewed changes
Copilot reviewed 7 out of 7 changed files in this pull request and generated 9 comments.
Show a summary per file
| File | Description |
|---|---|
| rfcs/00983-flat-search.md | RFC describing the design for sequential (“flat”) index search APIs. |
| diskann/src/lib.rs | Exposes the new flat module publicly. |
| diskann/src/flat/mod.rs | New module root + re-exports for the flat search surface. |
| diskann/src/flat/iterator.rs | Defines the async lending iterator primitive FlatIterator. |
| diskann/src/flat/strategy.rs | Defines FlatSearchStrategy to create per-query iterators and query computers. |
| diskann/src/flat/index.rs | Implements FlatIndex and the brute-force knn_search scan algorithm. |
| diskann/src/flat/post_process.rs | Defines FlatPostProcess and a basic CopyFlatIds post-processor. |
💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
hildebrandmw
left a comment
There was a problem hiding this comment.
Thanks Aditya. Left a few general comments with some ideas on how we might improve our code sharing. In general, I'm not a fan of prefixing everything with Flat. We already have the flat module so flat::SearchStrategy reads fine to me as opposed to flat::FlatSearchStrategy, which is a little redundant.
# Conflicts: # diskann/src/graph/glue.rs
This reverts commit 2928404.
# Conflicts: # diskann-providers/src/model/graph/provider/async_/caching/provider.rs
…t/DiskANN into u/adkrishnan/flat-index
hildebrandmw
left a comment
There was a problem hiding this comment.
Thanks Aditya,
Here's my take: The new HasElementRef and DistancesUnordered (the one in provider, not flat) add a hefty boilerplate burden to users of the graph index. Especially in light of something like #1067 which pushes in the other direction and while that PR is still experimental, I'm getting more convinced is the right API for the graph index. This puts us in kind of an awkward place. If we get this in, users consuming a new version will have churn for these little misc traits that will then need to be undone by #1067 (or some spiritual successor) which is not great.
If you want to start programming against this API, what if you introduce your own temporary TemporaryBuildQueryComputer (I believe this is the biggest reason why this PR needs to add HasElementRef and DistancesUnrodered) and we hold off on post processing for a short period of time to bottom out on #1067. If the latter doesn't pan out, then we can continue with these changes and accept it. Otherwise, the flat index implementations (which presumably will be less numerous than the current graph implementations) can be brought inline with the new trait organization and we'll cause less overall churn for our users.
I also remain unconvinced that the Iterator is any simpler than just implementing the new DistancesUnordered trait directly.
Thanks for the feedback and thorough review @hildebrandmw. To make sure I understand, are you suggesting we temporarily create a disjoint trait structure for the flat index (avoiding supporting post-processing) and then once #1067 settles down, we can re-evaluate how much shared surface we can have for these two indexes (both on the building/construction side and the post-processing)? |
Yeah, that's what I'm proposing. Something to minimize code churn, or at least defer it temporarily until we know that we aren't going to introduce these intermediate traits ( |
Remove `flat_search` from `DiskANNIndex` and the `IdIterator` trait from `diskann`. Since the only caller was from `diskann-disk`, add a new `flat_search` inherent method to `DiskIndexSearcher`. The flat search method is not compatible with the experimental direction in #1067 and with #983 on the horizon, this is safe to move for now.
hildebrandmw
left a comment
There was a problem hiding this comment.
Thanks for your patience Aditya!
This PR introduces a trait interface and a light index to support brute-force search for providers that can be used as/are a flat-index. There is an associated RFC that walks through the interface and associated implementation in
diskannas a newflatmodule.Rendered RFC link.
Motivation
The repo has no first-class surface for brute-force search. This PR adds a small trait hierarchy that gives flat search the same provider-agnostic shape that graph search has, so any backend (in-memory, quantized, disk, remote) can plug in once and reuse a shared algorithm.
Traits (
flat/strategy.rs)DistancesUnordered<C>— the single trait a backend must implement. Fuses iteration and scoring into one method: the implementation drives a full scan, scoring each element with a precomputed query computerC, and invokes a callback with(id, distance)pairs. Key associated types:ElementRef<'a>-- the reference shapeCscores against.Id-- the id type yielded to the callback (decoupled fromHasIdso visitors can yield any id shape).C : for<'a> PreprocessedDistanceFunction<Self::ElementRef<'a>, f32>-- the precomputer query computer.SearchStrategy<P, T>— factory that creates aDistancesUnorderedvisitor from a provider + context, and builds the per-query computer. Mirrors the graph-side strategy pattern. Two fallible methods:create_visitor— borrows provider + context, returns aVisitorbuild_query_computer— preprocesses the queryTinto aQueryComputerIndex (
flat/index.rs)FlatIndex<P>— thin'staticwrapper around aDataProvider. Currently we have implemented the naive kNN search algorithm for the flat index.knn_searchasks the strategy for a visitor, builds the query computer, drivesdistances_unorderedthrough a priority queue, and writes results viaSearchPostProcess.Test infrastructure (
flat/test/)A self-contained test provider with dimension-validated
Strategy, transient-error injection, and aKnnOracleRunharness that comparesknn_searchresults against a brute-force reference with baseline caching for regression detection.Future work
knn_searchcurrently uses the graph-sideSearchPostProcesstrait to write results into the output buffer. Simplify theDataProvidercontract for graph search #1067 will introduce a flat-specific post-processing step that decouples flat search from the graph module's output machinery.DistancesUnorderedacts is an associated type, instead of theInternalIdof the provider. This is due to overly restrictive trait bounds onVectorIdtrait. We plan on relaxing this allowing for more generic id types.