KNN benchmark tooling should also report "KNN Searcher RAM" #314

mikemccand · 2024-11-12T12:27:15Z

[Spinoff from https://github.com/apache/lucene/pull/13651/]

With the various cool ways Lucene can now quantize KNN vectors (per-dimension scalar quantization, and the upcoming RabitQ and maybe other cool algos with time...), the "hot RAM" required for efficient searching is much lower than the index size because Lucene always keeps the original (float32 or byte) input vectors so KNN data structures can be recomputed accurately during segment merging.

Let's fix our KNN tooling to separately report "hot RAM" required (subtract the index storage needed for the original vectors).

The text was updated successfully, but these errors were encountered:

mikemccand mentioned this issue Nov 13, 2024

Run knnPerfTest.py in nightly benchmarks #316

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KNN benchmark tooling should also report "KNN Searcher RAM" #314

KNN benchmark tooling should also report "KNN Searcher RAM" #314

mikemccand commented Nov 12, 2024

KNN benchmark tooling should also report "KNN Searcher RAM" #314

KNN benchmark tooling should also report "KNN Searcher RAM" #314

Comments

mikemccand commented Nov 12, 2024