Adapting the Exploration Graph for High Throughput in Low Recall Regimes
TL;DR
Abstract
Nearest neighbor search is vital for modern search systems, particularly in high-dimensional spaces. This paper addresses the 2024 SISAP Indexing Challenge, which involves searching a dataset of 100 million 768-dimensional feature vectors under low recall and memory constraints. We explore the trade-offs between navigation and exploration graphs, where the latter is typically better suited for high recall scenarios. To tackle the challenge, we combine the state-of-the-art continuous refining Exploration Graph (crEG) with feature compression techniques. Although compression reduces overall recall accuracy, it significantly improves search speed. Given the challenge's focus on low recall, this trade-off enables crEG to perform efficiently, making it competitive even against navigation graphs traditionally favored in such settings.
BibTeX
If you use our work in your research, please cite our publication:
@inproceedings{10.1007/978-3-031-75823-2_24,
author = {Hezel, Nico and Schilling, Bruno and Barthel, Kai Uwe and Schall, Konstantin and Jung, Klaus},
editor="Ch{\'a}vez, Edgar and Kimia, Benjamin and Loko{\v{c}}, Jakub and Patella, Marco and Sedmidubsky, Jan",
title="Adapting the Exploration Graph for High Throughput in Low Recall Regimes",
booktitle="Similarity Search and Applications",
year="2025",
publisher="Springer Nature Switzerland",
address="Cham",
pages="283--290",
isbn="978-3-031-75823-2"
}