Graph based Nearest Neighbor Search: Promises and Failures
Recently, graph based nearest neighbor search gets more and more popular on large-scale retrieval tasks. The attractiveness of this type of approaches lies in its superior performance over most of the known nearest neighbor search approaches as well as its genericness to various distance measures. In this paper, the role of two strategies, namely hierarchical structure and graph diversification that are adopted as the key steps in graph based approaches, is investigated. We find that the hierarchical structure could not achieve "a much better logarithmic complexity scaling" as it was claimed in the original paper, particularly on high dimensional cases. Moreover, we find that similar high search speed efficiency as the one with hierarchical structure could be achieved with the support of flat k-NN graph after graph diversification. Finally, we point out the difficulty, that is faced by most of the graph based search approaches, is directly linked to "curse of dimensionality".
READ FULL TEXT