Hi authors,
Regarding the Two-Stage k-NN Search (Section 3.1): have you compared it with the native (single-stage) kNN-LM?
I would love to know the specific differences in:
- Effectiveness: Does the 2-stage approximation cause a noticeable drop in perplexity?
- Efficiency: How significant is the speed/latency gap between them?
Any qualitative analysis or quantitative results would be greatly appreciated. Thanks!