The dual-encoder has become the de facto architecture for dense retrieva...
Knowledge distillation is often used to transfer knowledge from a strong...
Sampling proper negatives from a large document pool is vital to effecti...
Knowledge distillation is an effective way to transfer knowledge from a...
Traditional information retrieval (IR) ranking models process the full t...