Towards Better Web Search Performance: Pre-training, Fine-tuning and Learning to Rank

02/28/2023
by   Haitao Li, et al.
0

This paper describes the approach of the THUIR team at the WSDM Cup 2023 Pre-training for Web Search task. This task requires the participant to rank the relevant documents for each query. We propose a new data pre-processing method and conduct pre-training and fine-tuning with the processed data. Moreover, we extract statistical, axiomatic, and semantic features to enhance the ranking performance. After the feature extraction, diverse learning-to-rank models are employed to merge those features. The experimental results show the superiority of our proposal. We finally achieve second place in this competition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2023

Ensemble Ranking Model with Multiple Pretraining Strategies for Web Search

An effective ranking model usually requires a large amount of training d...
research
01/19/2015

Statistical-mechanical analysis of pre-training and fine tuning in deep learning

In this paper, we present a statistical-mechanical analysis of deep lear...
research
11/30/2015

Cost-aware Pre-training for Multiclass Cost-sensitive Deep Learning

Deep learning has been one of the most prominent machine learning techni...
research
11/15/2018

Boosting Search Performance Using Query Variations

Rank fusion is a powerful technique that allows multiple sources of info...
research
08/03/2023

Curricular Transfer Learning for Sentence Encoded Tasks

Fine-tuning language models in a downstream task is the standard approac...
research
04/22/2023

Towards Understanding Feature Learning in Out-of-Distribution Generalization

A common explanation for the failure of out-of-distribution (OOD) genera...
research
01/31/2023

ZhichunRoad at Amazon KDD Cup 2022: MultiTask Pre-Training for E-Commerce Product Search

In this paper, we propose a robust multilingual model to improve the qua...

Please sign up or login with your details

Forgot password? Click here to reset