Adversarial Sampling and Training for Semi-Supervised Information Retrieval

11/09/2018
by   Dae Hoon Park, et al.
0

Modern ad-hoc retrieval models learned with implicit feedback have two problems in general. First, there are usually much more non-clicked documents than clicked documents, and many of the non-clicked documents are not informational. Second, modern ad-hoc retrieval models are vulnerable to adversarial examples due to the linear nature in the models. To solve the problems at the same time, we propose adversarial training methods that can overcome those weaknesses. Our key idea is to combine adversarial training with adversarial sampling in order to obtain very difficult examples, which are informational and can attack the linear nature of the models. Specifically, we adversarially sample difficult training examples, and based on them, we further generate adversarial examples that are even more difficult. To make the models robust, the generated adversarial examples as well as the original training examples are then given to the models for joint optimization. Experiments are performed on benchmark data sets for common ad-hoc retrieval tasks such as Web search, item recommendation, and question answering. The proposed methods are closely compared with IRGAN, which is a recent relevant approach that employs adversarial training. Experiment results indicate that the proposed methods significantly outperform strong baselines especially for high-ranked documents, and they outperform IRGAN in NDCG@5 using only 5 search task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/21/2018

A Line in the Sand: Recommendation or Ad-hoc Retrieval?

The popular approaches to recommendation and ad-hoc retrieval tasks are ...
research
07/19/2019

Learning More From Less: Towards Strengthening Weak Supervision for Ad-Hoc Retrieval

The limited availability of ground truth relevance labels has been a maj...
research
09/05/2019

Adversarial Examples with Difficult Common Words for Paraphrase Identification

Despite the success of deep models for paraphrase identification on benc...
research
11/23/2017

A Deep Relevance Matching Model for Ad-hoc Retrieval

In recent years, deep neural networks have led to exciting breakthroughs...
research
09/14/2022

Pre-training for Information Retrieval: Are Hyperlinks Fully Explored?

Recent years have witnessed great progress on applying pre-trained langu...
research
06/07/2023

PromptAttack: Probing Dialogue State Trackers with Adversarial Prompts

A key component of modern conversational systems is the Dialogue State T...
research
09/15/2017

Certified Non-Confluence with ConCon 1.5

We present three methods to check CTRSs for non-confluence: (1) an ad ho...

Please sign up or login with your details

Forgot password? Click here to reset