CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering

10/30/2020
by   Xiang Yue, et al.
10

Clinical question answering (QA) aims to automatically answer questions from medical professionals based on clinical texts. Studies show that neural QA models trained on one corpus may not generalize well to new clinical texts from a different institute or a different patient group, where large-scale QA pairs are not readily available for retraining. To address this challenge, we propose a simple yet effective framework, CliniQG4QA, which leverages question generation (QG) to synthesize QA pairs on new clinical contexts and boosts QA models without requiring manual annotations. In order to generate diverse types of questions that are essential for training QA models, we further introduce a seq2seq-based question phrase prediction (QPP) module that can be used together with most existing QG models to diversify their generation. Our comprehensive experiment results show that the QA corpus generated by our framework is helpful to improve QA models on the new contexts (up to 8 terms of Exact Match), and that the QPP module plays a crucial role in achieving the gain.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset