Contextualized Embeddings based Convolutional Neural Networks for Duplicate Question Identification

by Harsh Sakhrani, et al.

Question Paraphrase Identification (QPI) is a critical task for large-scale Question-Answering forums. The purpose of QPI is to determine whether the two questions in a given pair are semantically identical. Previous approaches to this task have yielded promising results, but have often relied on complex recurrence mechanisms that are computationally expensive and slow. In this paper, we propose a novel architecture combining a Bidirectional Transformer Encoder with Convolutional Neural Networks for the QPI task. We produce predictions from the proposed architecture using two different inference setups: Siamese and Matched Aggregation. Experimental results demonstrate that our model achieves state-of-the-art performance on the Quora Question Pairs dataset. We show empirically that adding convolution layers to the model architecture improves the results in both inference setups. We also investigate the impact of partial and complete fine-tuning and analyze the resulting trade-off between computational cost and accuracy. Based on the obtained results, we conclude that the Matched Aggregation setup consistently outperforms the Siamese setup. Our work provides insights into which architecture combinations and setups are likely to produce better results for the QPI task.
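As a rough illustration of the Siamese setup mentioned above, the sketch below runs one shared convolutional encoder over the token embeddings of each question and compares the two pooled vectors. It is a minimal sketch under stated assumptions, not the authors' implementation: the abstract does not give filter widths, the pooling scheme, or the comparison function, so the ReLU, max-over-time pooling and cosine similarity used here are illustrative choices, and random vectors stand in for the output of a Bidirectional Transformer Encoder. The function names (`conv1d_max_pool`, `cosine`, `siamese_score`) are hypothetical. The Matched Aggregation setup is not sketched because the abstract does not describe it.

```python
import math
import random


def conv1d_max_pool(tokens, filters):
    """Slide each filter over the token vectors, apply ReLU, keep the max activation."""
    features = []
    for kernel in filters:                  # kernel: `width` weight vectors, one per window position
        width = len(kernel)
        best = 0.0                          # starting at 0 acts as the ReLU floor
        for start in range(len(tokens) - width + 1):
            window = tokens[start:start + width]
            activation = sum(
                w * x
                for k_row, t_row in zip(kernel, window)
                for w, x in zip(k_row, t_row)
            )
            best = max(best, activation)
        features.append(best)
    return features


def cosine(u, v):
    """Cosine similarity; returns 0.0 when either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def siamese_score(tokens_a, tokens_b, filters):
    """Encode both questions with the SAME filters (shared weights), then compare."""
    return cosine(conv1d_max_pool(tokens_a, filters),
                  conv1d_max_pool(tokens_b, filters))


# Assumption: random vectors stand in for contextualized token embeddings.
rng = random.Random(0)
dim, width, n_filters = 4, 2, 3
filters = [[[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(width)]
           for _ in range(n_filters)]
question_a = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(5)]
question_b = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(6)]
print(siamese_score(question_a, question_b, filters))
```

The defining property of the Siamese setup is that the two questions never interact until the final comparison, so each question's pooled vector can be computed once and reused across many pairs. A trained system would learn the filters and a decision threshold from labeled pairs rather than drawing them at random.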




