Contextualized Embeddings based Convolutional Neural Networks for Duplicate Question Identification

2021-09-03 14:30:09

Harsh Sakhrani (1), Saloni Parekh (1), Pratik Ratadiya (2) ((1) Pune Institute of Computer Technology, Maharashtra, India, (2) vCreaTek Consulting Services Pvt. Ltd., Maharashtra, India)

arXiv_CL

arXiv_CL CNN Embedding Inference Prediction Transformer Pose

Abstract

Question Paraphrase Identification (QPI) is a critical task for large-scale Question-Answering forums. The purpose of QPI is to determine whether the two questions in a given pair are semantically identical. Previous approaches to this task have yielded promising results, but have often relied on complex recurrence mechanisms that are computationally expensive and slow. In this paper, we propose a novel architecture that combines a Bidirectional Transformer Encoder with Convolutional Neural Networks for the QPI task. We produce predictions from the proposed architecture using two different inference setups: Siamese and Matched Aggregation. Experimental results demonstrate that our model achieves state-of-the-art performance on the Quora Question Pairs dataset. We show empirically that adding convolution layers to the model architecture improves the results in both inference setups. We also investigate the impact of partial and complete fine-tuning and analyze the resulting trade-off between computational cost and accuracy. Based on the obtained results, we conclude that the Matched Aggregation setup consistently outperforms the Siamese setup. Our work provides insights into which architecture combinations and setups are likely to produce better results for the QPI task.
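
The abstract names two inference setups but does not spell them out. The sketch below is one plausible reading, not the authors' implementation. It assumes that the Siamese setup encodes each question separately with shared weights and then combines the two pooled vectors, and that the Matched Aggregation setup encodes both questions as a single joined sequence. The names `toy_encoder`, `make_kernels`, `conv_max_pool`, `siamese_features`, and `matched_features` are hypothetical, the encoder is a non-contextual stand-in for a real Transformer, and the (u, v, |u - v|) combination is an assumption borrowed from common practice.

```python
import random

DIM = 8  # hypothetical embedding size, chosen only to keep the example small


def toy_encoder(tokens):
    """Stand-in for a Transformer encoder: one fixed vector per token.

    Unlike a real encoder, this is NOT contextual; it only lets the rest
    of the pipeline run without external libraries.
    """
    vectors = []
    for tok in tokens:
        rng = random.Random(tok)  # seeded by the token, so output is repeatable
        vectors.append([rng.uniform(-1.0, 1.0) for _ in range(DIM)])
    return vectors


def make_kernels(count, width, seed=0):
    """Create `count` random convolution kernels, each of shape width x DIM."""
    rng = random.Random(seed)
    return [[[rng.uniform(-1.0, 1.0) for _ in range(DIM)] for _ in range(width)]
            for _ in range(count)]


def conv_max_pool(embeddings, kernels):
    """1D convolution over the token axis, ReLU, then max-over-time pooling."""
    features = []
    for kernel in kernels:
        width = len(kernel)
        best = 0.0  # starting at 0.0 applies the ReLU floor
        for start in range(len(embeddings) - width + 1):
            window = embeddings[start:start + width]
            activation = sum(w * x
                             for k_row, e_row in zip(kernel, window)
                             for w, x in zip(k_row, e_row))
            best = max(best, activation)
        features.append(best)
    return features


def siamese_features(q1, q2, kernels):
    """Assumed Siamese setup: two separate passes through the same weights."""
    u = conv_max_pool(toy_encoder(["[CLS]"] + q1 + ["[SEP]"]), kernels)
    v = conv_max_pool(toy_encoder(["[CLS]"] + q2 + ["[SEP]"]), kernels)
    # Assumed combination: both vectors plus their element-wise distance.
    return u + v + [abs(a - b) for a, b in zip(u, v)]


def matched_features(q1, q2, kernels):
    """Assumed Matched Aggregation setup: one pass over the joined pair."""
    pair = ["[CLS]"] + q1 + ["[SEP]"] + q2 + ["[SEP]"]
    return conv_max_pool(toy_encoder(pair), kernels)


if __name__ == "__main__":
    kernels = make_kernels(count=4, width=2)
    a = "how do i learn python".split()
    b = "what is the best way to learn python".split()
    print(len(siamese_features(a, b, kernels)), len(matched_features(a, b, kernels)))
```

In either setup, the resulting feature vector would then feed a classifier that outputs a duplicate or not-duplicate label. With a real contextual encoder, the joined input lets tokens of one question attend to the other before the convolution is applied, which the toy encoder here cannot show.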

URL

https://arxiv.org/abs/2109.01560

PDF

https://arxiv.org/pdf/2109.01560.pdf