Abstract
In this work, we propose a method for extracting text spans that may indicate one of the BIG5 psychological traits using a question-answering task with examples that have no answer for the asked question. We utilized the RoBERTa model fine-tuned on SQuAD 2.0 dataset. The model was further fine-tuned utilizing comments from Reddit. We examined the effect of the percentage of examples with no answer in the training dataset on the overall performance. The results obtained in this study are in line with the SQuAD 2.0 benchmark and present a good baseline for further research.
Abstract (translated)
在本研究中,我们提出了一种方法,通过一个问答任务,使用例子中没有回答的问题来提取可能表示 Big5 心理特质的文本片段。我们使用了优化了在 SQuAD 2.0 数据集上的罗BERTa 模型。模型还使用Reddit的评论进行了进一步优化。我们研究了训练数据中没有回答的例子百分比对整体表现的影响。这项研究的结果与 SQuAD 2.0 基准一致,提供了一个良好的基准,用于进一步研究。
URL
https://arxiv.org/abs/2305.14891