Problems and Countermeasures in Natural Language Processing Evaluation

2021-04-20 01:35:16

Qingxiu Dong, Zhifang Sui, Weidong Zhan, Baobao Chang

arXiv_CL

arXiv_CL Pose

Abstract
Abstract (translated)
URL
PDF

Abstract

Evaluation in natural language processing guides and promotes research on models and methods. In recent years, new evalua-tion data sets and evaluation tasks have been continuously proposed. At the same time, a series of problems exposed by ex-isting evaluation have also restricted the progress of natural language processing technology. Starting from the concept, com-position, development and meaning of natural language evaluation, this article classifies and summarizes the tasks and char-acteristics of mainstream natural language evaluation, and then summarizes the problems and causes of natural language pro-cessing evaluation. Finally, this article refers to the human language ability evaluation standard, puts forward the concept of human-like machine language ability evaluation, and proposes a series of basic principles and implementation ideas for hu-man-like machine language ability evaluation from the three aspects of reliability, difficulty and validity.

Abstract (translated)

URL

https://arxiv.org/abs/2104.09712

PDF

https://arxiv.org/pdf/2104.09712.pdf