Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization

2022-12-20 16:25:42

Lining Zhang, João Sedoc, Simon Mille, Yufang Hou, Sebastian Gehrmann, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Miruna Clinciu, Saad Mahamood, Khyathi Chandu

arXiv_CL

arXiv_CL Summarization

Abstract
Abstract (translated)
URL
PDF

Abstract

The acquisition of high-quality human annotations through crowdsourcing platforms like Amazon Mechanical Turk (MTurk) is more challenging than expected. The annotation quality might be affected by various aspects like annotation instructions, Human Intelligence Task (HIT) design, and wages paid to annotators, etc. To avoid potentially low-quality annotations which could mislead the evaluation of automatic summarization system outputs, we investigate the recruitment of high-quality MTurk workers via a three-step qualification pipeline. We show that we can successfully filter out bad workers before they carry out the evaluations and obtain high-quality annotations while optimizing the use of resources. This paper can serve as basis for the recruitment of qualified annotators in other challenging annotation tasks.

Abstract (translated)

URL

https://arxiv.org/abs/2212.10397

PDF

https://arxiv.org/pdf/2212.10397.pdf