Tracking entities in technical procedures -- a new dataset and baselines

2021-04-15 11:16:41

Saransh Goyal, Pratyush Pandey, Garima Gaur, Subhalingam D, Srikanta Bedathur, Maya Ramanath

arXiv_AI

Abstract

We introduce TechTrack, a new dataset for tracking entities in technical procedures. The dataset, prepared by annotating open-domain articles from WikiHow, consists of 1351 procedures (e.g., "How to connect a printer") and identifies more than 1200 unique entities, with an average of 4.7 entities per procedure. We evaluate state-of-the-art models on the entity-tracking task and find that they perform well below human annotators. We also describe how TechTrack can be used to advance research on understanding procedures from temporal texts.

URL

https://arxiv.org/abs/2104.07378

PDF

https://arxiv.org/pdf/2104.07378.pdf