Span-based Joint Entity and Relation Extraction with Transformer Pre-training

Abstract
Abstract (translated)
URL
PDF

Abstract

We introduce SpERT, an attention model for span-based joint entity and relation extraction. Our approach employs the pre-trained Transformer network BERT as its core. We use BERT embeddings as shared inputs for a light-weight reasoning, which features entity recognition and filtering, as well as relation classification with a localized, marker-free context representation. The model is trained on strong within-sentence negative samples, which are efficiently extracted in a single BERT pass. These aspects facilitate a search over all spans in the sentence. In ablation studies, we demonstrate the benefits of pre-training, strong negative sampling and localized context. Our model outperforms prior work by up to 5% F1 score on several datasets for joint entity and relation extraction.

Abstract (translated)

URL

https://arxiv.org/abs/1909.07755

PDF

https://arxiv.org/pdf/1909.07755.pdf