Archive TimeLine Summarization : Conceptual Framework for Timeline Generation over Historical Document Collections

2023-01-31 08:58:47

Nicolas Gutehrlé (CRIT), Antoine Doucet (L3I), Adam Jatowt

arXiv_CL

Abstract
Abstract (translated)
URL
PDF

Abstract

Archive collections are nowadays mostly available through search engines interfaces, which allow a user to retrieve documents by issuing queries. The study of these collections may be, however, impaired by some aspects of search engines, such as the overwhelming number of documents returned or the lack of contextual knowledge provided. New methods that could work independently or in combination with search engines are then required to access these collections. In this position paper, we propose to extend TimeLine Summarization (TLS) methods on archive collections to assist in their studies. We provide an overview of existing TLS methods and we describe a conceptual framework for an Archive TimeLine Summarization (ATLS) system, which aims to generate informative, readable and interpretable timelines.

Abstract (translated)

档案集现在大多通过搜索引擎接口可用,用户可以通过提出查询来检索文档。但这些集的研究可能受到搜索引擎的某些方面的影响,例如返回文档数量过多或提供的背景知识缺乏。因此,需要独立或与搜索引擎结合使用的新方法来访问这些集。在本论文中,我们提议扩展时间线摘要(TLS)方法,以协助研究档案集。我们概述了现有的TLS方法,并描述了一个旨在生成 informative、可读和理解的时间表的档案集时间线摘要(ATLS)系统的概念框架。

URL

https://arxiv.org/abs/2301.13479

PDF

https://arxiv.org/pdf/2301.13479.pdf