Abstract
This thesis is a corpus-based, quantitative, and typological analysis of the functions of Early Slavic participle constructions and their finite competitors ($jegda$-'when'-clauses). The first part leverages detailed linguistic annotation on Early Slavic corpora at the morphosyntactic, dependency, information-structural, and lexical levels to obtain indirect evidence for different potential functions of participle clauses and their main finite competitor and understand the roles of compositionality and default discourse reasoning as explanations for the distribution of participle constructions and $jegda$-clauses in the corpus. The second part uses massively parallel data to analyze typological variation in how languages express the semantic space of English $when$, whose scope encompasses that of Early Slavic participle constructions and $jegda$-clauses. Probabilistic semantic maps are generated and statistical methods (including Kriging, Gaussian Mixture Modelling, precision and recall analysis) are used to induce cross-linguistically salient dimensions from the parallel corpus and to study conceptual variation within the semantic space of the hypothetical concept WHEN.
Abstract (translated)
本论文是对早期斯拉夫语参与词构造及其有限竞争者 ($jegda$-“当”-从句) 功能的定量和类型分析。第一部分利用对早期斯拉夫语语料在形态、依存、信息结构和词汇层次的详细语义注释,间接证明不同参与词从句及其主要有限竞争者以及构成性和默认会话推理在语料库中分布的不同潜在功能,并理解构成性和默认会话推理在语料库中参与词构造和 $jegda$-从句分布的作用。第二部分利用大量并行数据分析语言如何表达英语 $when$ 的语义空间,其范围涵盖了早期斯拉夫语参与词构造和 $jegda$-从句。概率语义图被生成,并使用统计方法(包括Kriging、高斯混合建模、精确度和召回分析)从并行语料中诱导跨语言显着维度,并研究假想概念WHEN语义空间中的概念变异。
URL
https://arxiv.org/abs/2405.01972