Abstract
This paper introduces ORD-CC32, an open research dataset derived from the recordings of the 1932 Cairo Congress of Arab Music, a historically significant collection representing diverse Arab musical traditions. The dataset includes structured metadata, melodic and rhythmic mode tags (maqam and iqa), manually labeled tonic information, and acoustic features extracted using state-of-the-art pitch detection methods. These resources support computational studies of tuning, temperament, and regional variation in Arab music. A case study using pitch histograms demonstrates the potential for data-driven analysis of microtonal differences across regions. By making the dataset openly available, we aim to enable interdisciplinary research in computational ethnomusicology, music information retrieval (MIR), cultural studies, and digital heritage preservation. ORD-CC32 is shared on Zenodo together with tools for feature extraction and metadata retrieval.
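The abstract does not spell out how the pitch histograms in the case study are computed. As a rough illustration of the general idea only, the sketch below converts a pitch track to cents relative to a labeled tonic, folds it into one octave, and bins it. The function name `tonic_normalized_histogram`, the 10-cent bin width, the octave folding, and the synthetic input values are all assumptions made for this example; none of them comes from the ORD-CC32 tools.

```python
import numpy as np

def tonic_normalized_histogram(f0_hz, tonic_hz, bin_cents=10):
    """Octave-folded pitch histogram in cents relative to a tonic.

    Hypothetical helper for illustration; bin_cents should divide 1200.
    """
    f0 = np.asarray(f0_hz, dtype=float)
    # Keep voiced frames only: drop NaN and non-positive pitch estimates.
    voiced = f0[np.isfinite(f0) & (f0 > 0)]
    # Interval above the tonic in cents (1200 cents per octave).
    cents = 1200.0 * np.log2(voiced / tonic_hz)
    # Fold every octave onto the range [0, 1200).
    folded = np.mod(cents, 1200.0)
    edges = np.arange(0, 1200 + bin_cents, bin_cents)
    counts, _ = np.histogram(folded, bins=edges)
    total = counts.sum()
    # Normalize so that the bins sum to 1 (all zeros if nothing is voiced).
    hist = counts / total if total > 0 else counts.astype(float)
    return edges[:-1], hist

# Synthetic pitch track (not dataset values): the tonic, its octave,
# one tone 355 cents above the tonic, and two unvoiced frames.
tonic = 220.0
track = [220.0, 440.0, 220.0 * 2 ** (355 / 1200), 0.0, float("nan")]
bin_starts, hist = tonic_normalized_histogram(track, tonic)
```

With histograms expressed relative to the tonic in this way, recordings in different absolute pitches become comparable, which is what makes the position of a peak between the equal-tempered minor and major third (300 and 400 cents) meaningful as a regional tuning indicator.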
Abstract (translated)
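This paper introduces ORD-CC32, an open research dataset derived from the recordings of the 1932 Cairo Congress of Arab Music, a historically important collection representing a variety of Arab musical traditions. The dataset includes structured metadata, melodic and rhythmic mode tags (maqam and iqa), manually annotated tonic information, and acoustic features extracted with state-of-the-art pitch detection methods. These resources support computational studies of tuning, temperament, and regional differences in Arab music. A case study based on pitch histograms shows the potential of data-driven analysis for studying microtonal differences across regions. By opening up this dataset, we aim to promote interdisciplinary research in computational ethnomusicology, music information retrieval (MIR), cultural studies, and digital heritage preservation. ORD-CC32 is shared on Zenodo, together with tools for feature extraction and metadata retrieval.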
URL
https://arxiv.org/abs/2506.14503