论文标题

希腊传统和民间音乐的数据集:lyra

A Dataset for Greek Traditional and Folk Music: Lyra

论文作者

Papaioannou, Charilaos, Valiantzas, Ioannis, Giannakopoulos, Theodoros, Kaliakatsos-Papakostas, Maximos, Potamianos, Alexandros

论文摘要

在mir范围内研究代表性不足的音乐传统不仅至关重要,不仅对于开发新颖的分析工具,而且对于揭示可能对研究世界音乐有用的音乐功能。本文介绍了希腊传统和民间音乐的数据集,其中包括1570件,总结了大约80个小时的数据。该数据集包含YouTube时间戳的链接,用于检索音频和视频,以及有关仪器,地理和类型等丰富的元数据信息。该内容是从在线获得的希腊纪录片系列中收集的,学者们在演出期间以现场音乐和舞蹈表演展示了希腊的音乐传统,并讨论了有关音乐的社交,文化和音乐学方面的讨论。因此,此过程导致了有关各个方面的大量描述,例如音乐类型,原产地和乐器。此外,在录制设备方面,在严格的生产级规格下进行了录音,从而导致非常干净和均匀的音频内容。在这项工作中,除了详细介绍数据集外,我们还提出了一种基线深度学习分类方法,以识别所涉及的音乐属性。数据集,基线分类方法和模型在公共存储库中提供。还讨论了进一步完善数据集的未来指示。

Studying under-represented music traditions under the MIR scope is crucial, not only for developing novel analysis tools, but also for unveiling musical functions that might prove useful in studying world musics. This paper presents a dataset for Greek Traditional and Folk music that includes 1570 pieces, summing in around 80 hours of data. The dataset incorporates YouTube timestamped links for retrieving audio and video, along with rich metadata information with regards to instrumentation, geography and genre, among others. The content has been collected from a Greek documentary series that is available online, where academics present music traditions of Greece with live music and dance performance during the show, along with discussions about social, cultural and musicological aspects of the presented music. Therefore, this procedure has resulted in a significant wealth of descriptions regarding a variety of aspects, such as musical genre, places of origin and musical instruments. In addition, the audio recordings were performed under strict production-level specifications, in terms of recording equipment, leading to very clean and homogeneous audio content. In this work, apart from presenting the dataset in detail, we propose a baseline deep-learning classification approach to recognize the involved musicological attributes. The dataset, the baseline classification methods and the models are provided in public repositories. Future directions for further refining the dataset are also discussed.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源