DIgital Musicology Corpus Analysis Toolkit
Project description
DiMCAT
DIgital Musicology Corpus Analysis Toolkit
A Python library for processing and analyzing notated music on a very large scale. It is under heavy development and has just seen its v1.0.0 alpha release. The library is developed by the Digital and Cognitive Musicology Lab at the École Polytechnique Fédérale de Lausanne (EPFL) in Switzerland and a white paper has been published as
Hentschel, J., McLeod, A., Rammos, Y., & Rohrmeier, M. (2023). Introducing DiMCAT for processing and analyzing notated music on a very large scale. Proceedings of the 24th International Society for Music Information Retrieval Conference, 516–523. https://ismir2023program.ismir.net/poster_52.html
Installation
DiMCAT is available on PyPI and can be installed via pip:
pip install dimcat
Quickstart
DiMCAT compiles frictionless datapackages. To play around with the alpha release, we recommend downloading the package which corresponds to the DCML corpora that are currently public. The package consists of two files:
dcml_corpora.zip (data)
dcml_corpora.json (metadata)
The data package can be loaded into DiMCAT as follows:
from dimcat import Dataset
D = Dataset.from_package("dcml_corpora.datapackage.json")
Acknowledgements
Development of this software tool was supported by the Swiss National Science Foundation within the project “Distant Listening – The Development of Harmony over Three Centuries (1700–2000)” (Grant no. 182811). This project is being conducted at the Latour Chair in Digital and Cognitive Musicology, generously funded by Mr. Claude Latour.
The software project has been set up using PyScaffold 4.2.1.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.