diff --git a/.zenodo.json b/.zenodo.json index dc67e32..17e793a 100644 --- a/.zenodo.json +++ b/.zenodo.json @@ -1,6 +1,6 @@ { "license": "CC-BY-NC-SA-4.0", - "description": "

This corpus of annotated MuseScore files has been created within the DCML corpus initiative and employs the DCML harmony annotation standard. It is one out of nine similar corpora that have been grouped together to An Annotated Corpus of Tonal Piano Music from the Long 19th Century which comes with a data report that is currently under review.

\n\n

Version 1 has been released for submitting it as part of the data report Hentschel, J., Rammos, Y., Neuwirth, M., Rohrmeier, M. (forthcoming). An Annotated Corpus of Tonal Piano Music from the Long 19th Century that accompanies nine corpora grouped under the DOI 10.5281/zenodo.7483349.

\n\n

Version 1.1 comes with a complete set of metadata and score headers. Among more accurate composition dates, the metadata now include URIs that identify the compositions in terms of the Virtual International Authority File (VIAF), Wikidata, IMSLP and MusicBrainz. The data has been re-extracted from the scores using ms3 1.1.1 . Syntax errors in the phrase brackets have been corrected.

", + "description": "

This corpus of annotated MuseScore files has been created within the DCML corpus initiative and employs the DCML harmony annotation standard. It is one out of nine similar corpora that have been grouped together to An Annotated Corpus of Tonal Piano Music from the Long 19th Century which comes with a data report that is currently under review.

\n\n

The dataset lives on GitHub (link under "Related identifiers") and is stored on Zenodo purely for conservation and automatic DOI generation for new GitHub releases. For technical reasons, we include only brief, generic instructions on how to use the data. For more detailed documentation, please refer to the dataset's GitHub page.

\n\n

What is included

\n\n

The dataset includes annotated MusicScores .mscx files that have been created with MuseScore 3.6.2 and can be opened with any MuseScore 3, or later version. Apart from that, the score information (measures, notes, harmony labels) have been extracted in the form of TSV files which can be found respectively in the folders measures, notes, and harmonies. They have been extracted with the Python library ms3 and its documentation has a column glossary for looking up the meaning of a column.

\n\n

Getting the data

\n\n

You can download the dataset as a ZIP file from Zenodo or GitHub. Please note that these automatically generated ZIP files do not include submodules, which would appear as empty folders. If you need ZIP files, you will need to find the submodule repositories (e.g. via GitHub) and download them individually.

\n\n

Apart from that, there is the possibility to git-clone the GitHub repository to your disk. This has the advantage that it allows to version-control any changes you want to make to the dataset and to ask for your changes to be included ("merged") in a future version.

", "contributors": [ { "orcid": "0000-0002-6329-7492", @@ -22,11 +22,14 @@ ], "grants": [ { - "id": "10.13039/501100001711::105216_182811" + "identifier": "dcml" + }, + { + "identifier": "epfl" } ], "upload_type": "dataset", - "version": "v1.1", + "version": "v2.0", "communities": [ { "identifier": "dcml" @@ -59,8 +62,18 @@ "related_identifiers": [ { "scheme": "url", - "identifier": "https://github.com/DCMLab/tchaikovsky_seasons/tree/v1.1", - "relation": "isSupplementTo" + "identifier": "https://github.com/DCMLab/tchaikovsky_seasons/tree/v2.0", + "relation": "references" + }, + { + "scheme": "url", + "identifier": "https://dcmlab.github.io/tchaikovsky_seasons/", + "relation": "isDocumentedBy" + }, + { + "scheme": "doi", + "identifier": "10.5281/zenodo.7483349", + "relation": "isPartOf" } ] } diff --git a/README.md b/README.md index e0a45cf..a87eb41 100644 --- a/README.md +++ b/README.md @@ -3,8 +3,18 @@ ![GitHub repo size](https://img.shields.io/github/repo-size/DCMLab/tchaikovsky_seasons) ![License](https://img.shields.io/badge/license-CC%20BY--NC--SA%204.0-9cf) +This is a README file for a data repository originating from the [DCML corpus initiative](https://github.com/DCMLab/dcml_corpora) +and serves as welcome page for both + +* the GitHub repo [https://github.com/DCMLab/tchaikovsky_seasons](https://github.com/DCMLab/tchaikovsky_seasons) and the corresponding +* documentation page [https://dcmlab.github.io/tchaikovsky_seasons](https://dcmlab.github.io/tchaikovsky_seasons) + +For information on how to obtain and use the dataset, please refer to [this documentation page](https://dcmlab.github.io/tchaikovsky_seasons/introduction). + + -* [Pyotr Tchaikovsky - The Seasons (A corpus of annotated scores)](#pyotr-tchaikovsky---the-seasons--a-corpus-of-annotated-scores-) +* [Pyotr Tchaikovsky - The Seasons (A corpus of annotated scores)](#pyotr-tchaikovsky---the-seasons-a-corpus-of-annotated-scores) + * [Version history](#version-history) * [Getting the data](#getting-the-data) * [With full version history](#with-full-version-history) * [Without full version history](#without-full-version-history) @@ -21,7 +31,7 @@ * [Questions, Suggestions, Corrections, Bug Reports](#questions-suggestions-corrections-bug-reports) * [License](#license) * [Naming convention](#naming-convention) -* [Overview](#overview) + * [Overview](#overview) # Pyotr Tchaikovsky - The Seasons (A corpus of annotated scores) @@ -30,18 +40,12 @@ This corpus of annotated [MuseScore](https://musescore.org) files has been creat the [DCML corpus initiative](https://github.com/DCMLab/dcml_corpora) and employs the [DCML harmony annotation standard](https://github.com/DCMLab/standards). It is one out of nine similar corpora that have been grouped together -to [An Annotated Corpus of Tonal Piano Music from the Long 19th Century](https://github.com/DCMLab/romantic_piano_corpus) -which comes with a data report that is currently under review. - -**Version 1** has been released for submitting it as part of the data -report `Hentschel, J., Rammos, Y., Neuwirth, M., Rohrmeier, M. (forthcoming). An Annotated Corpus of Tonal Piano Music from the Long 19th Century` -that accompanies nine corpora grouped under the DOI [10.5281/zenodo.7483349](https://doi.org/10.5281/zenodo.7483349). - -**Version 1.1** comes with a complete set of metadata and score headers. Among more accurate composition dates, the -metadata now include URIs that identify the compositions in terms of -the [Virtual International Authority File (VIAF)](https://viaf.org/), [Wikidata](https://www.wikidata.org), [IMSLP](https://imslp.org/) -and [MusicBrainz](https://musicbrainz.org/). The data has been re-extracted from the scores -using [ms3 1.1.1](https://pypi.org/project/ms3/). Syntax errors in the phrase brackets have been corrected. +to [An Annotated Corpus of Tonal Piano Music from the Long 19th Century](https://doi.org/10.5281/zenodo.7483349) +which comes with a data report that is currently in press at Empirical Musicology Review. + +## Version history + +See the [GitHub releases](https://github.com/DCMLab/tchaikovsky_seasons/releases). ## Getting the data