Adding and maintaining a new corpus on Dezrann

A corpus on Dezrann should contain scores, sets of analyses/annotations, and synchronized audio (or at least two of the three), together with appropriate metadata. As far as possible, these files should be available under open-data licenses, as described on Open Science and Licenses. We try to add each corpus in a reproducible way, as was done for the public corpora available on the platform. The data you use or create (scores, annotations, audio, metadata) must be available within a git repository or through a stable URL.

As a corpus curator/maintainer, your responsibility is mostly to prepare, update, and maintain a corpus description file, such as metadata/my-corpus.json, that gives all the information about the corpus and points to its sources. This involves the following steps.
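Since the corpus description is a JSON file, a quick syntax check before going further can save a round trip. The sketch below is only an illustration: the function name check_corpus_json is hypothetical, it assumes python3 is installed, and it checks only that the file is well-formed JSON, not that it follows the fields Dezrann expects.

```shell
#!/bin/sh
# Hypothetical helper (not part of Dezrann): check that a corpus
# description file is readable and is well-formed JSON.
check_corpus_json() {
    file=$1

    if [ ! -r "$file" ]; then
        echo "cannot read $file" >&2
        return 2
    fi

    # python3 -m json.tool exits non-zero on a syntax error
    # and prints the position of the error on stderr.
    if python3 -m json.tool "$file" >/dev/null; then
        echo "$file: well-formed JSON"
    else
        echo "$file: invalid JSON" >&2
        return 1
    fi
}
```

It could be called as check_corpus_json metadata/my-corpus.json each time the file is edited.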

Tutorial: follow the 🏝️ points

Step 1. Data/metadata preparation

(Sources data)

(Metadata)

Step 2. First build of the corpus

When the metadata/my-corpus.json file is ready:

curl -sS --request POST --url https://test-ws.dezrann.net/corpus --header 'Content-Type: multipart/form-data' --form metadata=@my-corpus.json
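When this upload is run repeatedly, it can help to wrap it so that a failed request is noticed. The following is a minimal sketch under stated assumptions: the function name upload_corpus is hypothetical, the endpoint, header, and form field are the ones from the command above, and treating any non-2xx HTTP status as a failure is an assumption, since the service's responses are not described here.

```shell
#!/bin/sh
# Hypothetical wrapper (not part of Dezrann) around the curl command above.
# Usage: upload_corpus FILE [URL]
upload_corpus() {
    metadata_file=$1
    url=${2:-https://test-ws.dezrann.net/corpus}

    if [ ! -r "$metadata_file" ]; then
        echo "cannot read $metadata_file" >&2
        return 2
    fi

    # --output sends the response body to stderr so it stays visible;
    # --write-out prints only the HTTP status code, which is captured here.
    http_code=$(curl -sS --request POST --url "$url" \
        --header 'Content-Type: multipart/form-data' \
        --form "metadata=@$metadata_file" \
        --output /dev/stderr --write-out '%{http_code}') || return 1

    # Assumption: any 2xx status means the service accepted the file.
    case $http_code in
        2??) echo "upload accepted (HTTP $http_code)" ;;
        *)   echo "upload failed (HTTP $http_code)" >&2; return 1 ;;
    esac
}
```

For instance, upload_corpus my-corpus.json sends the file to the test web service and returns a non-zero exit status if the request fails.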

Step 3. Check/curation

🏝️ Once the corpus is on Dezrann, in the sandbox (for example at https://test.dezrann.net/~/salperwick-sandbox/piece-yourname), check and curate it there.

Step 4. Publication and long-term maintenance

(Communication, maintenance)