Indian art music tonic datasets
Citation
- CompMusic. Indian art music tonic datasets [dataset]. Repositori Digital de la UPF: Barcelona; 2014. Available at: https://doi.org/10.34810/data458
Abstract
This dataset comprises 597 commercially available audio recordings of Indian art music (Hindustani and Carnatic music), each manually annotated with the tonic of the lead artist. It serves as the test corpus for developing tonic identification approaches.
Description
These datasets comprise audio excerpts and manual annotations of the tonic pitch of the lead artist for each excerpt. Each excerpt is accompanied by its editorial metadata. The datasets can be used to develop and evaluate computational approaches to automatic tonic identification in Indian art music, and they have been used in several articles mentioned below. Most of the recordings come from the CompMusic corpora of Indian art music, in which each recording is associated with a MusicBrainz identifier (MBID). Given the MBID, further information can be obtained through the Dunya API. An overview of the tonic identification datasets follows.

--- Datasets ---

Statistics for the tonic identification datasets are listed in the table below. These six datasets are used for a comparative evaluation in Gulati, S., Bellur, A., Salamon, J., Ranjani, H. G., Ishwar, V., Murthy, H. A., & Serra, X. (2014). Automatic Tonic Identification in Indian Art Music: Approaches and Evaluation. Journal of New Music Research, 43(01), 55–73. To the best of our knowledge, these are the largest datasets available for tonic identification in Indian art music. The datasets vary in audio quality, recording period (decade), the number of Carnatic and Hindustani recordings, the number of male and female singers, and the number of instrumental and vocal excerpts. For detailed information about these datasets, see Chapter 3 of this thesis (http://hdl.handle.net/10803/398984).

The audio files corresponding to these datasets are made available on request, for research purposes only.
To obtain the files, fill in the FORM (https://goo.gl/forms/kWzpCsZW8DM7noW63).

--- CompMusic Tonic Identification Datasets ---

Datasets: CM1, CM2, CM3

Features: pitch + multipitch histogram + pitch histograms

--- IITM Tonic Identification Datasets ---

Datasets: IITM1, IITM2

Features: pitch + multipitch histogram + pitch histograms

--- IISc Tonic Identification Dataset ---

Dataset: IISc

Features: pitch + multipitch histogram + pitch histograms

--- Annotation Format ---

The tonic annotations are available in both TSV and JSON formats.

TSV: <relative path to audio><tab><tonic(Hz)><tab><Carnatic or Hindustani><tab><artist_name><tab><gender of the singer><tab><vocal or instrumental>

JSON: {
 'artist': <name of the lead artist if available>,
 'filepath': <relative path to the audio file>,
 'gender': <gender of the lead singer if available>,
 'mbid': <musicbrainz id when available>,
 'tonic': <tonic in Hz>,
 'tradition': <Hindustani or Carnatic>,
 'type': <vocal or instrumental>
 }

where the keys of the main dictionary are the file paths of the audio files (the feature path is identical to the audio path except for the file extension).
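The annotation formats described above could be read along the following lines. This is a minimal sketch, not code shipped with the datasets. It assumes that each TSV row has six tab-separated columns in the order listed, and that the JSON file is a single dictionary keyed by audio file path. The function names, the sample record, and the `.pitch` extension are hypothetical and serve only as illustration.

```python
import csv
import io
import json
from pathlib import PurePosixPath

# Assumed column order, taken from the TSV description above.
TSV_FIELDS = ["filepath", "tonic", "tradition", "artist", "gender", "type"]


def read_tsv_annotations(handle):
    """Return a dict mapping audio file path -> annotation dict."""
    annotations = {}
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        record = dict(zip(TSV_FIELDS, (cell.strip() for cell in row)))
        record["tonic"] = float(record["tonic"])  # tonic is given in Hz
        annotations[record["filepath"]] = record
    return annotations


def read_json_annotations(handle):
    """Load the JSON annotations, assumed to be keyed by audio file path."""
    data = json.load(handle)
    for record in data.values():
        record["tonic"] = float(record["tonic"])
    return data


def feature_path(audio_path, extension):
    """Swap the file extension to locate the matching feature file."""
    return str(PurePosixPath(audio_path).with_suffix(extension))


# Made-up record for illustration only; it is not taken from the datasets.
sample_tsv = "CM1/audio/example.mp3\t146.83\tCarnatic\tExample Artist\tmale\tvocal\n"
tsv_annotations = read_tsv_annotations(io.StringIO(sample_tsv))
print(tsv_annotations["CM1/audio/example.mp3"]["tonic"])

# ".pitch" is a hypothetical extension; use the one found in the feature files.
print(feature_path("CM1/audio/example.mp3", ".pitch"))
```

Keying both readers by file path mirrors the JSON layout, so the two formats can be used interchangeably once loaded.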