Soundata: reproducible use of audio datasets

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Fuentes, Magdalena
  • dc.contributor.author Plaja-Roglans, Genís
  • dc.contributor.author Cortès Sebastià, Guillem
  • dc.contributor.author Khandelwal, Tanmay
  • dc.contributor.author Miron, Marius
  • dc.contributor.author Serra, Xavier
  • dc.contributor.author Bello, Juan Pablo
  • dc.contributor.author Salomon, Justin
  • dc.date.accessioned 2025-05-27T11:54:50Z
  • dc.date.available 2025-05-27T11:54:50Z
  • dc.date.issued 2024
  • dc.description.abstract Soundata is an open-source Python library for working with audio datasets in a programmatic and standardized way. It removes the need for writing custom loaders and improves reproducibility by providing tools to validate data against a canonical version. It speeds up research pipelines by allowing users to quickly download a dataset, validate that the dataset is complete and correct, and load it into memory in a standardized and reproducible way. It is designed to work with bioacoustics, environmental, urban, and spatial sound datasets; to be easy to use and easy to contribute to; and to increase reproducibility and standardize the usage of sound datasets in a flexible way.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Fuentes M, Plaja-Roglans G, Cortès-Sebastià G, Khandelwal T, Miron M, Bello JP, et al. Soundata: reproducible use of audio datasets. J Open Source Softw. 2024;9(98):6634. DOI: 10.21105/joss.06634
  • dc.identifier.doi http://dx.doi.org/10.21105/joss.06634
  • dc.identifier.issn 2475-9066
  • dc.identifier.uri http://hdl.handle.net/10230/70528
  • dc.language.iso eng
  • dc.publisher Jostrans (Journal of Specialised Translation)
  • dc.relation.ispartof Journal of Open Source Software. 2024;9(98):6634
  • dc.rights Authors of papers retain copyright and release the work under a Creative Commons Attribution 4.0 International License (CC BY 4.0).
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by/4.0/
  • dc.subject.keyword Audio
  • dc.subject.keyword Environmental-sound
  • dc.subject.keyword Bioacoustics
  • dc.subject.keyword Dataset
  • dc.subject.keyword Urban-sound
  • dc.title Soundata: reproducible use of audio datasets
  • dc.type info:eu-repo/semantics/article
  • dc.type.version info:eu-repo/semantics/publishedVersion