Nearest-neighbor automatic sound classification with a wordNet taxonomy

dc.contributor.authorCano Vila, Pedro
dc.contributor.authorKoppenberger, Markus
dc.contributor.authorLe Groux, Sylvain
dc.contributor.authorRicard, Julien
dc.contributor.authorWack, Nicolas
dc.contributor.authorHerrera Boyer, Perfecto, 1964-
dc.date.accessioned2019-06-27T16:39:00Z
dc.date.available2019-06-27T16:39:00Z
dc.date.issued2005
dc.description.abstractSound engineers need to access vast collections of sound efects for their film and video productions. Sound efects providers rely on text-retrieval techniques to offer their collections. Currently, annotation of audio content is done manually, which is an arduous task. Automatic annotation methods, normally fine-tuned to reduced domains such as musical instruments or reduced sound effects taxonomies, are not mature enough for labeling with great detail any possible sound. A general sound recognition tool would require first, a taxonomy that represents the world and, second, thousands of classifiers, each specialized in distinguishing little details. We report experimental results on a general sound annotator. To tackle the taxonomy definition problem we use WordNet, a semantic network that organizes real world knowledge. In order to overcome the need of a huge number of classifiers to distinguish many different sound classes, we use a nearest-neighbor classifier with a database of isolated sounds unambiguously linked to WordNet concepts. A 30% concept prediction is achieved on a database of over 50.000 sounds and over 1600 concepts.
dc.format.mimetypeapplication/pdf
dc.identifier.citationCano P, Koppenberger M, Le Groux S, Ricard J, Wack N, Herrera P. Nearest-neighbor automatic sound classification with a wordNet taxonomy. J Intell Inf Syst. 2005;24(2-3):99-111.
dc.identifier.issn0925-9902
dc.identifier.urihttp://hdl.handle.net/10230/41883
dc.language.isoeng
dc.publisherSpringer
dc.relation.ispartofJournal of intelligent information systems. 2005;24(2-3):99-111.
dc.rights© Springer The final publication is available at Springer via https://link.springer.com/content/pdf/10.1007%2Fs10844-005-0318-4.pdf
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.subject.keywordAudio identification
dc.subject.keywordWordNet
dc.subject.keywordNearest-neighbor
dc.subject.keywordEveryday sound
dc.subject.keywordKnowledge management
dc.titleNearest-neighbor automatic sound classification with a wordNet taxonomy
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/acceptedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cano_int_near.pdf
Size:
213.55 KB
Format:
Adobe Portable Document Format