Learning contextual tag embeddings for cross-modal alignment of audio and tags
Loading...
Date
Document Type
Document Version
Author
Citation
Favory X, Drossos K, Virtanen T, Serra X. Learning contextual tag embeddings for cross-modal alignment of audio and tags. In: 2021 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP): proceedings; 2021 Jun 6-11; Toronto, Canada. [Piscataway]: IEEE; 2021. p. 596-600. DOI: 10.1109/ICASSP39728.2021.9414638






