Retrieval of multimedia objects by fusing multiple modalities

Citació

Gialampoukidis I, Moumtzidou A, Tsikrika T, Vrochidis S, Kompatsiaris I. Retrieval of multimedia objects by fusing multiple modalities. In: ICMR'16. International Conference on Multimedia Retrieval; 2016 June 6-9; New York (NY, USA). New York: ACM; 2016. p. 359-62. DOI: 10.1145/2911996.2912068

Enllaç permanent

Descripció

Resum
Searching for multimedia objects with heterogeneous modalities is critical for the construction of effective multimedia retrieval systems. Towards this direction, we propose a framework for the multimodal fusion of visual and textual similarities, based on visual features, visual concepts and textual concepts. Our method is compared to the baseline method that only fuses two modalities but integrates all early, late, linearly weighted, diffusion and graph-based models in one unifying framework. Our framework integrates more than two modalities and high-level information, so as to retrieve multimedia objects enriched with high-level textual and visual concepts, in response to a multimodal query. The experimental comparison is done under the same memory complexity, in two multimedia collections in the multimedia retrieval task. The results have shown that we outperform the baseline method, in terms of Mean Average Precision.
Descripció
Comunicació presentada a: ICMR'16. International Conference on Multimedia Retrieval 2016, celebrat a Nova York del 6 al 9 de juny de 2016
DOI
http://dx.doi.org/10.1145/2911996.2912068
Col·leccions
Congressos (Departament de Tecnologies de la Informació i les Comunicacions)
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)

Fitxers