Retrieval of multimedia objects by fusing multiple modalities
Retrieval of multimedia objects by fusing multiple modalities
Citació
- Gialampoukidis I, Moumtzidou A, Tsikrika T, Vrochidis S, Kompatsiaris I. Retrieval of multimedia objects by fusing multiple modalities. In: ICMR'16. International Conference on Multimedia Retrieval; 2016 June 6-9; New York (NY, USA). New York: ACM; 2016. p. 359-62. DOI: 10.1145/2911996.2912068
Enllaç permanent
Descripció
Resum
Searching for multimedia objects with heterogeneous modalities is critical for the construction of effective multimedia retrieval systems. Towards this direction, we propose a framework for the multimodal fusion of visual and textual similarities, based on visual features, visual concepts and textual concepts. Our method is compared to the baseline method that only fuses two modalities but integrates all early, late, linearly weighted, diffusion and graph-based models in one unifying framework. Our framework integrates more than two modalities and high-level information, so as to retrieve multimedia objects enriched with high-level textual and visual concepts, in response to a multimodal query. The experimental comparison is done under the same memory complexity, in two multimedia collections in the multimedia retrieval task. The results have shown that we outperform the baseline method, in terms of Mean Average Precision.Descripció
Comunicació presentada a: ICMR'16. International Conference on Multimedia Retrieval 2016, celebrat a Nova York del 6 al 9 de juny de 2016