Retrieval of multimedia objects by fusing multiple modalities

Citation

  • Gialampoukidis I, Moumtzidou A, Tsikrika T, Vrochidis S, Kompatsiaris I. Retrieval of multimedia objects by fusing multiple modalities. In: ICMR'16. International Conference on Multimedia Retrieval; 2016 June 6-9; New York (NY, USA). New York: ACM; 2016. p. 359-62. DOI: 10.1145/2911996.2912068

Permanent link

Description

  • Abstract

    Searching for multimedia objects with heterogeneous modalities is critical for the construction of effective multimedia retrieval systems. Towards this direction, we propose a framework for the multimodal fusion of visual and textual similarities, based on visual features, visual concepts and textual concepts. Our method is compared to the baseline method that only fuses two modalities but integrates all early, late, linearly weighted, diffusion and graph-based models in one unifying framework. Our framework integrates more than two modalities and high-level information, so as to retrieve multimedia objects enriched with high-level textual and visual concepts, in response to a multimodal query. The experimental comparison is done under the same memory complexity, in two multimedia collections in the multimedia retrieval task. The results have shown that we outperform the baseline method, in terms of Mean Average Precision.
  • Description

    Paper presented at: ICMR'16. International Conference on Multimedia Retrieval 2016, held in New York, June 6-9, 2016
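
As a rough illustration of two ideas named in the abstract, the sketch below shows a linearly weighted late fusion of per-modality similarity scores and the computation of Mean Average Precision. It is not the authors' framework: the function names, the three example modalities and the weights are assumptions chosen for illustration, and the diffusion and graph-based components mentioned in the abstract are not modelled.

```python
# Illustrative sketch only: all names and weights here are hypothetical,
# not taken from the paper.

def fuse_scores(score_lists, weights):
    """Weighted average of per-modality similarity scores (linear late fusion).

    score_lists: one list of similarity scores per modality, all scored
                 against the same query and covering the same items.
    weights:     one non-negative weight per modality.
    """
    if len(score_lists) != len(weights):
        raise ValueError("need exactly one weight per modality")
    n_items = len(score_lists[0])
    if any(len(scores) != n_items for scores in score_lists):
        raise ValueError("all modalities must score the same items")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return [
        sum(w * scores[i] for w, scores in zip(weights, score_lists)) / total
        for i in range(n_items)
    ]


def rank_items(fused_scores):
    """Return item indices ordered from most to least similar."""
    return sorted(range(len(fused_scores)),
                  key=lambda i: fused_scores[i], reverse=True)


def average_precision(ranked_ids, relevant_ids):
    """Average of the precision values at each rank where a relevant item occurs."""
    if not relevant_ids:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, item in enumerate(ranked_ids, start=1):
        if item in relevant_ids:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant_ids)


def mean_average_precision(runs):
    """Mean of average precision over (ranking, relevant set) pairs, one per query."""
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)
```

For example, calling `fuse_scores` with three score lists (say, one each for visual features, visual concepts and textual concepts) and passing the result to `rank_items` yields a single ranking for the query, which can then be scored with `average_precision` against a set of known relevant items. In practice the weights would be tuned on held-out queries rather than fixed by hand.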