Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression
- dc.contributor.author Gialampoukidis, Ilias
- dc.contributor.author Moumtzidou, Anastasia
- dc.contributor.author Liparas, Dimitris
- dc.contributor.author Tsikrika, Theodora
- dc.contributor.author Vrochidis, Stefanos
- dc.contributor.author Kompatsiaris, Ioannis
- dc.date.accessioned 2017-09-08T15:15:30Z
- dc.date.issued 2017
- dc.description.abstract Heterogeneous sources of information, such as images, videos, text and metadata, are often used to describe different or complementary views of the same multimedia object, especially in the online news domain and in large annotated image collections. The retrieval of multimedia objects, given a multimodal query, requires the combination of several sources of information in an efficient and scalable way. Towards this direction, we provide a novel unsupervised framework for multimodal fusion of visual and textual similarities, which are based on visual features, visual concepts and textual metadata, integrating non-linear graph-based fusion and Partial Least Squares Regression. The fusion strategy is based on the construction of a multimodal contextual similarity matrix and the non-linear combination of relevance scores from query-based similarity vectors. Our framework can employ more than two modalities and high-level information, without increase in memory complexity, when compared to state-of-the-art baseline methods. The experimental comparison is done in three public multimedia collections in the multimedia retrieval task. The results have shown that the proposed method outperforms the baseline methods, in terms of Mean Average Precision and Precision@20.
- dc.description.sponsorship This work was partially supported by the European Commission through the projects MULTISENSOR (FP7-610411) and KRISTINA (H2020-645012).
- dc.format.mimetype application/pdf
- dc.identifier.citation Gialampoukidis I, Moumtzidou A, Liparas D, Tsikrika T, Vrochidis S, Kompatsiaris I. Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression. Multimed Tools Appl. 2017;76(21):22383-403. DOI: 10.1007/s11042-017-4797-4
- dc.identifier.doi http://dx.doi.org/10.1007/s11042-017-4797-4
- dc.identifier.issn 1380-7501
- dc.identifier.uri http://hdl.handle.net/10230/32760
- dc.language.iso eng
- dc.publisher Springer
- dc.relation.ispartof Multimed Tools Appl. 2017;76(21):22383-403
- dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/645012
- dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/610411
- dc.rights © Springer. The final publication is available at Springer via https://link.springer.com/article/10.1007/s11042-017-4797-4
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.subject.keyword Multimedia retrieval
- dc.subject.keyword Non-linear fusion
- dc.subject.keyword Graph-based models
- dc.title Multimedia retrieval based on non-linear graph-based fusion and partial least squares regression
- dc.type info:eu-repo/semantics/article
- dc.type.version info:eu-repo/semantics/acceptedVersion