HMC-reliefF: feature ranking for hierarchical multi-label classification

dc.contributor.authorSlavkov, Ivica
dc.contributor.authorKarcheska, Jana
dc.contributor.authorKocev, Dragi
dc.contributor.authorDžeroski, Sašo
dc.date.accessioned2019-12-03T07:57:45Z
dc.date.available2019-12-03T07:57:45Z
dc.date.issued2018
dc.description.abstractIn machine learning, the growing complexity of the available data poses an increased challenge for its analysis. The rising complexity is both in terms of the data becoming more high-dimensional as well as the data having a more intricate structure. This emphasizes the need for developing machine learning algorithms that are able to tackle both the high-dimensionality and the complex structure of the data. Our work in this paper focuses on the development and analysis of the HMCReliefF algorithm, which is a feature relevance (ranking) algorithm for the task of Hierarchical Multi-label Classification (HMC). The basis of the algorithm is the RReliefF algorithm for regression that is adapted for hierarchical multi-label target variables. We perform an extensive experimental investigation of the HMC-ReliefF algorithm on several datasets from the domains of image annotation and functional genomics. We analyse the algorithm’s performance in terms of accuracy in a filterlike setting and also in terms of ranking stability for various parameter values. The results show that the HMC-ReliefF can successfully detect relevant features from the data that can be further used for constructing accurate predictive models. Additionally, the stability analysis helps to determine the preferred parameter values for obtaining not just accurate, but also a stable algorithm output.
dc.description.sponsorshipWe would like to acknowledge the support of the European Commission through the project MAESTRA – Learning from Massive, Incompletely annotated, and Structured Data (Grant number ICT-2013-612944) and the Human Brain Project (Grant number 604102), and the support of the Spanish Ministry of Economy and Competitiveness, “Centro de Excelencia Severo Ochoa 2013-2017”, SEV-2012-0208.
dc.format.mimetypeapplication/pdf
dc.identifier.citationSlavkov I, Karcheska J, Kocev D, Džeroski S. HMC-reliefF: feature ranking for hierarchical multi-label classification. Computer Science and Information Systems. 2018;15(1):187-209. DOI: 10.2298/CSIS170115043S
dc.identifier.doihttp://dx.doi.org/10.2298/CSIS170115043S
dc.identifier.issn1820-0214
dc.identifier.urihttp://hdl.handle.net/10230/43057
dc.language.isoeng
dc.publisherComSIS Consortium
dc.relation.ispartofComputer Science and Information Systems. 2018;15(1):187-209
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/FP7/612944
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/FP7/604102
dc.rightsPublished by ComSIS Consortium under Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (http://creativecommons.org/licenses/by-nc-nd/4.0/)
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.keywordFeature selection
dc.subject.keywordFeature ranking
dc.subject.keywordStructured data
dc.subject.keywordHierarchical multilabel classification
dc.subject.keywordReliefF
dc.titleHMC-reliefF: feature ranking for hierarchical multi-label classification
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sla_CSIS_HMC.pdf
Size:
1.23 MB
Format:
Adobe Portable Document Format

License

Rights