Genetic algorithm with healthy population and multiple streams sharing information for clustering
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Beg, Abul Hashem
- dc.contributor.author Islam, Md Zahidul
- dc.contributor.author Estivill-Castro, V. (Vladimir)
- dc.date.accessioned 2020-07-08T08:34:47Z
- dc.date.available 2020-07-08T08:34:47Z
- dc.date.issued 2016
- dc.description.abstract Many popular clustering techniques including K-means require various user inputs such as the number of clusters k, which can often be very difficult for a user to guess in advance. Moreover, existing techniques like K-means also have a tendency of getting stuck at local optima. As a result, various evolutionary algorithm based clustering techniques have been proposed. Typically, they choose the initial population randomly, whereas carefully selected initial population can improve final clustering results. Hence, some existing techniques such as GenClust carefully select high-quality initial population with a complexity of O(n2) which is very high. We propose a clustering technique that in addition to selecting an initial population with a low complexity of O(n), uses a number of new components including multiple streams, information exchange between neighboring streams, regular health improvement of the chromosomes, and mutation which also aims to improve chromosome health. We compare the proposed technique HeMI with five (5) existing techniques on 20 publicly available data sets in terms of two well-known evaluation criteria. We also carry out a thorough experimentation to investigate the usefulness of the new components of HeMI. Our experimental results demonstrate statistically significant superiority of HeMI over existing techniques and the effectiveness of the proposed components.en
- dc.format.mimetype application/pdf
- dc.identifier.citation Beg AH, Islam MZ, Estivill-Castro V. Genetic algorithm with healthy population and multiple streams sharing information for clustering. Knowl Based Syst. 2016 Dec 15;114:61-78. DOI: 10.1016/j.knosys.2016.09.030
- dc.identifier.doi http://dx.doi.org/10.1016/j.knosys.2016.09.030
- dc.identifier.issn 0950-7051
- dc.identifier.uri http://hdl.handle.net/10230/45082
- dc.language.iso eng
- dc.publisher Elsevier
- dc.relation.ispartof Knowledge-based systems. 2016 Dec 15;114:61-78
- dc.rights © Elsevier http://dx.doi.org/10.1016/j.knosys.2016.09.030
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.subject.keyword Data miningen
- dc.subject.keyword Clusteringen
- dc.subject.keyword K-meansen
- dc.subject.keyword Genetic algorithmen
- dc.subject.keyword Multiple streamsen
- dc.title Genetic algorithm with healthy population and multiple streams sharing information for clusteringen
- dc.type info:eu-repo/semantics/article
- dc.type.version info:eu-repo/semantics/acceptedVersion