Topic detection using the DBSCAN-Martingale and the time operator

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Gialampoukidis, Iliasca
  • dc.contributor.author Vrochidis, Stefanosca
  • dc.contributor.author Kompatsiaris, Ioannisca
  • dc.contributor.author Antoniou, Ioannisca
  • dc.date.accessioned 2017-09-14T12:19:56Z
  • dc.date.available 2017-09-14T12:19:56Z
  • dc.date.issued 2017
  • dc.description Comunicació presentada a: The 17th Conference of the Applied Stochastic Models and Data Analysis (ASMDA), celebrada del 6 al 9 de juny de 2017 a Londres, Regne Unit.ca
  • dc.description.abstract Topic detection is usually considered as a decision process implemented in some relevant context, for example clustering. In this case, clusters correspond to topics that should be identifed. Density-based clustering, for example, uses only a density level E and a lower bound for the number of points in a cluster. As the density level is hard to be estimated, a stochastic process, called the DBSCANMartingale, is constructed for the combination of several outputs of DBSCAN for various randomly selected values of E in a predefned closed interval [0; Emax] from the uniform distribution. We have observed that most of the clusters are extracted in the interval [0; Emax=2], and moreover in the interval [Emax=2; Emax] the DBSCANMartingale stochastic process is less innovative, i.e. extracts only a few or no clusters. Therefore, non-symmetric skewed distributions are needed to generate density levels for the extraction of all clusters in a fast way. In this work we show that skewed distributions may be used instead of the uniform, so as to extract all clusters as quickly as possible. Experiments on real datasets show that the average innovation time of the DBSCAN-Martingale stochastic process is reduced when skewed distributions are employed, so less time is needed to extract all clusters.en
  • dc.description.sponsorship The first author would like to thank the Research Committee of the Aristo- tle University of Thessaloniki for awarding him the \Aristeia" postdoctoral scholarship 2016. Moreover, this work has been partially supported by the EC-funded project KRISTINA (H2020-645012).en
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Gialampoukidis I, Vrochidis S, Kompatsiaris I, Antoniou I. Topic detection using the DBSCAN-Martingale and the time operator. In: Skiadas CH, editor. Proceedings of the 17th Applied Stochastic Models and Data Analysis International Conference with the 6th Demographics Workshop ASMDA2017; 2017 June 6-9; London, UK. [London]: ISAST; 2017. p. 387-95.
  • dc.identifier.uri http://hdl.handle.net/10230/32762
  • dc.language.iso eng
  • dc.relation.ispartof Skiadas CH, editor. Proceedings of the 17th Applied Stochastic Models and Data Analysis International Conference with the 6th Demographics Workshop ASMDA2017; 2017 June 6-9; London, UK. [London]: ISAST; 2017. p. 387-95.
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/645012
  • dc.rights © 2017 by ISAST: International Society for the Advancement of Science and Technology.
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.subject.keyword DBSCAN-Martingaleen
  • dc.subject.keyword Time operatoren
  • dc.subject.keyword Skewed distributionsen
  • dc.subject.keyword Internal ageen
  • dc.subject.keyword Density-based clusteringen
  • dc.subject.keyword Innovation processen
  • dc.title Topic detection using the DBSCAN-Martingale and the time operatorca
  • dc.type info:eu-repo/semantics/conferenceObject
  • dc.type.version info:eu-repo/semantics/publishedVersion