A synthetic data generator for online social network graphs
- dc.contributor.author Nettleton, David F.
- dc.date.accessioned 2020-07-06T08:25:36Z
- dc.date.available 2020-07-06T08:25:36Z
- dc.date.issued 2016
- dc.description.abstract Two of the difficulties for data analysts of online social networks are (1) the public availability of data and (2) respecting the privacy of the users. One possible solution to both of these problems is to use synthetically generated data. However, this presents a series of challenges related to generating a realistic dataset in terms of topologies, attribute values, communities, data distributions, correlations and so on. In the following work, we present and validate an approach for populating a graph topology with synthetic data which approximates an online social network. The empirical tests confirm that our approach generates a dataset which is both diverse and with a good fit to the target requirements, with a realistic modeling of noise and fitting to communities. A good match is obtained between the generated data and the target profiles and distributions, which is competitive with other state of the art methods. The data generator is also highly configurable, with a sophisticated control parameter set for different “similarity/diversity” levels.
- dc.description.sponsorship This work is partially funded by the Spanish MEC (project TIN2013-49814-EXP).
- dc.format.mimetype application/pdf
- dc.identifier.citation Nettleton DF. A synthetic data generator for online social network graphs. Soc Netw Anal Min. 2016 Jul 1;6(1):44. DOI: 10.1007/s13278-016-0352-y
- dc.identifier.issn 1869-5450
- dc.identifier.uri http://hdl.handle.net/10230/45073
- dc.language.iso eng
- dc.publisher Springer
- dc.relation.ispartof Social Network Analysis and Mining. 2016 Jul 1;6(1):44
- dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/TIN2013-49814-EXP
- dc.rights © Springer. The final publication is available at Springer via http://dx.doi.org/10.1007/s13278-016-0352-y
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.subject.keyword Graphs and networks
- dc.subject.keyword Online social networks
- dc.subject.keyword Synthetic data generation
- dc.subject.keyword Topology
- dc.subject.keyword Attributes
- dc.subject.keyword Attribute-values
- dc.subject.keyword Seeds
- dc.subject.keyword Communities
- dc.title A synthetic data generator for online social network graphs
- dc.type info:eu-repo/semantics/article
- dc.type.version info:eu-repo/semantics/acceptedVersion