The Online conversation threads repository
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Gómez, Vicençca
- dc.contributor.author Kaltenbrunner, Andreasca
- dc.contributor.author Laniado, Davidca
- dc.date.accessioned 2016-05-11T17:43:34Z
- dc.date.available 2016-05-11T17:43:34Z
- dc.date.issued 2016-05-10
- dc.description Contain:/n/slashdot/nslashdot_with_comments.tgz (518M): compressed file with raw xml/ntree-YY-MM.mat (128M): conversation threads in Matlab structures format/nFields/n data : struct with fields ; post data/n id : string ; identifier of the post/n user : string ; writer of the news post/n date : double ; seconds/n topics : string ; main topics/n tree : struct with fields ; hierarchical structure of the thread/n data : comment data: id, parentid, score, user and date/n parent : index in tree(index) of the parent (-42 for the root node)/n depth : depth in the thread/n child : vector of children/n/n/barrapunto/nraw_xml.tgz (85M): compressed file with raw xml/ntree-YY.mat (22M): conversation threads in Matlab structures/n/n/wikipedia/nAllArticleTitles.csv.tar.gz (85M): compressed file with article titles/nall_comments.csv.tar.gz (1.6G): compressed file with comments/nWP_tree_raw_data_X.mat (87M): conversation threads in Matlab structures/n/nSoftware: Matlab
- dc.description.abstract This repository contains datasets with online conversation threads collected and analyzed by different researchers. Currently, you can find datsets from different news aggregators (Slashdot, Barrapunto) and the English Wikipedia talk pages. Slashdot conversations (Aug 2005 - Aug 2006) Online conversations generated at Slashdot during a year. Posts and comments published between August 26th, 2005 and August 31th, 2006. For each discussion thread: sub-domains, title, topics and hierarchical relations between comments. For each comment: user, date, score and textual content. This dataset is different from the Slashdot Zoo social network (it is not a signed network of users) contained in the SNAP repository and represents the full version of the dataset used in the CAW 2.0 - Content Analysis for the WEB 2.0 workshop for the WWW 2009 conference that can be found in several repositories such as Konect/n/nBarrapunto conversations (Jan 2005 - Dec 2008)/nOnline conversations generated at Barrapunto (Spanish clone of Slashdot) during three years. For each discussion thread: sub-domains, title, topics and hierarchical relations between comments. For each comment: user, date, score and textual content Wikipedia (2001 - Mar 2010) Data from articles discussions (talk) pages of the English Wikipedia as of March 2010. It contains comments on about 870,000 articles (i.e. all articles which had a corresponding talk page with at least one comment), in total about 9.4 million comments. The oldest comments date back to as early as 2001.ca
- dc.identifier.doi https://doi.org/10.34810/data497
- dc.identifier.uri http://hdl.handle.net/10230/26270
- dc.language.iso engca
- dc.relation Informació addicional: http://www.mbfys.ru.nl/staff/v.gomez/
- dc.relation Publicació relacionada: Gómez V, Kaltenbrunner A, López V. Statistical analysis of the social network and discussion threads in Slashdot. In: WWW '08 Proceedings of the 17th international conference on World Wide Web; 2008 April 21-25; Beijing, China. New York: ACM; 2008. p. 645-54. DOI: 10.1145/1367497.1367585
- dc.relation Publicació relacionada: Gómez V, Kappen HJ, Litvak N, Kaltenbrunner A. A likelihood-based framework for the analysis of discussion threads. World Wide Web. 2013;16(5):645-75. DOI: 10.1007/s11280-012-0162-8. http://hdl.handle.net/10230/26746
- dc.relation Publicació relacionada: Laniado D, Tasso R, Volkovich Y, Kaltenbrunner A. When the Wikipedians talk: network and tree structure of Wikipedia discussion pages. In: Proceedings of the Fifth International Conference on Weblogs and Social Media (ICWSM-11); 2011 July 17-21; Barcelona, Spain. California: The AAAI Press, [2011]. p. 177-84. http://hdl.handle.net/10230/26817
- dc.relation.isreferencedby http://hdl.handle.net/10230/26816
- dc.relation.isreferencedby http://hdl.handle.net/10230/26746
- dc.relation.isreferencedby http://hdl.handle.net/10230/26817
- dc.rights Aquest document està subjecte a una llicència Creative Commonsca
- dc.rights.accessRights info:eu-repo/semantics/openAccessca
- dc.rights.uri http://creativecommons.org/licenses/by/3.0/es/ca
- dc.subject.keyword Online conversations
- dc.subject.keyword Discussion threads
- dc.subject.keyword Slashdot
- dc.subject.keyword Barrapunto
- dc.subject.keyword Wikipedia
- dc.title The Online conversation threads repositoryca
- dc.type info:eu-repo/semantics/otherca
- dc.type Dataset