Harmony assumptions in information retrieval and social networks
| dc.contributor.author | Roelleke, Thomas | |
| dc.contributor.author | Kaltenbrunner, Andreas | |
| dc.contributor.author | Baeza Yates, Ricardo | |
| dc.date.accessioned | 2020-03-09T17:06:53Z | |
| dc.date.available | 2020-03-09T17:06:53Z | |
| dc.date.issued | 2014 | |
| dc.description.abstract | In many applications, independence of event occurrences is assumed, even if there is evidence for dependence. Capturing dependence leads to complex models, and even if the complex models were superior, they fail to beat the simplicity and scalability of the independence assumption. Therefore, many models assume independence and apply heuristics to improve results. Theoretical explanations of the heuristics are seldom given or generalizable. This paper reports that some of these heuristics can be explained as encoding dependence in an exponent based on the generalized harmonic sum. Unlike independence, where the probability of subsequent occurrences of an event is the product of the single event probability, harmony is based on a product with decaying exponent. For independence, the sequence probability is p1+1+⋯+1=pn, whereas for harmony, it is p1+1/2+⋯+1/n. The generalized harmonic sum leads to a spectrum of harmony assumptions. This paper shows that harmony assumptions naturally extend probability theory. An experimental evaluation for information retrieval (IR; term occurrences) and social networks (SN's; user interactions) shows that assuming harmony is more suitable than assuming independence. The potential impact of harmony assumptions lies beyond IR and SN's, since many applications rely on probability theory and apply heuristics to compensate the independence assumption. Given the concept of harmony assumptions, the dependence between multiple occurrences of an event can be reflected in an intuitive and effective way. | en |
| dc.description.sponsorship | This work was funded by Yahoo Labs and Barcelona Media. | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.citation | Roelleke T, Kaltenbrunner A, Baeza-Yates R. Harmony assumptions in information retrieval and social networks. Comput J. 2014 Aug 11;58(11):2982-99. DOI: 10.1093/comjnl/bxv031 | |
| dc.identifier.doi | http://dx.doi.org/10.1093/comjnl/bxv031 | |
| dc.identifier.issn | 0010-4620 | |
| dc.identifier.uri | http://hdl.handle.net/10230/43842 | |
| dc.language.iso | eng | |
| dc.publisher | Oxford University Press | |
| dc.relation.ispartof | The Computer journal. 2014 Aug 11;58(11):2982-99 | |
| dc.rights | © Oxford University Press. This is a pre-copyedited, author-produced version of an article accepted for publication in The Computer journal following peer review. The version of record Roelleke T, Kaltenbrunner A, Baeza-Yates R. Harmony assumptions in information retrieval and social networks. Comput J. 2014 Aug 11;58(11):2982-99. DOI: 10.1093/comjnl/bxv031 is available online at: https://doi.org/10.1093/comjnl/bxv031 | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
| dc.subject.keyword | Probability theory | en |
| dc.subject.keyword | Modelling of dependence assumptions | en |
| dc.subject.keyword | Harmonic sum | en |
| dc.subject.keyword | Generalized binomial probability | en |
| dc.subject.keyword | Information retrieval | en |
| dc.subject.keyword | Social networks | en |
| dc.title | Harmony assumptions in information retrieval and social networks | |
| dc.type | info:eu-repo/semantics/article | |
| dc.type.version | info:eu-repo/semantics/acceptedVersion |
Fitxers
Paquet original
1 - 1 de 1
Carregant...
- Nom:
- Roelleke_ComputJ_Harmo.pdf
- Mida:
- 820.37 KB
- Format:
- Adobe Portable Document Format
