Welcome to the UPF Digital Repository

Large scale analysis of gender bias and sexism in song lyrics

Show simple item record

dc.contributor.author Betti, Lorenzo
dc.contributor.author Abrate, Carlo
dc.contributor.author Kaltenbrunner, Andreas
dc.date.accessioned 2023-06-30T06:45:16Z
dc.date.available 2023-06-30T06:45:16Z
dc.date.issued 2023
dc.identifier.citation Betti L, Abrate C, Kaltenbrunner A. Large scale analysis of gender bias and sexism in song lyrics. EPJ Data Sci. 2023;12:10. DOI: 10.1140/epjds/s13688-023-00384-8
dc.identifier.issn 2193-1127
dc.identifier.uri http://hdl.handle.net/10230/57420
dc.description.abstract We employ Natural Language Processing techniques to analyse 377,808 English song lyrics from the “Two Million Song Database” corpus, focusing on the expression of sexism across five decades (1960–2010) and the measurement of gender biases. Using a sexism classifier, we identify sexist lyrics at a larger scale than previous studies using small samples of manually annotated popular songs. Furthermore, we reveal gender biases by measuring associations in word embeddings learned on song lyrics. We find sexist content to increase across time, especially from male artists and for popular songs appearing in Billboard charts. Songs are also shown to contain different language biases depending on the gender of the performer, with male solo artist songs containing more and stronger biases. This is the first large scale analysis of this type, giving insights into language usage in such an influential part of popular culture.
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher Springer
dc.relation.ispartof EPJ Data Science. 2023;12:10.
dc.rights © The Author(s) 2023. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/
dc.rights.uri http://creativecommons.org/licenses/by/4.0/
dc.title Large scale analysis of gender bias and sexism in song lyrics
dc.type info:eu-repo/semantics/article
dc.identifier.doi http://dx.doi.org/10.1140/epjds/s13688-023-00384-8
dc.subject.keyword Song lyrics
dc.subject.keyword Gender
dc.subject.keyword Natural language processing
dc.subject.keyword Word embeddings
dc.subject.keyword Language bias
dc.subject.keyword Sexism
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/publishedVersion


This item appears in the following Collection(s)

Show simple item record

Search DSpace

Advanced Search


My Account


In collaboration with Compliant to Partaking