Positioning political texts with large language models by asking and averaging

Le Mens, Gaël; Gallego, Aina

Positioning political texts with large language models by asking and averaging

Mostra el registre complet Registre parcial de l'ítem

dc.contributor.author Le Mens, Gaël
dc.contributor.author Gallego, Aina
dc.date.accessioned 2025-05-20T12:34:13Z
dc.date.available 2025-05-20T12:34:13Z
dc.date.issued 2025
dc.date.updated 2025-05-20T12:34:13Z
dc.description Data de publicació electrònica: 27-01-2025
dc.description.abstract We use instruction-tuned large language models (LLMs) like GPT-4, Llama 3, MiXtral, or Aya to position political texts within policy and ideological spaces. We ask an LLM where a tweet or a sentence of a political text stands on the focal dimension and take the average of the LLM responses to position political actors such as US Senators, or longer texts such as UK party manifestos or EU policy speeches given in 10 different languages. The correlations between the position estimates obtained with the best LLMs and benchmarks based on text coding by experts, crowdworkers, or roll call votes exceed.90. This approach is generally more accurate than the positions obtained with supervised classifiers trained on large amounts of research data. Using instruction-tuned LLMs to position texts in policy and ideological spaces is fast, cost-efficient, reliable, and reproducible (in the case of open LLMs) even if the texts are short and written in different languages. We conclude with cautionary notes about the need for empirical validation.
dc.description.sponsorship This research was funded by ERC Consolidator Grant 772268 from the European Commission to G.L.M, ICREA Academia grants to A.G and G.L.M, grants PID2021-123111OB-I00 (A.G.) and PID2022-137908NB-I00 (G.L.M.) funded by MICIN/AEI/10.13039/501100011033 and by 'ERDF/UE A way of making Europe', and the Severo Ochoa Programme for Centres of Excellence in R&D (Barcelona School of Economics CEX2019-000915-S) funded by MCIN/AEI/10.13039/501100011033.
dc.format.mimetype application/pdf
dc.identifier.citation Le Mens G, Gallego A. Positioning political texts with large language models by asking and averaging. Polit Anal. 2025 Jan 27. DOI: 10.1017/pan.2024.29
dc.identifier.doi http://dx.doi.org/10.1017/pan.2024.29
dc.identifier.issn 1047-1987
dc.identifier.uri http://hdl.handle.net/10230/70440
dc.language.iso eng
dc.publisher Cambridge University Press
dc.relation.ispartof Political Analysis. 2025 Jan 27
dc.relation.projectID info:eu-repo/grantAgreement/ES/3PE/PID2021-123111OB-I00
dc.relation.projectID info:eu-repo/grantAgreement/ES/3PE/PID2022-137908NB-I00
dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/772268
dc.rights © The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Political Methodology. This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.rights.uri http://creativecommons.org/licenses/by/4.0/
dc.subject.keyword LLM
dc.subject.keyword Ideology
dc.subject.keyword Scaling
dc.subject.keyword Text as data
dc.title Positioning political texts with large language models by asking and averaging
dc.type info:eu-repo/semantics/article
dc.type.version info:eu-repo/semantics/publishedVersion

Col·leccions

Articles (Departament d'Economia)
Documents OpenAIRE (Open Access Infrastructure for Research in Europe)