Positioning political texts with large language models by asking and averaging

dc.contributor.authorLe Mens, Gaël
dc.contributor.authorGallego, Aina
dc.date.accessioned2025-05-20T12:34:13Z
dc.date.available2025-05-20T12:34:13Z
dc.date.issued2025
dc.date.updated2025-05-20T12:34:13Z
dc.descriptionData de publicació electrònica: 27-01-2025
dc.description.abstractWe use instruction-tuned large language models (LLMs) like GPT-4, Llama 3, MiXtral, or Aya to position political texts within policy and ideological spaces. We ask an LLM where a tweet or a sentence of a political text stands on the focal dimension and take the average of the LLM responses to position political actors such as US Senators, or longer texts such as UK party manifestos or EU policy speeches given in 10 different languages. The correlations between the position estimates obtained with the best LLMs and benchmarks based on text coding by experts, crowdworkers, or roll call votes exceed.90. This approach is generally more accurate than the positions obtained with supervised classifiers trained on large amounts of research data. Using instruction-tuned LLMs to position texts in policy and ideological spaces is fast, cost-efficient, reliable, and reproducible (in the case of open LLMs) even if the texts are short and written in different languages. We conclude with cautionary notes about the need for empirical validation.
dc.description.sponsorshipThis research was funded by ERC Consolidator Grant 772268 from the European Commission to G.L.M, ICREA Academia grants to A.G and G.L.M, grants PID2021-123111OB-I00 (A.G.) and PID2022-137908NB-I00 (G.L.M.) funded by MICIN/AEI/10.13039/501100011033 and by 'ERDF/UE A way of making Europe', and the Severo Ochoa Programme for Centres of Excellence in R&D (Barcelona School of Economics CEX2019-000915-S) funded by MCIN/AEI/10.13039/501100011033.
dc.format.mimetypeapplication/pdf
dc.identifier.citationLe Mens G, Gallego A. Positioning political texts with large language models by asking and averaging. Polit Anal. 2025 Jan 27. DOI: 10.1017/pan.2024.29
dc.identifier.doihttp://dx.doi.org/10.1017/pan.2024.29
dc.identifier.issn1047-1987
dc.identifier.urihttp://hdl.handle.net/10230/70440
dc.language.isoeng
dc.publisherCambridge University Press
dc.relation.ispartofPolitical Analysis. 2025 Jan 27
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/3PE/PID2021-123111OB-I00
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/3PE/PID2022-137908NB-I00
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/772268
dc.rights© The Author(s), 2025. Published by Cambridge University Press on behalf of The Society for Political Methodology. This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.keywordLLM
dc.subject.keywordIdeology
dc.subject.keywordScaling
dc.subject.keywordText as data
dc.titlePositioning political texts with large language models by asking and averaging
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
LeMens_pa_posi.pdf
Size:
2.2 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

License

Rights