Probing for referential information in language models

dc.contributor.authorSorodoc, Ionut-Teodor
dc.contributor.authorGulordava, Kristina
dc.contributor.authorBoleda, Gemma
dc.date.accessioned2021-02-03T11:00:35Z
dc.date.available2021-02-03T11:00:35Z
dc.date.issued2020
dc.descriptionComunicació presentada al 58th Annual Meeting of the Association for Computational Linguistics celebrat del 5 al 10 de juliol de 2020 de manera virtual.
dc.description.abstractLanguage models keep track of complex information about the preceding context – including, e.g., syntactic relations in a sentence. We investigate whether they also capture information beneficial for resolving pronominal anaphora in English. We analyze two state of the art models with LSTM and Transformer architectures, via probe tasks and analysis on a coreference annotated corpus. The Transformer outperforms the LSTM in all analyses. Our results suggest that language models are more successful at learning grammatical constraints than they are at learning truly referential information, in the sense of capturing the fact that we use language to refer to entities in the world. However, we find traces of the latter aspect, too.en
dc.description.sponsorshipThis project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 715154), and from the Spanish Ramón y Cajal programme (grant RYC-2015-18907). We thankfully acknowledge the computer resources at CTE-POWER and the technical support provided by Barcelona Supercomputing Center (RES-IM2019-3-0006).
dc.format.mimetypeapplication/pdf
dc.identifier.citationSorodoc IT, Gulordava K, Boleda G. Probing for referential information in language models. In: Jurafsky D, Chai J, Schluter N, Tetreault J, editors. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics; 2020 Jul 5-10; Stroudsburg, USA. Stroudsburg (PA): ACL; 2020. p. 4177-89. DOI: 10.18653/v1/2020.acl-main.384
dc.identifier.doihttp://dx.doi.org/10.18653/v1/2020.acl-main.384
dc.identifier.urihttp://hdl.handle.net/10230/46317
dc.language.isoeng
dc.publisherACL (Association for Computational Linguistics)
dc.relation.ispartofJurafsky D, Chai J, Schluter N, Tetreault J, editors. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics; 2020 Jul 5-10; Stroudsburg, USA. Stroudsburg (PA): ACL; 2020. p. 4177-89
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/715154
dc.rights© ACL, Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/)
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleProbing for referential information in language modelsen
dc.typeinfo:eu-repo/semantics/conferenceObject
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
sorodoc_amaclacl_probi.pdf
Size:
694.47 KB
Format:
Adobe Portable Document Format

License

Rights