Cheng, Emily; Kervadec, Corentin; Baroni, Marco
(ACL (Association for Computational Linguistics), 2023)
For a language model (LM) to faithfully model
human language, it must compress vast, potentially infinite information into relatively few
dimensions. We propose analyzing compression in (pre-trained) LMs from two points ...